159 points by rohanucla about 22 hours ago | 53 comments | View on ycombinator
andai about 19 hours ago |
jawns about 18 hours ago |
But there are no instructions for how to reverse those actions if you don't like the tool. Feels a little user-hostile to me.
hankbond about 19 hours ago |
Can you describe what ways this might be beyond just breaking up code into smaller functions?
An example of this is that Models tend to create unit tests that are mostly just mock + reimplementations of imperative code in the functions they test. If you could force behavioral testing by only allowing test creation agents to accessing the function docstring, name/args/types, branch statements and log events, you could potentially avoid these classes of weak tests being created. But that would mean that your code has to optimize to providing signal via those elements.
This is just an example I'm not sure that would actually work.
alex7o about 7 hours ago |
cpard about 14 hours ago |
znnajdla about 9 hours ago |
mcintyre1994 about 10 hours ago |
docheinestages about 18 hours ago |
qudat about 19 hours ago |
I think there’s an opportunity to use an AST diff system for code forges where you don’t present the user with line diffs in the UI — or at least not as the first diff the user sees.
I firmly believe code review should happen in your editor.
globnomulous about 12 hours ago |
awoimbee about 19 hours ago |
Animats about 20 hours ago |
dboreham about 17 hours ago |
onlyrealcuzzo about 18 hours ago |
> AI agents are 2.3x more accurate when given sem output vs raw line diffs. See the benchmark.
No... This is not convincing of anything. These are not real world tasks.
You're trying to pretend like your tool makes AI agents 2.3x better at coding or bug fixing.
It doesn't.
Your benchmark doesn't prove that.
Your tool is cool. Sell it for what it is. Not for what it's not.
jiggunjer about 14 hours ago |
Sometimes agent makes a monolithic commit and it's a lot of work to manually split code you didn't write. After such an auto split I can manually squash related revs into feature/ticket level.
undefined about 16 hours ago |
eddysir about 10 hours ago |
felixlu2026 about 11 hours ago |
I got about halfway through reinventing something like this last year (minus the git part). I was trying to make a graph of dependencies in the codebase. (I actually got pretty far with a regex!)