I use Claude Opus (4.5, 4.6) all the time and catch it making making subtle mistakes, all the time.
Are you really being more productive (let’s say 3x times more), or just feel that way because you are constantly prompting Claude?
Maybe I’m wrong, but I don’t buy it.
I guess that's why Claude Code has 0 open issues on Github. Since software engineering is solved, their autonomous agents can easily fix their own software much better and faster than human devs. They can just add "make no mistakes" to their prompt and the model can solve any problem!
Oh wait, they have 5,000+ open issues on Github[1]. I'm yet to be convinced that this is a solved problem
Sincere question, how do beginners to the field (interns, juniors) do this when they don't have any best practices yet?
It's harder and harder to detect sarcasm these days but in case you're being serious, I've tested a similar setup and I noticed Claude produces perfectly plausible code that has very subtle bugs that get harder and harder to notice. In the end, the initial speedup was gone and I decided to rewrite everything by hand. I'm working on a product where we need to understand the code base very well.
If AI really is all that, then whatever "special" thing you are doing will be automated as well.
You from 2 months ago:
>LLMs are great coders, but subpar developers". https://news.ycombinator.com/item?id=46434304
Interesting. That's a lot of progress in 2 months!
I'll believe it when AI can tell me when a project will be done. I've asked my developer friends about this and I get a blank stare, like I'm stupid for asking.