Story Detail of id 48500620 | Liveview Hacker News

ElFitz12 hours ago | on: If you are asking for human attention, demonstrate human effort

I’ve been making Codex and Claude get their work reviewed by most recent best performing model of their own family, and each other’s, for months.

On top of that, we have been running multi-model AI reviews on every PR through their respective GitHub integrations (Codex, Gemini, Copilot, Greptile, CodeRabbit).

They never fully overlap, and yet they somehow usually all miss the same things. The most significant improvement came from having agents commit their plan along with their work.

On the upside, it means I get to focus my reviews on different things.

#visit	13,784,722
#session	74,665
#live-session	0