Story Detail of id 47357218 | Liveview Hacker News

hrmtst938373 hours ago | on: Are LLM merge rates not getting better?

Focusing on flashy breakthroughs hides the issue that bigger models and merge benchmarks rarely translate to reliability in real codebases. For routine merges, subtle regressions and context quirks matter more than headline progress. Unless evals stress nasty scenarios like multi-file renames with tricky conflicts, the numbers are mostly for show. Progress will plateau until someone tunes for the boring, messy cases that waste dev time.

#visit	13,081,936
#session	74,665
#live-session	0