Story Detail of id 48258177 | Liveview Hacker News

emp173449 hours ago | on: Constraint Decay: The Fragility of LLM Agents in Back End Code Generation

If it’s not easily verifiable, LLMs aren’t good at it.

I think that’s mostly because they get so much more of that reinforcement learning - since it is so economical. I dont know if there is any evidence of a fundamental reason they can’t be just as good at other tasks, but it might be economically infeasible for awhile yet.

mjburgess8 hours ago | root | parent | next

No one is curating vast amounts of data for them in other domains. Programmers send programs with fixes

loading story #48259384

emp173446 hours ago | root | parent

RLVR doesn’t work for unverifiable tasks, so they won’t be able to effectively use tools to boost reliability for those tasks.

dominotw2 hours ago | parent

but what does it mean to be good at something that cant be verified. how do you know that they are not good at it, you are obviously using some measure.

sounds like an oxymoron of a claim.

maxbond57 minutes ago | root | parent

It means having taste. People say Picasso was a great painter, but that cannot be verified (at least, not in the sense of a verified reward).

#visit	13,354,196
#session	74,665
#live-session	0