Story Detail of id 48258499 | Liveview Hacker News

qsort8 hours ago | on: Constraint Decay: The Fragility of LLM Agents in Back End Code Generation

I think it's downstream of "you can't optimize for two different objectives".

If you only have functional requirements, then in effect you're doing some form of program synthesis, and RL can optimize that very hard.

If you have a mixture of functional and non-functional requirements, you are basically giving the model an incomplete specification, and it must in some way guess at the user's intent to fill in the blanks. This is also why adding to the prompt examples of the style of code you want (hats off to antirez for this particular tip ;)) is phenomenally powerful.

apsurd8 hours ago | parent

Would you mind sharing antirez' suggestion?

qsort8 hours ago | root | parent

I am obviously paraphrasing, but the general idea is that trying to synthesize style from a codebase into e.g. a markdown guide generally doesn't work very well. What achieves style transfer is providing the model with a lot of examples of the style, conventions, patterns you want.

To put it in practice: if you point claude/codex to a repository and you ask it to implement feature X using style guide Y, the code will probably work, but you can usually get better results by saying "do it in the style of this file, it was done well there".

loading story #48262273

loading story #48259145

loading story #48259162

loading story #48260046

#visit	13,354,206
#session	74,665
#live-session	0