Story Detail of id 48502328 | Liveview Hacker News

simonw 11 hours ago | on: Claude Fable is relentlessly proactive

My experience has been the exact opposite.

As the models get better you need to know more about their capabilities, because otherwise you risk prompting Claude Fable 5 like it's GPT-4o and complaining loudly about how it's all hype and nothing about these models is improving at all (yes, I do see people say that.)

Getting the best results out of these models requires skill, experience, intuition, and domain expertise. There's always room for improving every one of those.

Terretta 9 hours ago

The new benchmark for LLMs is how much of simonw's new know-how is required.

Lower bars are better.

risyachka 2 hours ago

>> Getting the best results out of these models requires skill, experience, intuition, and domain expertise.

Domain expertise has nothing to do with LLMs. On the contrary, to have it you need to avoid LLMs.

>>you risk prompting Claude Fable 5 like it's GPT-4o

That's fine, because when GPT came out you had to treat it like a baby. With GPT-2 and around that time, "prompt engineering" was a thing.

Now it's all dead.

After Opus 4.8 all you have to do is say "fix it" or add /plan. All that time spent on learning previous models is time wasted.

And in a year or two, with a developed harness, you will be out of the loop: errors come in, and the LLM fixes them or adds new features based on some transcripts, etc.

Even if model development stops now, there is nothing to learn, really. Sure, you may need to adjust your prompt style a bit. You will do it naturally, just like when you communicate with a new person. There is no "knowledge" to it; the model is very smart.

isaacaggrey 8 hours ago

I agree, but this particular example showed nothing about leveraging skill, experience, or intuition. If anything, this is another straightforward example of a one-shot ask.

edit: that said, I understand this particular post is about model capability

ViscountPenguin 11 hours ago

Eh, I've had the exact opposite experience.

Way back before instruct models it was pretty difficult, but for the last couple of years I haven't needed anything more complex than the type of text that I might send in a detailed email to a colleague.

philipwhiuk 11 hours ago

Isn't the whole point of a better model that it should be better at understanding you than the previous one? So the same prompt should return a better answer.

Prompting differently to the new model seems entirely backwards when trying to determine if the model has improved.
