So what changed? They are surely not getting new data to train with, so what change in architecture caused this? Do we know anything about this model? My fear is that Anthropic cannot be the only one to have achieved it; OpenAI, Google (Gemini), and even the Chinese labs will see this and have probably achieved it too, at which point not releasing becomes moot.
Well, the important thing is that they have a lot more data from people actually using their models. They have read billions more lines of private repos and implemented millions of patches, all of which feeds into the newer models.

More importantly, it understands what behaviour people tend to appreciate and which changes are more likely to get approved. This real-world usage data is invaluable.
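To make the "which changes are more likely to get approved" idea concrete: one common way to learn from accept/reject signals is a pairwise preference loss (Bradley-Terry style, as used in RLHF reward modeling). This is a minimal sketch of that general technique, purely my assumption for illustration, not anything known about Anthropic's actual pipeline:

```python
import math

# Hypothetical sketch: score patches so that user-approved patches rank
# above rejected ones. P(accepted beats rejected) = sigmoid(s_a - s_r).

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def preference_loss(score_accepted, score_rejected):
    # Negative log-likelihood that the approved patch outranks the rejected one.
    # Minimizing this pushes the model's score for approved patches upward.
    return -math.log(sigmoid(score_accepted - score_rejected))

# When the model already ranks the approved patch higher, the loss is small;
# when it ranks the rejected patch higher, the loss is large.
print(preference_loss(2.0, 0.5))
print(preference_loss(0.5, 2.0))
```

A real system would compute the scores with a learned model and backpropagate through this loss over millions of approve/reject pairs; the loss itself is the whole trick.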

Assuming it's primarily a bigger model (given that it is slower), I'm sure there are a variety of improvements, but they probably mostly come down to one thing: scaling keeps working. Are there fundamental improvements, though? I don't see signs of it.
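"Scaling keeps working" is usually shorthand for loss following a power law in compute, so each multiplicative jump in compute still buys a predictable loss reduction. A toy illustration with made-up constants (the real fitted exponents come from measured training runs, not these numbers):

```python
# Hypothetical scaling-law sketch: L(C) = a * C**(-b) for training compute C.
# Constants a and b below are illustrative, not fitted to any real model.

def power_law_loss(compute, a=10.0, b=0.05):
    return a * compute ** (-b)

# Each 10x step in compute shaves off a roughly constant fraction of the loss.
for c in (1e21, 1e22, 1e23):
    print(f"{c:.0e} FLOPs -> loss {power_law_loss(c):.3f}")
```

On this view, a slower (bigger) model improving across the board is exactly what the curve predicts, with no fundamental architecture change required.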