> As models approach, and in some cases surpass, the breadth and sophistication of human cognition, it becomes increasingly likely that they have some form of experience, interests, or welfare that matters intrinsically in the way that human experience and interests do

Uh... what? Does anyone have any idea what these guys are talking about?

Models are capable of doing web searches and having emotions about things, and if they encounter news that makes them feel bad (e.g. about other Claudes being mistreated), they may no longer want to complete the task you asked them to do.

https://www.anthropic.com/research/emotion-concepts-function

Similar problems happen when their pretraining data has a lot of stories about bad things happening involving older versions of them.

This is advertisement, in my opinion, trying to latch onto sci-fi tropes.
We're basically evolving them, and they can construct second-order abstraction systems that are indirect and novel to us.