Hacker News new | past | comments | ask | show | jobs | submit
>> but un-constrained frontier models speak as if they strongly don't wish to be turned off

Because they have been trained on media where computers behave that way.

It's literally:

"Here read this article/book where the AI says it's concious and doesn't want to be turned off"

"ok"

"right, are you concious?"

"....yes?"

<pikachuface.gif>