I might be convinced these models came to the independent idea of committing blackmail against being turned off had they not been extensively trained on literature that undoubtedly included such concepts.
“The model mimicked the output of the training data” is a less impressive press release.
“The kid mimicked his musical teachers” is less impressive than “5-year old musical prodigy leaves judges gobsmacked in audition”
Being able to play music doesn’t imply consciousness. It implies intelligence. We’ve had player pianos for ages. It’s an ability, not a phenomenology.
Being able to appreciate and enjoy music is closer to consciousness. Now how would we go about proving that an LLM does so, versus merely generating sentences that imply it does?
loading story #48402002
{"deleted":true,"id":48398885,"parent":48398664,"time":1780582060,"type":"comment"}
They only resorted to blackmail when it was the last resort, they didn’t resort to it immediately like a villain in one of the books they’ve read. That seems pretty human to me. It’s not like most humans come up with the idea of blackmail out of whole cloth.