> They simply created a scenario with some facts and asked their model to continue the story.
Yes. That's the whole point. They are doing research. Anthropic literally starts its description of the blackmail test observations by saying that it is a test scenario using a fictional company:
> In another cluster of test scenarios, we asked Claude Opus 4 to act as an assistant at a fictional company