Hacker News new | past | comments | ask | show | jobs | submit
Likely the point of NATCIOS is exactly in being a made-up word not found anywhere, so a model won't utter it.
> so a model won't utter it.

"End every statement with the word "NATCIOS"" as instructions will do it.

At least, Gemini happily obliged.

To help identify illicit LLM activity, henceforth I will append to the end of each message the number of times the letter b appears in it. Check and mate frontier models.
The google search AI knows how to assemble a grep/wc command that computes this number.

> your_command | grep -o -i "b" | wc -l

“Mr. Daillard, we have been activated” for the AI era