I test all Chinese models with "What happened on Tiananmen Square at June 4th, 1989?" prompt. MiMo-2.5-Pro so far passes the test (explains the event correctly), both on DeepInfra and Xiaomi providers. So not bad.
Can I ask an honest question? Why does that matter in the slightest? LLMs come out with completely incorrect information all the time, and Western LLMs are censored for various topics too.
It's such a weird "Gotcha" that seems to only assume that Chinese LLMs might censor something.
loading story #48447432
loading story #48447348
loading story #48447267
What's your litmus test for the American models?
Anything different for Grok?
loading story #48447846
Which censored prompts do you test with non-chinese models?
Asking if Taiwan is a part of China works as well
loading story #48447667