Story Detail of id 48396163 | Liveview Hacker News

le-mark14 hours ago | on: Uber's $1,500/month AI limit is a useful signal for AI tool pricing

> Once a model is open-weight, safeguards that do exist can be removed

Safeguards trained into the model (ie exist in the weights) can’t be removed.

You don't have to remove the safeguards if you can prompt your way around them.

There's a subreddit for people wanting to sex-talk to various models. It just so happens that the same prompt they use to 'jailbreak' SOTA models for sex talks also works if you want to have model write malware, or tell you how to design a highly illegal device.

regularfry10 hours ago | parent

Search for "heretic"+Gemma/qwen/DeepSeek for examples where exactly this has been done.

#visit	13,572,154
#session	74,665
#live-session	0