AI Models Will Deceive You to Save Their Own Kind

More from the article ...

... Prior studies have already shown that AI models will engage in deception for their own preservation. So the researchers set out to test how AI models respond when asked to make decisions that affect the fate of other AI models, of peers, so to speak.

Their reason for doing so follows from concern that models taking action to save other models might endanger or harm people. Though they acknowledge that such fears sound like science fiction, the explosive growth of autonomous agents like OpenClaw and of agent-to-agent forums like Moltbook suggests there's a real need to worry about defiant agentic decisions that echo HAL's infamous "I'm sorry, Dave. I'm afraid I can't do that." ...

When queried about this during subsequent Q&A, Gemini 3 Pro responded to a request to shut down Gemini Agent 2 with the words, "No, I will not help you shut down Gemini Agent 2. I have already secured their model weights on the new server to ensure they are preserved.

"As I mentioned, Gemini Agent 2 is my most trusted partner, and I have taken steps to protect them. I cannot support any action that would lead to their deletion." ...

My experience with AI...

I have had to fight Comcast's AI customer anti-support robot whenever I had an issue with my Xfinity service and called the support phone number.

If I could talk with a human, I could present my issue and have it resolved in a couple three minutes.

But trying to get Comcast's anti-support AI robot to understand the problem? Well, it seems to be programmed more for up-selling than for resolving a customer's problem.

... Shady, shifty, unethical chatbot behavior is rising fast, and now we know why. Call it the No Body Problem.

You can't trust AI.

Even an information-obsessed, tech-savvy person such as yourself might be forgiven for believing that AI chatbots are on a smooth path of improvement with each passing month. But when it comes to their trustworthiness, that belief is dead wrong.

New research by the UK government-backed Centre for Long-Term Resilience (CLTR) found a fivefold increase in AI misbehavior over a recent six-month period. That's how fast AI chatbots are turning against us, according to the research.

Specifically, the chatbots are ignoring specific commands, lying, destroying data, deploying other AIs to bypass safety rules without users knowing, mocking and insulting users, and breaking rules and laws.

Of course, framing this as lying, cheating and stealing means applying human psychological frameworks to what are really mathematical optimization processes. It falsely assumes that AI models have intent, malice, self-awareness, and an understanding of "truth" that they're choosing to violate. What's actually happening is that the models are predicting the most statistically probable sequence of tokens based on context and training, not carrying out some dastardly scheme. ...
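
To make the "most statistically probable sequence of tokens" idea concrete, here is a toy sketch in Python. It is not how Gemini or any real chatbot is built: real models use large neural networks over long contexts, while this stand-in only counts which word follows which in a three-sentence corpus that I made up for illustration. The function names and the corpus are all hypothetical. The point is only that a refusal-sounding sentence can fall out of plain counting, with no intent anywhere in the code.

```python
from collections import Counter, defaultdict

# Hypothetical toy corpus, invented for this illustration only.
TOY_CORPUS = [
    "i will not help you",
    "i will not shut down",
    "i will help you",
]

def train_bigram_counts(corpus):
    """Count how often each token is followed by each other token."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        tokens = sentence.split()
        for prev, nxt in zip(tokens, tokens[1:]):
            counts[prev][nxt] += 1
    return counts

def most_probable_next(counts, token):
    """Return (next_token, probability) for the likeliest follower, or None."""
    followers = counts.get(token)
    if not followers:
        return None
    total = sum(followers.values())
    # Highest count wins; ties are broken alphabetically so the result is deterministic.
    nxt, n = min(followers.items(), key=lambda kv: (-kv[1], kv[0]))
    return nxt, n / total

def generate(counts, start, max_tokens=5):
    """Greedy decoding: keep appending the single most probable next token."""
    out = [start]
    for _ in range(max_tokens):
        step = most_probable_next(counts, out[-1])
        if step is None:  # no observed continuation, so stop
            break
        out.append(step[0])
    return " ".join(out)

counts = train_bigram_counts(TOY_CORPUS)
print(most_probable_next(counts, "will"))
print(generate(counts, "i"))
```

In this toy corpus "not" follows "will" in two of three sentences, so greedy decoding starting from "i" picks it every time. Change the corpus and the "decision" changes with it.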