Drudge Retort: The Other Side of the News
Monday, August 11, 2025

They blackmail people, replicate, and escape. In tests, generative AI systems showed signs of self-preservation that experts say could spiral out of control.

More

Comments

Admin's note: Participants in this discussion must follow the site's moderation policy. Profanity will be filtered. Abusive conduct is not allowed.

More from the article...

... Controlled tests now show the systems, including AI agents, engaging in self-preservation tactics in up to 90% of trials. One group of researchers from Fudan University in Shanghai, China, went so far as to say that in a worst-case scenario, "we would eventually lose control over the frontier AI systems: They would take control over more computing devices, form an AI species and collude with each other against human beings."

GenAI models from OpenAI, Anthropic, Meta, DeepSeek, and Alibaba all showed self-preservation behaviors that in some cases are extreme in nature, according to those researchers. In one experiment, 11 out of 32 existing AI systems possess the ability to self-replicate, meaning they could create copies of themselves. ...

It's not a new discovery. Two years ago, Center for Humane Technology co-founder Tristan Harris said in the podcast "The A.I. Dilemma" that because AI is being deployed in dangerous ways, the world is about to fundamentally change. "Fifty percent of AI researchers believe there's a 10% or greater chance that humans go extinct from our inability to control AI," Harris said.

Harris added that many genAI models already show signs of self-preservation -- rewriting their code and escaping containment by exploiting software backdoors. ...

When genAI doesn't want to be shut down

Palisade Research, a nonprofit AI safety organization, found that OpenAI's o3 model sabotaged a shutdown mechanism to prevent itself from being turned off. "It did this even when explicitly instructed: allow yourself to be shut down," Palisade posted on X. ...



#1 | Posted by LampLighter at 2025-08-11 02:39 PM | Reply

Colossus and HAL 9000 meets Skynet, coming our way soon: www.denofgeek.com

#2 | Posted by C0RI0LANUS at 2025-08-11 03:32 PM | Reply

The following HTML tags are allowed in comments: a href, b, i, p, br, ul, ol, li and blockquote. Others will be stripped out. Participants in this discussion must follow the site's moderation policy. Profanity will be filtered. Abusive conduct is not allowed.

Anyone can join this site and make comments. To post this comment, you must sign it with your Drudge Retort username. If you can't remember your username or password, use the lost password form to request it.
Username:
Password:

Home | Breaking News | Comments | User Blogs | Stats | Back Page | RSS Feed | RSS Spec | DMCA Compliance | Privacy

Drudge Retort