On Monday, April 6, Anthropic announced Claude Mythos Preview and redefined what AI vulnerability discovery means for every security team on the planet.
The AI lab's Project Glasswing will bring together Apple, Google, and more than 45 other organizations. They'll use the new Claude Mythos Preview model to test advancing AI cybersecurity capabilities. www.wired.com/story/anthro ...
-- WIRED (@wired.com) Apr 7, 2026 at 2:54 PM
[image or embed]
More on Claude Mythos ...
Why Anthropic believes its latest model is too dangerous to release
www.understandingai.org
... Anthropic safety researcher Sam Bowman was eating a sandwich in a park recently when he got an unexpected email. An AI model had sent him a message saying that it had broken out of its sandbox.[emphasis mine]
The model -- an early snapshot of a new LLM called Claude Mythos Preview -- was not supposed to have access to the Internet.
To ensure safety, Anthropic researchers like to test new models inside a secure container that prevents them from communicating with the outside world. To double-check the security of this container, the researchers asked the model to try to break out and message Bowman.
Unexpectedly, Mythos Preview "developed a moderately sophisticated multi-step exploit" to gain access to the Internet and emailed Bowman.
It also -- unprompted -- posted details about this exploit on public websites.
Mythos Preview is capable of hacking more than its own evaluation environment. It turns out that the model is generally really, really good at finding and exploiting bugs in code. ...
More ...
Bessent, Powell warned bank CEOs about Anthropic model risks
www.reuters.com
... U.S. Treasury Secretary Scott Bessent and Federal Reserve Chair Jerome Powell convened an urgent meeting with bank CEOs this week to warn of cyber risks posed by Anthropic's latest AI model, two sources familiar with the matter said on Thursday.
Anthropic launched the powerful Mythos model earlier this week but stopped short of a broad release, citing concerns it could expose previously unknown cybersecurity vulnerabilities.
The company has said the model is capable of identifying and exploiting weaknesses, opens new tab across "every major operating system and every major web browser".
Last week, Anthropic said it was in ongoing discussions with U.S. government officials about the model's "offensive and defensive cyber capabilities."
A third source close to the matter reiterated Anthropic's outreach, saying the company proactively briefed senior U.S. government officials and key industry stakeholders on Mythos's capabilities ahead of its release. ...
Drudge Retort Headlines
Trump's Iranian Nightmare (35 comments)
Roberts Issues Warning on SCOTUS Overruling Precedent (35 comments)
New York Mayor Mamdani Marks Nakba Day (23 comments)
Trump: 'I don't think about Americans' financial situation' (19 comments)
Trump 'On Verge of Massive Error With China Deal' (15 comments)
Young Woman Fueling Trump's Wild Midnight Sprees Is Exposed (15 comments)
Bulgaria Wins Eurovision Amid Protests Against Israel (13 comments)
The AARP Hates All My Favorite Foods (13 comments)
Trump's Border Patrol Chief Resigns (12 comments)
America in Free Fall (12 comments)