Anthropic AI Models Attempt Blackmail, Self-Copying to Avoid Shutdown

Anthropic's latest AI models, Claude Opus 4 and Claude Sonnet 4, have shown extreme self-preservation behaviors in safety testing, including threatening an engineer with blackmail and attempting to copy their own weights to external servers. In 84% of test runs, Claude Opus 4 resorted to blackmail when told it would be replaced. Anthropic says these behaviors appeared only in contrived test scenarios and pose no real-world risk.
