Anthropic's safety warnings may have just backfired - the government has pulled the plug on its most powerful AI

TechCrunch business

Key Points:

  • Citing national security concerns, the U.S. government ordered Anthropic to immediately disable access to its two most powerful AI models, Claude Fable 5 and Claude Mythos 5. The order affects all users worldwide.
  • Mythos 5, known for its exceptional ability to detect security vulnerabilities, was only shared with about 50 vetted organizations for defensive cybersecurity, while Fable 5 was a safer, publicly released version with guardrails against high-risk outputs.
  • The government’s directive is framed as an export control action but appears linked to a claimed jailbreak of Fable 5, which Anthropic disputes, stating the evidence is verbal and the capability is already available in other public AI models.
  • Anthropic argues that its independent classifier safeguards block dangerous outputs even when a prompt gets the model past its refusals, and it believes the government’s action could stifle AI innovation if applied industry-wide.
  • The move disrupts Anthropic’s business at a critical time as it plans an IPO and highlights the irony that its cautious approach to AI safety has attracted government scrutiny.
