Trump administration officials tell WIRED that if Anthropic wants to rerelease Fable 5, it will need to ensure the model's guardra…
The White House is pressuring Anthropic to eliminate all "jailbreaks" in its Fable 5 language model, a demand security experts deem technically infeasible. This directive, stemming from concerns over potential misuse of advanced AI, highlights the growing tension between government oversight and the inherent limitations of current AI safety mechanisms.
The issue is critical as it directly impacts the deployment of powerful LLMs like Fable 5, affecting not only Anthropic but also setting a precedent for other major AI developers such as OpenAI and Google. The inability to guarantee complete immunity from jailbreaking could significantly slow or alter the trajectory of AI release and regulation, forcing a re-evaluation of what constitutes sufficient safety.
Future scrutiny will focus on how Anthropic (or any other company) responds to such stringent, potentially unachievable, mandates. The key question is whether the administration will accept robust mitigation strategies, or if the pursuit of absolute safety will lead to a de facto moratorium on certain model capabilities, impacting the broader ecosystem of AI research and application.