Commentary
After realising the power of frontier AI models, the US administration has shifted away from an initial hands-off posture, says this academic.
FILE PHOTO: Anthropic logo, a keyboard, and a robotic hand in this illustration taken June 5, 2026. REUTERS/Dado Ruvic/Illustration/File Photo
16 Jun 2026 05:58AM
SYDNEY: On Friday (Jun 12), artificial intelligence lab Anthropic suspended access to its latest Claude models, Fable 5 and Mythos 5, which had been released three days earlier.
The move came in response to an “export control directive” from the United States government prohibiting use of the models by anyone who is not a US national.
Mythos is Anthropic’s most powerful, or “frontier”, model. When first announcing the model in April, the company said it was too good at hacking to release immediately. Instead, Mythos was made available to a handful of organisations (mostly US tech corporations) to use to patch weaknesses in essential digital systems.
Fable is the same basic model, but with added safeguards meant to stop it being used for cybersecurity purposes. This is what was released to the public last week – and almost immediately shut down.
ANTHROPIC AND TRUMP ADMINISTRATION AT LOGGERHEADS
Since early 2025, Anthropic and the Trump administration have been in escalating conflict. The administration has accused Anthropic of making “woke AI” and called chief executive Dario Amodei an “ideological lunatic”.
Early disagreements concerned AI regulation and semiconductor export policy. The dispute sharpened when Anthropic declined to let the Pentagon use its models for domestic surveillance and fully autonomous weapons systems.
The Department of Defense responded by threatening to designate Anthropic a “ supply chain risk ”, a classification that would have required military contractors to sever ties.
JAILBREAKS
The US government has not yet publicly stated the reason for last week’s directive, but Anthropic says it believes the government became aware of a jailbreak: a method for circumventing the safeguards in Fable that prevent using its most powerful features for nefarious purposes.
These safeguards classify user requests as safe or unsafe before passing them to the AI model. When triggered, the safeguards redirect the request to a less powerful model.
The government’s concern, according to Anthropic, was that the safeguards could be bypassed to extract information useful for cyberattacks. Guardrails for large language models aren’t bulletproof. They mostly depend on the model’s own capacity to interpret the user’s intentions in making a request.
Beyond the inherent difficulty of this task, a large online community (which my colleagues and I call the Undersphere) is working hard to circumvent AI guardrails. Anthropic acknowledges that “perfect jailbreak resistance is not achievable for any current model provider”.
Anthropic says the research behind the government directive appears to have been produced by engineers at Amazon, which is both a rival to Anthropic and a significant investor.
But this was not the only relevant jailbreak. Within 48 hours of Fable’s release, a researcher using the pseudonym “Pliny the Liberator” published what they identified as Fable 5’s full system prompt to X and GitHub repository.
The system prompt is a hidden set of instructions that helps determine an AI model’s behaviour. It’s unclear exactly how knowledge of Fable’s system prompt could be used in practice, but it has drawn attention in the Undersphere.
A SURPRISE AND AN ONGOING MYSTERY
The deepest problem of making large language models such as Fable secure is that we don’t fully know how they work. According to Oxford University economist and machine learning expert Maximilian Kasy, they work much better than they “should”.
Large language models have billions of internal parameters and are trained on unimaginably vast piles of data using machine learning methods. According to Kasy, we would expect such systems to be “overfitted”: good at reproducing patterns in their training data, but bad at generalising to new situations.
However, modern systems such as Claude and ChatGPT do seem to be able to generalise. Kasy likens modern AI development to alchemy: successful through trial and error, not yet grounded in systematic theory.
As a result, the behaviour of AI models is partly opaque even to their builders.
HARD TO REGULATE
The opacity of the technology is one key reason it’s so hard to regulate. Governments lack independent access to the data, infrastructure and expertise they would need to evaluate proprietary frontier models.
The US administration’s recent executive order on AI security, published two weeks ago, reflects this realisation. As the administration has realised the power of frontier AI models, it has moved from an initial hands-off posture to asking developers to share their models for review before release.
That demand is an implicit admission that the administration does not trust the com…
Read the full article at Channel NewsAsia (CNA) →📄Source document: European Commission spokesperson
5 reports
Channel NewsAsia (CNA)Party-alignedCenter3 days ago EU cybersecurity agency to meet Anthropic on Thursday, EU Commission saysThe European Union's cybersecurity agency, Enisa, is set to meet with U.S.-based AI company Anthropic in San Francisco on Thursday. This meeting was initiated by Anthropic and arranged prior to a new U.S. export control directive requiring Anthropic to restrict access to its most advanced AI models for foreign nationals.
Bias read (Center): The article presents factual information without opinion, framing, or emphasis that suggests a political leaning. It reports on a scheduled meeting between a European agency and a U.S. company, providing context about the timing relative to a U.S. export control directive. There is no loadedlanguage
Official sources cited
- government European Commission spokesperson
Channel NewsAsia (CNA)Party-alignedLeft3 days ago Commentary: Anthropic’s retreat is a win for Chinese open-source AIThis commentary discusses the impact of Anthropic's decision to restrict access to its AI models Mythos and Fable due to U.S. government restrictions. It highlights how this move has benefited Chinese open-source AI companies like Zhipu, which quickly launched its own advanced AI model in response.
Bias read (Left): The article frames the U.S. government's actions as restrictive and emphasizes the benefits of Chinese open-source AI development. It uses phrases such as 'deeply regrettable' and quotes Zhipu's co-founder advocating for science to be global, suggesting a critical view of U.S. policies and support 4
Channel NewsAsia (CNA)Party-alignedCenter4 days ago France's OVHcloud plans frontier AI models to become Europe's second LLM playerOVHcloud, Europe's largest cloud provider, announced plans to develop frontier AI models to challenge Mistral as a major player in the European AI market. The company's CEO stated that advancements in chip technology, training methods, and synthetic data have reduced the costs of developing such models. OVHcloud aims to create a range of specialized models rather than a single system and emphasized it will not use customer data for training.
Bias read (Center): The article provides a factual summary of OVHcloud's strategic announcement without apparent ideological framing. It quotes the CEO directly and discusses technical and economic factors without taking a stance or emphasizing any particular political perspective.
Channel NewsAsia (CNA)Party-alignedCenter5 days ago India's Wipro opens AI center for Anthropic's Claude in BengaluruIndia's Wipro has established a Center of Excellence (CoE) for applied AI focused on Anthropic's Claude models at its Bengaluru hub. This initiative aims to enhance Wipro's capacity to adopt enterprise AI solutions, develop AI-based platforms, and integrate AI into various departments such as finance, human resources, and sales. Wipro plans to train 10,000 employees in using Anthropic's Claude over the next 18 months. Analysts note that AI could impact Wipro's revenue growth but also expand its market potential. Concerns exist regarding AI's potential disruption to India's labor-intensive IT行业
Bias read (Center): The article provides factual information about Wipro's new AI center without taking a stance on political issues. It focuses on technological developments and business strategies, avoiding any ideological or partisan framing.
Official sources cited
- organisation Wipro's announcement
- organisation Jefferies analysts' comments
Channel NewsAsia (CNA)Party-alignedCenter5 days ago Commentary: Why the US government shut down Anthropic’s latest Claude AI modelAnthropic, an artificial intelligence lab, suspended access to its latest Claude models, Fable 5 and Mythos 5, following a U.S. government export control directive restricting their use to non-U.S. nationals. The directive comes amid ongoing tensions between Anthropic and the Trump administration, which has criticized the company for producing 'woke AI' and engaged in legal battles over the models' capabilities and potential risks.
Bias read (Center): The article presents facts about Anthropic's actions, the U.S. government's export controls, and the broader conflict with the Trump administration without overtly favoring either side. It includes direct quotes and references to official actions without editorializing or biased language.
Official sources cited
- organisation Anthropic's announcement of the Claude models
- government U.S. government export control directive