Commerce Secretary Howard Lutnick sent Anthropic CEO Dario Amodei a letter outlining the restrictions, marking the first time Washington has blocked an AI model release on national security grounds.
The directive followed warnings from Amazon researchers who flagged a jailbreak bypassing Fable's safeguards to elicit dual-use cyber capabilities. A person close to the White House told Semafor that Amazon flagged the jailbreak to the government, and that Amazon CEO Andy Jassy had been in contact with the administration about it. Fable 5 scored 53.3% on Humanity's Last Exam benchmark, compared to Claude Opus 4.8's 45.7%, and possesses capabilities similar to Claude Mythos Preview—a model Anthropic deemed too dangerous for general release in April. Mythos is understood to currently be in use by the NSA for offensive cyber operations, according to Tom's Hardware.
The export control directive required restricting access for all foreign nationals, whether inside or outside the United States, including Anthropic's own foreign-born employees. Given the scope of the directive, Anthropic argued it had no choice but to disable the models for all users. The company received the order at 5:21pm ET on 12 June and had to "abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance." Access to other Claude models, including Opus 4.8, remained unaffected.
Anthropic contested the action, arguing that the jailbreak technique "essentially consists of asking the model to read a specific codebase and fix any software flaws," and that a demonstration surfaced previously known, minor vulnerabilities also discoverable by other publicly available models, including OpenAI's GPT-5.5. The company maintained its safeguards are substantially more effective than those of any previously deployed model, and that perfect jailbreak robustness is currently impossible. Anthropic wrote: "We disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions of people." CNN reported that Anthropic argued the standard would halt all new frontier model deployments across the AI industry.
The dispute unfolded against a backdrop of prior tensions. Earlier this year, the Department of Defense declared Anthropic a supply chain risk—a designation historically applied to foreign adversaries—following the collapse of talks between the two sides. The label obligates defense contractors to certify they are not using Claude in military work. White House AI adviser David Sacks said the administration issued the export control "reluctantly" after Anthropic refused to fix the flaw or pull the model, that it wants the restriction lifted once the jailbreak is patched, and that "the ball is in Anthropic's court." The action signals Washington's willingness to invoke emergency export controls to intervene in frontier AI deployment when national security concerns emerge, setting a precedent that could reshape how American labs release models globally.
On 12 June 2026, the US Commerce Department ordered Anthropic to cut off foreign access to its most capable models, Fable 5 and Mythos 5, using export-control authority. Commerce Secretary Howard Lutnick sent the directive directly to Anthropic CEO Dario Amodei at 5:21pm ET, prohibiting access by any foreign national whether inside or outside the United States—including the company's own foreign-born employees.
The order marks the first known case of a commercially deployed AI model being halted through direct federal intervention. Anthropic responded by disabling both models for all customers worldwide, citing the technical and legal impossibility of filtering users by nationality in real time across cloud platforms including AWS Bedrock, Google Cloud, and Microsoft Foundry. The models had been publicly available for just three days before the shutdown. Access to all other Anthropic models remains unaffected.
The directive came ten days after the White House established a voluntary framework for pre-release review of frontier models, rather than mandatory licensing. According to Anthropic's statement, the letter provided no specific technical details of the national security concern. The company said its understanding was that the government believed it had become aware of a jailbreak technique—a method of bypassing Fable 5's safeguards designed to prevent access to the cybersecurity capabilities of the underlying Mythos model. Anthropic reviewed a demonstration and said it identified only a small number of previously known, minor vulnerabilities, and that the same jailbreak could be used on other publicly available models, including OpenAI's GPT-5.5, which are not subject to similar controls.
David Sacks, a Trump administration adviser, claimed Anthropic refused to patch the vulnerability; both this and Anthropic's account cannot be simultaneously true, but no public evidence exists to determine which is accurate. The Pentagon's chief information officer publicly supported the decision, stating the department prioritized national security over revenue cycles. The shutdown occurred with no published threshold, no technical finding, and no independent review—just a letter arriving late on a Friday afternoon. The decision represents a new instrument of state power: the ability to unilaterally disable a deployed frontier system with no transparent decision-making process, setting AI models alongside advanced semiconductors and military technology as strategically controlled assets.
The order directs the Secretaries of the Treasury, War (through the Director of NSA), and Homeland Security (through the director of CISA) to design a voluntary framework through which developers may submit models for evaluation. The structure represents a significant departure from earlier proposals: an earlier version gave the government up to 90 days to review advanced models before release — a timeline that was cut to 30 days in the final order after Trump worried the order would stifle American companies' lead in the global race amid competitive pressure from China.
The order distributes testing responsibility among several national security organisations including the NSA and CISA, rather than giving the Center for AI Standards and Innovation (CAISI) the central role. Reports suggest this reflected officials' push for AI national security priorities to sit within traditional security agencies rather than CAISI. Days after the order, National Cyber Director Sean Cairncross ordered CAISI to stop publishing AI model assessments, ending public transparency in frontier AI evaluation. The directive transfers oversight authority from CAISI to a classified system managed by national security agencies. CAISI had completed over 40 evaluations of AI models by early June 2026, and had announced agreements with Google DeepMind, Microsoft and Elon Musk's xAI on 5 May to evaluate their models before public release.
The timing appears linked to advances in AI capabilities for cybersecurity. Anthropic's unreleased Claude Mythos model demonstrated an extraordinary ability to autonomously detect thousands of previously overlooked high-severity zero-day vulnerabilities within major operating systems. According to CNN, Mythos sparked concerns among governments, banks and utility companies, with Anthropic restricting access to approved organisations rather than releasing the model publicly. The Mythos announcement came in April, one month before the CAISI partnerships were formalised and weeks before the executive order — suggesting the model's capabilities may have accelerated government action.
The shift toward classified evaluation has implications for transparency and competition. CAISI's public evaluations served as a kind of neutral benchmarking service; with evaluations now classified, that independent verification disappears. US-based AI firms are now subject to a review process that Chinese, European, and other international competitors are not. The move represents what Scientific American described as a fundamental shift from the administration's previous hands-off approach to the technology, reflecting how the development of more powerful AI models has spooked some federal officials, prompting the White House to reverse course and back some safety measures.
The National Security Agency is using Anthropic's Claude Mythos model for offensive cyber operations, according to a Financial Times report that marks the first confirmed deployment of frontier AI capabilities for government cyberwarfare. The arrangement is particularly striking given that the Department of Defense designated Anthropic a "supply chain risk" earlier this year, effectively blacklisting the company from federal contracts.
Anthropic has embedded approximately six forward-deployed engineers inside the NSA to guide the agency's use of Mythos and customize the model for specialized applications, according to Tom's Hardware. Sources told the Financial Times that Mythos could be used to infiltrate the networks of other states, notably China and Iran. Mythos is the version of Anthropic's most capable model that the company deemed too dangerous for public release due to its cyber vulnerability exploitation capabilities, with Anthropic stating it can identify and exploit zero-day vulnerabilities in every major operating system and web browser.
The collaboration represents a sharp contradiction in the government's posture toward Anthropic. The dispute between Anthropic and the Pentagon began in January 2026, when the two parties were negotiating a $200 million contract and the Trump administration demanded that Anthropic allow usage of its technology for "all lawful purposes," implying the removal of AI guardrails — a move that conflicted with the company's usage policy. After Anthropic withdrew from that contract over concerns about domestic surveillance and autonomous weaponry, the Pentagon signed agreements with OpenAI, Google, and xAI instead. Yet the NSA sits under the Department of Defense, the same department arguing in court that Anthropic's technology poses a national security risk, though reports suggest the NSA was already using Mythos despite the blacklist, according to TechSpot.
The revelation raises fundamental questions about the dual nature of AI safety work — models withheld from public release due to danger are being provided to government agencies for exactly the capabilities deemed too risky for general availability. This pattern extends beyond Anthropic: the US government's restriction of Fable 5 over concerns about jailbroken cyber capabilities suggests a policy of government monopoly on dangerous cyber AI rather than preventing development of such capabilities entirely. Anthropic has framed Mythos as a defensive cybersecurity tool, launching Project Glasswing in April with partners including AWS, Google, Microsoft, Nvidia, and CrowdStrike, and this week announced that partners had found more than 10,000 high- or critical-severity flaws, with access expanding to approximately 150 organizations across 15 countries.
On 10 June, senior AI safety researchers announced Sequent, a new nonprofit alignment research organisation targeting $100-150 million in initial funding and 40-80 full-time researchers within two years. Led by Geoffrey Irving, formerly Chief Scientist at the UK AI Safety Institute and previously at DeepMind, OpenAI, and Google Brain, alongside Daniel Murfet from Timaeus, the organisation represents a significant bet on theory-driven approaches to artificial superintelligence alignment.
Sequent's central thesis is that empirical programmes at major AI labs are unlikely to deliver high prior confidence that superintelligent systems will behave as intended. The organisation aims instead to pursue what it calls a portfolio of theoretical and empirical bets that, if any succeed, would provide stronger a priori guarantees before training advanced AI systems. Research areas include scalable oversight techniques such as debate and amplification — methods Irving helped pioneer during his tenure at OpenAI — as well as singular learning theory, heuristic arguments, and game-theoretic frameworks. The organisation plans heavy investment in automated research tools, arguing that theoretical approaches offer better filters for determining which automated directions hold promise.
To preserve the advantages of smaller alignment teams — research focus, opinionated leadership, and low coordination overhead — Sequent will adopt a federated structure in which a handful of research directors maintain substantial autonomy over research direction, team culture, and hiring within their areas. These directors will report to Irving, and the final portfolio of research areas will depend on which senior researchers join. The organisation explicitly seeks to remain independent rather than join an existing AI lab, citing the need to maintain the freedom to raise concerns if fundamental obstacles emerge and to avoid institutional pressure toward purely empirical approaches.
The launch comes at a moment of growing concern about whether alignment research will keep pace with capabilities development. Sequent acknowledges it may exacerbate the bottleneck of experienced alignment researchers available to other efforts, but contends that no comparable large-scale theory-focused organisation currently exists. Whether automated alignment research can deliver theoretical guarantees before the arrival of transformative AI systems remains an open question, one that Sequent's substantial funding target suggests will require both significant resources and a departure from current laboratory norms.
Go deeper: Sequent announcement on Alignment Forum
The surge in congressional activity reflects deepening frustration with executive inaction on technology controls targeting China's semiconductor and AI capabilities.
The most consequential piece of legislation is the Multilateral Alignment of Technology Controls on Hardware (MATCH) Act, introduced by Representative Michael Baumgartner in early April. The bill would compel allied nations to impose export controls on advanced chipmaking equipment sales to China equivalent to those maintained by the United States, threatening to invoke the Foreign Direct Product Rule if allies fail to harmonize their restrictions. China's imports of semiconductor manufacturing equipment surged from $10.7 billion in 2016 to approximately $51.1 billion in 2025, according to analysis from Silverado Policy Accelerator, highlighting the scale of the challenge. The legislation targets a critical asymmetry: while U.S. companies face stringent controls, allied firms from the Netherlands, Japan, and South Korea have continued servicing and selling equipment to Chinese customers, allowing Beijing to stockpile chokepoint technologies like deep ultraviolet lithography machines.
Equally significant is the AI Overwatch Act, which the House Foreign Affairs Committee advanced on 21 January by a vote of 42-2. The bill would impose a statutory two-year ban on exports of Nvidia's Blackwell-class chips to China and require the Commerce Department to notify Congress before approving licenses for advanced AI chip exports to designated high-risk countries, granting lawmakers the power to block transactions through a joint resolution of disapproval. This arms-sale-style oversight mechanism represents a direct congressional challenge to executive control over technology policy, coming in the wake of the Trump administration's decision to shift H200 chip exports from presumption of denial to case-by-case review in January 2026.
The congressional push reflects a broader pattern: the Trump administration has imposed no new technology-based controls on China since taking office, while enforcement gaps—including a loophole that allowed Chinese subsidiaries to purchase advanced AI chips—went unaddressed for over a year. Allied governments report confusion about U.S. strategy, with the executive branch signaling openness to commerce with China while Congress advances restrictive legislation. This discord creates negotiating leverage: statutory restrictions would allow the administration to position controls as beyond its discretion when engaging with Beijing and allied capitals. Sources indicate Chinese officials are lobbying heavily against the MATCH Act, suggesting genuine concern about its potential to disrupt China's semiconductor indigenization efforts. Whether the NDAA ultimately includes symbolic gestures or substantive measures like MATCH and AI Overwatch will determine whether congressional hawks succeed in reclaiming control over China technology policy from an executive branch perceived as prioritizing diplomatic stability over technological containment.
On 8 June, Cognition released FrontierCode, a coding benchmark designed to measure whether AI-generated code meets the standards human maintainers would accept in production, rather than merely testing functional correctness. The benchmark comprises 150 hand-crafted tasks spanning Python, Go, TypeScript, JavaScript, Java, C/C++, and other languages, with each task requiring more than 40 hours of work by leading open-source developers. Tasks are evaluated across six dimensions — correctness, test quality, scope discipline, style adherence, maintainability, and regression safety — using a grading system in which any "blocker" issue earns an automatic zero, even if other aspects of the code are sound.
On the hardest Diamond tier, which contains 50 tasks, Claude Opus 4.8 achieved only 13.4%, followed by GPT-5.5 at 6.3% and Claude Opus 4.7 at 5.2%. Performance improved on the Main tier (100 tasks including Diamond) to 34.3%, 25.5%, and 23% respectively, and on the Extended tier (all 150 tasks) to 51.8%, 44.8%, and 43.2%. The low scores reflect a gap between code that runs and code that satisfies the discipline expected in professional codebases — what Cognition describes as the difference between passing unit tests and earning approval from a repository maintainer.
The benchmark's difficulty stands in sharp contrast to earlier evaluations. SWE-Bench, introduced in October 2023, has shown signs of saturation, with leading models now scoring above 50% on many variants. Cognition's initiative aims to establish a new standard for what it terms "maintainable code," positioning FrontierCode as the third era of AI coding benchmarks after autocomplete (HumanEval, 2021) and test-passing (SWE-Bench, 2023). The company has opened evaluation to all model creators, framing the benchmark as a measure of production readiness for autonomous coding agents.
FrontierCode's focus on mergeability addresses what some researchers view as a systemic weakness in current coding agents. Tasks assess not only whether code produces correct output, but whether it introduces unnecessary scope changes, maintains consistent style, includes appropriate tests, and avoids subtle antipatterns — criteria that are difficult to encode in binary pass-fail tests. One example task involved refactoring warning logs into a new function; Claude Opus 4.8 produced functionally equivalent code but mixed logging patterns in ways that would complicate future maintenance, illustrating the nuanced quality gaps the benchmark is designed to capture.
The release comes amid rapid iteration cycles among frontier labs. Claude Opus 4.8 was released on 28 May 2026, just 41 days after its predecessor. A subsequent model, Claude Fable 5, launched in mid-June and more than doubled the Diamond score to 29.3%, suggesting the benchmark may saturate faster than Cognition anticipated — though scores remain well below the thresholds seen on earlier evaluations, and the low baseline reinforces the view that production-grade agentic coding remains an unsolved problem.
On 28 October 2025, OpenAI CEO Sam Altman and chief scientist Jakub Pachocki announced during a livestream that the company is targeting March 2028 to build a fully autonomous AI researcher—a system capable of running independent research projects from conception to completion. The announcement laid out three core goals: building an automated AI researcher that remains steerable and accountable, accelerating the economy through scientific progress, and delivering personal AGI to everyone on Earth.
The timeline includes an intermediate milestone: an AI research intern by September 2026, designed to meaningfully accelerate human scientific work. According to The Decoder, Pachocki emphasized that the research intern would significantly speed up OpenAI's own researchers, while the March 2028 system would handle entire research workflows autonomously. The explicit less-than-two-year timeframe from mid-2026 to early 2028 represents OpenAI's most concrete public statement about when it expects to achieve systems capable of recursive self-improvement—a threshold widely considered pivotal in discussions of transformative AI risk.
Pachocki outlined the technical foundations underpinning these ambitions, pointing to continued scaling of deep learning systems and advances in "in-context compute"—runtime processing power that extends a model's reasoning capacity. The Decoder reported that OpenAI plans to dramatically extend the time horizons over which models can reason, moving well beyond current capabilities. Pachocki also introduced a five-layer safety model spanning value alignment, goal alignment, reliability, adversarial robustness, and systemic safety, with Chain-of-Thought Faithfulness emerging as a central research area to manage portions of internal reasoning that may remain unsupervised.
The announcement arrived the same day OpenAI finalized its restructuring into a public benefit corporation, separating from its original non-profit charter. The March 2028 target aligns with statements from OpenAI co-founder Greg Brockman, who said he expects AGI within one to three years and that he would consider it a failure if the company had not reached AGI by 2030, according to Prinz AI. During the livestream, Altman emphasized that defining a concrete target—an automated AI researcher—was more useful than attempting to satisfy varied interpretations of AGI. The framing of universal personal AGI as a top-level corporate goal signals OpenAI's vision for post-AGI deployment, though the company has provided no detail on distribution mechanisms or timelines beyond the research automation milestone.
On 12 June, the Senate Armed Services Committee incorporated Senator Elissa Slotkin's AI Guardrails Act into the National Defense Authorization Act markup, establishing what Slotkin described as the first statutory constraints on Pentagon AI use, particularly for life-and-death decisions. The legislation mandates that human beings remain the ultimate decision makers in the kill chain, with specific prohibitions on AI making final decisions on nuclear weapon deployment, domestic surveillance, or lethal targeting without human oversight.
Slotkin, who introduced the standalone bill in March, argued that no single Secretary of Defense or AI company should unilaterally set rules for AI weapons deployment — such decisions should be legislated to prevent arbitrary changes by future administrations. The provision also mandates rigorous testing of AI systems before deployment, applying standards comparable to or exceeding those used for traditional weapons systems. The move comes amid heightened congressional interest in military AI governance, with fellow Armed Services Committee member Senator Kirsten Gillibrand also introducing parallel legislation, the Secure and Accountable Military AI Act, which would impose similar restrictions on AI use for launching nuclear weapons, surveilling Americans, and developing autonomous weapon systems.
The legislative push follows a public dispute between the Pentagon and AI firm Anthropic, which culminated in the Department of Defense designating Anthropic a supply chain risk and severing contracts after the company pressed for specific assurances around autonomous weapons and mass surveillance. Slotkin's legislation appears designed to codify the type of guardrails Anthropic had sought, framing them as essential safeguards rather than obstacles to AI adoption. She has emphasised that the guardrails align with the Trump administration's AI Action Plan, which calls for aggressive AI adoption by the armed forces while ensuring systems are secure and reliable.
Beyond military applications, Slotkin is separately working on legislation to prevent AI from making final decisions on veterans' healthcare benefits, allowing AI only as a decision support tool. The NDAA markup occurred behind closed doors, which Slotkin credits for enabling substantive bipartisan negotiation on AI constraints — a rare area of cross-party agreement in an otherwise fractious policy landscape. The full text of the AI Guardrails Act, available through Congress.gov, runs to just five pages and establishes what supporters describe as left and right limits on Pentagon AI deployment without impeding technological competitiveness against adversaries such as China.
On 15 June, Hungary's parliament approved a constitutional amendment imposing an eight-year limit on prime ministers, a measure designed to permanently block Viktor Orbán from returning to office after two decades in power. Lawmakers voted 135-50 in favour of the retroactive restriction, which counts prior service toward the cap and prevents anyone who has served at least eight years as prime minister since 1990 from holding the office again.
The constitutional change fulfils a central campaign promise by Prime Minister Péter Magyar, whose Tisza Party won a two-thirds parliamentary majority in April elections and ended Orbán's 16-year uninterrupted tenure. Magyar, a 45-year-old lawyer and former Orbán loyalist who broke with Fidesz in 2024 over what he described as systemic corruption, has pledged sweeping reforms aimed at dismantling the apparatus Orbán built to consolidate executive power. Magyar argued that the possibility of limitless tenure leads to power concentration, citing his predecessor as a cautionary example.
Orbán served as prime minister from 1998 to 2002 and again from 2010 until his electoral defeat in April, making him the longest-serving head of government in modern Hungarian history. During his tenure, he systematically weakened judicial independence, centralised media control, and undermined institutional checks on executive power—a playbook that influenced authoritarian-leaning leaders across Europe and beyond. His government also established entities such as the Integrity Authority, ostensibly to combat corruption, though critics noted it primarily targeted independent media and civil society organisations. Magyar's government is now moving to dissolve that agency by the end of June.
The term-limit vote represents a significant institutional check on power concentration in a country that became synonymous with democratic backsliding under Orbán's rule. Orbán's Fidesz party, now in opposition, voted against the measure, and the former prime minister—recently re-elected as party leader—criticised the amendment on social media, referring to it as "the Orbán law" and suggesting that restricting popular will through constitutional means was the new government's most pressing priority. Whether Magyar can sustain these reforms and rebuild democratic guardrails over the long term will determine whether Hungary's current trajectory represents genuine democratic restoration or a temporary reversal in its authoritarian arc.
Generated at 2026-06-18 05:44 UTC