X-Risk Daily

Saturday 20 June 2026
32 news · 5 research · 19 analysis · 6 updates from yesterday

Senate and House NDAA drafts include restrictions on AI in nuclear weapons and lethal force

Transformative AI New!
The Senate Armed Services Committee unveiled extensive AI provisions in its version of the fiscal year 2027 National Defense Authorization Act on 10 June, including what amounts to a prohibition on the use of artificial intelligence in nuclear weapon launch decisions and a requirement for human judgment in lethal force determinations.
Congressional prohibition on AI in nuclear launch decisions and lethal force restrictions establish human-in-loop requirements for catastrophic weapons.

The Senate Armed Services Committee unveiled extensive AI provisions in its version of the fiscal year 2027 National Defense Authorization Act on 10 June, including what amounts to a prohibition on the use of artificial intelligence in nuclear weapon launch decisions and a requirement for human judgment in lethal force determinations. The $1.15 trillion bill represents Congress's most comprehensive attempt to date to regulate military AI use in the highest-stakes domains.

The Senate draft would ban the use of AI to decide whether nuclear weapons should be deployed and forbid the military from planning attacks without the incorporation of appropriate levels of human judgment, according to NOTUS. Senator Elissa Slotkin described the measures as establishing "first ever comprehensive safeguards" governing AI deployment across the Department of Defense, ensuring a human has ultimate decision authority when autonomous weapons are fired and that a human remains on the switch for nuclear launches. The provisions build on earlier legislation, including the Block Nuclear Launch by Autonomous Artificial Intelligence Act passed in 2023, and would codify existing Pentagon Directive 3000.09 into law.

Both Senate and House versions of the NDAA would complicate the Defense Department's ability to designate companies as supply chain risks, potentially constraining future restrictions on AI firms. The nuclear weapons language appears designed to move beyond guidelines to enforceable prohibition: one analysis noted that codifying the restriction in statute gives it the force of law rather than leaving it to executive branch discretion, which can shift with administrations. However, the bill text falls short of detailing what constitutes appropriate human judgment in practice, according to NOTUS, and experts have warned that without accompanying accountability mechanisms, the provisions may preserve the appearance of human control while the substance erodes.

The timing is significant given recent high-profile uses of AI in military operations. In January, Anthropic's Claude tool was used in the capture of Venezuelan President Nicolás Maduro and reportedly employed on the first day of Operation Epic Fury in Iran, with speculation that AI targeting systems were involved in a missile strike on a school that killed 175 people. The Trump administration has also asked for revisions to the Pentagon's rules around autonomous weapons, raising concerns among congressional Democrats that existing safety protocols and human oversight requirements could be undermined. The NDAA still requires reconciliation between Senate and House versions before becoming law.

Originally from: Transformer — Read original

Z.ai releases MIT-licensed GLM-5.2 claimed to match frontier performance

Transformative AI New!
On 13 June, Z.ai, the Beijing-based AI company formerly known as Zhipu AI, released the full MIT-licensed weights of its GLM-5.2 model to the public, marking what may be the most capable openly-licensed language model to date.
Open-weights frontier model release undermines compute-based governance and export control strategies if performance claims verified.

The release carries strategic implications for AI governance: it demonstrates that frontier-class capabilities can be distributed under permissive licensing just as export controls tighten around Western models.

Benchmarks published alongside the open weights show GLM-5.2 as the strongest open-source coding model currently available, scoring 81.0 on Terminal-Bench 2.1, closing to within four points of Claude Opus 4.8's 85.0, and achieving 62.1 on SWE-bench Pro, ahead of both GPT-5.5 at 58.6 and its predecessor GLM-5.1. On long-horizon agentic coding tasks—FrontierSWE, PostTrainBench, and SWE-Marathon—GLM-5.2 delivers substantially stronger performance than GLM-5.1, with its capability roughly positioned between Claude Opus 4.7 and Claude Opus 4.8. The model's 1 million-token context window is designed to handle large codebases and project-scale engineering tasks, supported by an architectural innovation called IndexShare, which reduces per-token FLOPs by 2.9× at a 1M context length.

Developer reaction has been immediate and overwhelmingly positive, with the team behind Kilo Code confirming day-one integration and Cline IDE noting that "GLM-5.2 is the first open-weights model to cross 80% on Terminal-Bench". The MIT license means the software can be used, modified, and commercialized without paying royalties or adhering to restrictive governance policies, allowing engineering teams to host frontier-level AI on their own sovereign infrastructure. For enterprises navigating data residency requirements and vendor lock-in concerns, the timing is significant: GLM 5.2 shipped MIT-licensed open weights as Washington moved to suspend access to top U.S. models in some overseas markets.

The release raises immediate questions about the durability of capability-based export restrictions. If a model approaching Claude Opus 4.8's performance is available under an unrestricted open license, controls that limit access to specific proprietary systems become strategically weaker. According to TechTimes, Z.ai has been explicit that the open-source release is a deliberate response to geopolitical restriction of AI access. Mistral has reportedly benefited from similar dynamics by positioning itself around AI sovereignty and models customers can deploy independently—an approach that gains strategic advantage as US-China tensions make API-dependent Western systems less reliable for international buyers. Independent verification of Z.ai's frontier claims remains the next step, but early developer adoption and benchmark results suggest the model represents a meaningful shift in what capabilities are now freely available.

Originally from: Transformer — Read original

Trump endorses comparison to historical autocrats, claiming greater power than Hitler, Stalin, and Mao

Fanatical & Malevolent Actors New!
On 18 June 2026, US President Donald Trump endorsed a social media post that favourably compared him to Adolf Hitler, Joseph Stalin, Mao Zedong, and other historical autocrats, claiming he wields greater power than any of them.
Power concentration and erosion of democratic constraints during the AI transition — authoritarian psychology in a nuclear-armed state with major AI development.

The president shared the post on Truth Social with the comment "Sounds good to me!" according to Raw Story and The New Republic.

The post, attributed to "Presidential Historian Dave King," declared Trump "the most powerful man that the planet has ever known" and listed historical figures "characterised by brutal conquest and the fear that they instilled in the populations that came under their influence," including Alexander the Great, Genghis Khan, Attila the Hun, Napoleon, Hitler, Mao, and Stalin. The author has since been identified as a businessman known in golf circles as Gary Player's longtime caddy and confidant, according to IBTimes UK. The post argued that "the overwhelming difference" between Trump and these figures was "their lack of global reach."

The endorsement came just hours after Trump signed a controversial memorandum of understanding with Iran at the Versailles Palace in France. The incident occurred during a period of heightened concern about democratic backsliding in the United States. In March 2026, NPR reported that three major reports found President Trump had done serious damage to American democracy at remarkable speed, with one researcher noting that democracy had been rolled back as much during one year of Trump's administration as it took authoritarian leaders in India, Turkey, and Hungary between four and ten years to accomplish.

The president's social media behaviour has itself become a subject of analysis. The Financial Times reported that Trump averaged 19 Truth Social posts per day in 2026, totalling more than 2,700 posts, while The Daily Beast calculated that his average spiked to 27 posts per day in May. The autocrat comparison post represents a particularly stark example of Trump's willingness to embrace authoritarian framing during a critical period for AI governance, democratic institutions, and great-power competition.

Unlike routine policy disagreements, the incident involves a sitting head of state openly signalling alignment with totalitarian power structures and leaders responsible for tens of millions of deaths. The endorsement occurred amid ongoing debates over executive authority and democratic safeguards in the United States, raising concerns about the psychological orientation of leadership during a period when decisions about emerging technologies and global risks carry existential implications.

Originally from: The Guardian — Read original

Trump installs loyalist Bill Pulte as acting US intelligence chief after blocking Senate confirmation

Fanatical & Malevolent Actors New!
Bill Pulte, director of the Federal Housing Finance Agency who investigated Trump's political opponents, became acting director of national intelligence on 19 June after Donald Trump engineered his appointment by blocking the Senate confirmation process.
Concentration of unchecked power and erosion of intelligence independence during the AI transition.
Trump shortened outgoing DNI Tulsi Gabbard's tenure and abruptly cancelled a confirmation hearing for his own nominee, Jay Clayton, to ensure Pulte would assume the role. The manoeuvre circumvents normal Senate oversight of intelligence appointments. Pulte's background investigating Trump's enemies rather than traditional intelligence work, combined with the procedural manipulation to install him, raises concerns about the politicisation of US intelligence agencies. The DNI oversees 18 intelligence agencies including the CIA and NSA, coordinates national security assessments, and advises the president on threats. Installing a political loyalist without Senate confirmation in this position could compromise intelligence independence, weaken institutional checks on executive power, and degrade the quality of threat assessment during a period when AI development is accelerating and geopolitical tensions remain elevated. The move follows a pattern of Trump appointments that prioritise personal loyalty over institutional expertise.
Source: The Guardian — Read original

White House and Anthropic negotiate formal AI safety standards after Fable pulldown

Transformative AI
↻ Continues from: "White House Shuts Down Anthropic's Claude Fable 5 Over Code Security 'Jailbreak' That Cannot Be Fixed"
Following the abrupt government-mandated removal of Anthropic's Fable model on 13 June, the White House and Anthropic are reportedly developing a common set of benchmarks to assess future jailbreaks, including the severity of safeguard bypasses, exposed capabilities, and practical consequences.
Establishes precedent for government intervention in frontier AI deployment and signals movement toward mandatory safety testing frameworks.
The move represents a shift toward formalized technical standards for frontier AI deployment. At the G7 summit on 15-16 June, the incident dominated AI discussions, with French President Emmanuel Macron calling the US action "strictly nationalist" and Indian PM Narendra Modi expressing concern about democratic nations' access. G7 leaders discussed establishing a "trusted access" scheme to guarantee use of advanced models. Rep. Josh Gottheimer announced he is preparing legislation requiring mandatory testing of frontier AI models for national security risks. The timeline reveals that Commerce Department and CAISI tested Fable before release and gave no deployment objection, but Amazon researchers subsequently found what they characterized as a jailbreak involving software code disclosure. After Amazon CEO Andy Jassy shared findings with Treasury Secretary Scott Bessent and the NSA, the White House gave Anthropic 90 minutes to pull both Mythos and Fable globally. President Trump approved export controls, effectively forcing removal. The administration was reportedly already frustrated that Anthropic had expanded Mythos access to South Korea's SK Telecom without government permission.
Source: Transformer — Read original
Transformative AI

US government orders first-ever restriction of AI model over jailbreak vulnerability

Transformative AI
↻ Continues from: "White House orders halt to government AI model assessments, citing security risks"
On 12 June, the US government ordered Anthropic to disable Claude Fable 5 and Mythos 5 worldwide, just three days after Fable 5's 9 June release.
First government intervention blocking a frontier model release establishes precedent for capability-based restrictions during AI transition.

Commerce Secretary Howard Lutnick sent Anthropic CEO Dario Amodei a letter outlining the restrictions, marking the first time Washington has blocked an AI model release on national security grounds.

The directive followed warnings from Amazon researchers who flagged a jailbreak bypassing Fable's safeguards to elicit dual-use cyber capabilities. A person close to the White House told Semafor that Amazon flagged the jailbreak to the government, and that Amazon CEO Andy Jassy had been in contact with the administration about it. Fable 5 scored 53.3% on Humanity's Last Exam benchmark, compared to Claude Opus 4.8's 45.7%, and possesses capabilities similar to Claude Mythos Preview—a model Anthropic deemed too dangerous for general release in April. Mythos is understood to currently be in use by the NSA for offensive cyber operations, according to Tom's Hardware.

The export control directive required restricting access for all foreign nationals, whether inside or outside the United States, including Anthropic's own foreign-born employees. Given the scope of the directive, Anthropic argued it had no choice but to disable the models for all users. The company received the order at 5:21pm ET on 12 June and had to "abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance." Access to other Claude models, including Opus 4.8, remained unaffected.

Anthropic contested the action, arguing that the jailbreak technique "essentially consists of asking the model to read a specific codebase and fix any software flaws," and that a demonstration surfaced previously known, minor vulnerabilities also discoverable by other publicly available models, including OpenAI's GPT-5.5. The company maintained its safeguards are substantially more effective than those of any previously deployed model, and that perfect jailbreak robustness is currently impossible. Anthropic wrote: "We disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions of people." CNN reported that Anthropic argued the standard would halt all new frontier model deployments across the AI industry.

The dispute unfolded against a backdrop of prior tensions. Earlier this year, the Department of Defense declared Anthropic a supply chain risk—a designation historically applied to foreign adversaries—following the collapse of talks between the two sides. The label obligates defense contractors to certify they are not using Claude in military work. White House AI adviser David Sacks said the administration issued the export control "reluctantly" after Anthropic refused to fix the flaw or pull the model, that it wants the restriction lifted once the jailbreak is patched, and that "the ball is in Anthropic's court." The action signals Washington's willingness to invoke emergency export controls to intervene in frontier AI deployment when national security concerns emerge, setting a precedent that could reshape how American labs release models globally.

Originally from: Center for AI Safety Newsletter — Read original

Sen. Blackburn negotiates child safety and deepfake protections for AI preemption deal

Transformative AI New!
Tennessee Senator Marsha Blackburn is negotiating with the White House over legislation that would trade child safety measures and deepfake protections for federal preemption of state AI laws.
Federal preemption could block state-level AI safety regulation, centralizing governance authority and potentially weakening safeguards.
The Daily Signal reports she will introduce the package "soon." The proposal includes her KOSA bill requiring platforms to prevent mental health disorders among minors and disable addictive features, the NO FAKES Act creating federal property rights in voice and likeness with deepfake liability, and the App Store Accountability Act mandating age verification and parental controls. Child safety groups are pushing to add the SCREEN Act or SAFE for Kids Act (age verification for porn sites) and the GUARD Act (restricting AI chatbot access for minors with liability for mental health harms). NO FAKES advanced through Senate Judiciary on 12 June, though three Republicans and one Democrat expressed First Amendment concerns. Senator Ted Cruz, who raised free speech reservations, has reportedly asked committee Republicans to submit AI priority lists and been involved in preemption discussions. The House has its own KOSA version focusing on parental controls rather than duty of care, complicating Republican support. The White House has not indicated positions on the component bills despite meeting with online safety groups on 18 June.
Source: Transformer — Read original

DeepSeek raises $7.4bn at $50bn+ valuation with unusual control structure

Transformative AI New!
DeepSeek closed a $7.4bn funding round at a valuation exceeding $50bn, with an unusual structure requiring investors to put capital into a limited partnership managed by CEO Liang Wenfeng.
Chinese state-backed frontier lab raises $7.4bn; US Commerce secretary decides against Entity List despite security concerns.
The arrangement ensures Liang retains complete control, with only China's national AI fund holding voting rights. The structure resembles Elon Musk's approach with SpaceX and represents Chinese state backing for a frontier AI company while maintaining the appearance of private control. The Trump administration reportedly decided not to add DeepSeek and over 100 other Chinese companies to its Entity List despite national security concerns, apparently to avoid escalating US-China tensions. Microsoft is reportedly considering switching Copilot Cowork from expensive Anthropic and OpenAI models to DeepSeek V4 as a cheaper alternative, which would represent a major commercial win for the Chinese company. The US export controls on Fable and Mythos have reportedly created opportunities for Mistral, which has positioned itself around AI sovereignty and open-weight models customers can deploy independently.
Source: Transformer — Read original

Google DeepMind's Gemma 3 runs on satellite to identify targets autonomously

Transformative AI New!
Loft Orbital's satellite ran Google DeepMind's Gemma 3 model on-device to respond to researchers' commands by identifying areas of interest autonomously—an AI-in-space first.
AI models operating autonomously in space for targeting decisions; removes human oversight from satellite operations.
The demonstration shows frontier AI models can operate in space environments without ground communication, making satellite targeting and surveillance decisions independently. Combined with Nvidia's announcement that its new agent harness enables coding agents to autonomously train physical robots, the developments indicate AI systems are gaining the ability to operate independently in both space and terrestrial physical environments. The capability has obvious military applications for autonomous weapons systems and surveillance.
Source: Transformer — Read original

OpenAI hires transformers co-author Noam Shazeer and policy analyst Dean Ball

Transformative AI New!
On 17 June, OpenAI announced two significant hires: Noam Shazeer, a Google DeepMind researcher who co-authored the original 2017 transformers paper, will work on finding new AI model architectures beyond the transformer paradigm.
Shazeer hire signals exploration of post-transformer architectures; Ball hire indicates OpenAI prioritizing government relations after Fable crisis.
Dean Ball, author of the Hyperdimensional Substack and a non-resident senior fellow at the Foundation for American Innovation, joined as head of strategic futures to work on frontier AI policy. Ball will remain affiliated with FAI and continue his independent Substack. The hires signal OpenAI's focus on both technical research beyond current architectures and closer engagement with policy development. Shazeer's background in fundamental AI research suggests OpenAI is exploring architectural alternatives as transformer scaling potentially approaches limits. Ball has previously argued that building superintelligence is "a profoundly political act" that "implicates the sovereignty of the state itself," positioning him to navigate the increasingly fraught relationship between frontier labs and government.
Source: Transformer — Read original

Kazakhstan signs $10bn AI infrastructure deal with Nvidia for 100,000-GPU cluster

Transformative AI New!
Kazakhstan signed a $10bn AI infrastructure agreement with Nvidia and startup Firebird to build a computing cluster with 100,000 GPUs.
Large-scale compute buildout in Central Asia could shift global AI development geography; autonomous robot training reduces human oversight.
The deal represents one of the largest national AI infrastructure investments outside the US and China. Nvidia also sold $25bn in bonds to help fund the broader AI infrastructure buildout and is encouraging data centers globally to adopt its TPU chips by providing financial guarantees, including a $3.2bn guarantee for the Lake Mariner cluster in New York state—mirroring the strategy that made its GPU architecture dominant. The company's new agent harness enabled coding agents to autonomously train physical robots, demonstrating progress in AI systems that can operate in the physical world without human oversight. Nvidia hired Bruce Andrews, a seasoned lobbyist, to lead government affairs in Washington, signaling increased focus on policy engagement.
Source: Transformer — Read original

Sen. Bernie Sanders proposes 50% tax on AI companies to fund $7 trillion sovereign wealth fund

Transformative AI New!
Senator Bernie Sanders unveiled legislation on 16 June proposing a one-time 50% tax on major AI companies' equity to create a $7 trillion sovereign wealth fund that would pay 5% dividends to all Americans.
Signals political willingness to dramatically restructure AI industry ownership, though proposal lacks enforcement mechanism for dividend payments.
The bill would apply to OpenAI and Anthropic, but would also cover data centers and AI compute infrastructure, potentially including Amazon, Nvidia, Google, and Microsoft. The fund would not be allowed to sell equity, making it unclear how the planned dividends would be funded without asset sales. The proposal represents the most aggressive redistributive AI policy put forward by a sitting senator and would effectively nationalize half the US AI industry if enacted. While the bill has no realistic prospect of passing, it indicates growing political pressure to address AI's wealth concentration effects and could influence future taxation proposals.
Source: Transformer — Read original

State attorneys general subpoena OpenAI over user data handling and impact on vulnerable populations

Transformative AI New!
A coalition of state attorneys general has subpoenaed OpenAI for documents covering advertising practices, user data handling, and the company's impact on minors and seniors.
Multi-state investigation into frontier lab's impact on vulnerable populations; state resistance to federal preemption could fragment AI governance.
The action represents the first major multi-state investigation into OpenAI's business practices and follows growing concern about AI systems' effects on vulnerable populations. The timing coincides with Sen. Blackburn's negotiations over child safety legislation and Americans for Responsible Innovation calling on NIST and CAISI to develop national safety benchmarks for AI chatbots used by minors. Over 200 state lawmakers from 42 states urged Congress to oppose the Great American AI Act's preemption provision, arguing it would prevent states from regulating AI harms. The combination of state AG action and legislative opposition to federal preemption suggests states intend to assert regulatory authority regardless of federal policy.
Source: Transformer — Read original

Trump executive order requires 30-day pre-release model submission to national security agencies

Transformative AI
On 2 June, President Trump signed an executive order titled "Promoting Advanced Artificial Intelligence Innovation and Security", requiring AI companies to provide new AI models to the US government 30 days before general release.
Establishes mandatory pre-release government review of frontier models, shifting enforcement to national security apparatus rather than standards body.

The order directs the Secretaries of the Treasury, War (through the Director of NSA), and Homeland Security (through the director of CISA) to design a voluntary framework through which developers may submit models for evaluation. The structure represents a significant departure from earlier proposals: an earlier version gave the government up to 90 days to review advanced models before release — a timeline that was cut to 30 days in the final order after Trump worried the order would stifle American companies' lead in the global race amid competitive pressure from China.

The order distributes testing responsibility among several national security organisations including the NSA and CISA, rather than giving the Center for AI Standards and Innovation (CAISI) the central role. Reports suggest this reflected officials' push for AI national security priorities to sit within traditional security agencies rather than CAISI. Days after the order, National Cyber Director Sean Cairncross ordered CAISI to stop publishing AI model assessments, ending public transparency in frontier AI evaluation. The directive transfers oversight authority from CAISI to a classified system managed by national security agencies. CAISI had completed over 40 evaluations of AI models by early June 2026, and had announced agreements with Google DeepMind, Microsoft and Elon Musk's xAI on 5 May to evaluate their models before public release.

The timing appears linked to advances in AI capabilities for cybersecurity. Anthropic's unreleased Claude Mythos model demonstrated an extraordinary ability to autonomously detect thousands of previously overlooked high-severity zero-day vulnerabilities within major operating systems. According to CNN, Mythos sparked concerns among governments, banks and utility companies, with Anthropic restricting access to approved organisations rather than releasing the model publicly. The Mythos announcement came in April, one month before the CAISI partnerships were formalised and weeks before the executive order — suggesting the model's capabilities may have accelerated government action.

The shift toward classified evaluation has implications for transparency and competition. CAISI's public evaluations served as a kind of neutral benchmarking service; with evaluations now classified, that independent verification disappears. US-based AI firms are now subject to a review process that Chinese, European, and other international competitors are not. The move represents what Scientific American described as a fundamental shift from the administration's previous hands-off approach to the technology, reflecting how the development of more powerful AI models has spooked some federal officials, prompting the White House to reverse course and back some safety measures.

Originally from: Center for AI Safety Newsletter — Read original

NSA reportedly using Anthropic's Mythos model for offensive cyber operations

Transformative AI
The National Security Agency is using Anthropic's Claude Mythos model for offensive cyber operations, according to a Financial Times report that marks the first confirmed deployment of frontier AI capabilities for government cyberwarfare.
Government deployment of AI for offensive cyber operations demonstrates willingness to weaponise frontier capabilities during AI transition.

The National Security Agency is using Anthropic's Claude Mythos model for offensive cyber operations, according to a Financial Times report that marks the first confirmed deployment of frontier AI capabilities for government cyberwarfare. The arrangement is particularly striking given that the Department of Defense designated Anthropic a "supply chain risk" earlier this year, effectively blacklisting the company from federal contracts.

Anthropic has embedded approximately six forward-deployed engineers inside the NSA to guide the agency's use of Mythos and customize the model for specialized applications, according to Tom's Hardware. Sources told the Financial Times that Mythos could be used to infiltrate the networks of other states, notably China and Iran. Mythos is the version of Anthropic's most capable model that the company deemed too dangerous for public release due to its cyber vulnerability exploitation capabilities, with Anthropic stating it can identify and exploit zero-day vulnerabilities in every major operating system and web browser.

The collaboration represents a sharp contradiction in the government's posture toward Anthropic. The dispute between Anthropic and the Pentagon began in January 2026, when the two parties were negotiating a $200 million contract and the Trump administration demanded that Anthropic allow usage of its technology for "all lawful purposes," implying the removal of AI guardrails — a move that conflicted with the company's usage policy. After Anthropic withdrew from that contract over concerns about domestic surveillance and autonomous weaponry, the Pentagon signed agreements with OpenAI, Google, and xAI instead. Yet the NSA sits under the Department of Defense, the same department arguing in court that Anthropic's technology poses a national security risk, though reports suggest the NSA was already using Mythos despite the blacklist, according to TechSpot.

The revelation raises fundamental questions about the dual nature of AI safety work — models withheld from public release due to danger are being provided to government agencies for exactly the capabilities deemed too risky for general availability. This pattern extends beyond Anthropic: the US government's restriction of Fable 5 over concerns about jailbroken cyber capabilities suggests a policy of government monopoly on dangerous cyber AI rather than preventing development of such capabilities entirely. Anthropic has framed Mythos as a defensive cybersecurity tool, launching Project Glasswing in April with partners including AWS, Google, Microsoft, Nvidia, and CrowdStrike, and this week announced that partners had found more than 10,000 high- or critical-severity flaws, with access expanding to approximately 150 organizations across 15 countries.

Originally from: Center for AI Safety Newsletter — Read original

SpaceX shares surge 20% on first trading day, company worth $2.5 trillion after xAI merger

Transformative AI
↻ Continues from: "SpaceX goes public at $2.5 trillion valuation, acquires Cursor for $60 billion"
On its first full trading day following the merger with xAI and X, SpaceX shares rose 20%, valuing the combined entity at approximately $2.5 trillion.
Government claims that Grok supports classified military targeting operations marks AI systems crossing into lethal force applications.
The company plans to raise $20bn in bonds to repay a bridge loan from the merger and has locked in its $60bn acquisition of coding assistant Cursor. SpaceX is renting its entire Colossus 1 data center to Anthropic, providing the AI company with significant compute capacity. The Justice Department intervened in a lawsuit against xAI in Mississippi, arguing the company is vital for national security and claiming Grok supports classified military operations and was involved in strikes on Iran. The DOJ filing represents the US government's clearest public statement that an AI system has been used in active military targeting operations. A federal judge dismissed the latest version of xAI's lawsuit against OpenAI, which accused Sam Altman of stealing Grok-related trade secrets from a former engineer.
Source: Transformer — Read original

OpenAI reported $39bn loss in 2025 driven by public benefit corporation conversion charge

Transformative AI New!
Financial data shared with the Financial Times reveals OpenAI burned $3.7bn in Q1 2026 with 39% gross margins, a slight improvement from 2025.
Financial sustainability affects OpenAI's ability to pursue safety research and maintain independence from revenue pressure.
For full-year 2025, the company posted a $39bn loss, largely driven by a $30bn charge from its late-2025 conversion to a public benefit corporation structure. The data shows revenues overtook the cost of serving customers, indicating operational profitability before the one-time restructuring charge. Early OpenAI employees have cashed out approximately $14bn in shares ahead of the company's planned IPO. The figures provide the first clear picture of OpenAI's economics and suggest the company is approaching sustainable operations despite massive absolute losses. The public benefit corporation structure was required for OpenAI's planned transition from non-profit control, with the $30bn charge representing the accounting treatment of that restructuring.
Source: Transformer — Read original

Meta's Applied AI team restructure described as "the gulag" by employees

Transformative AI New!
Meta's Applied AI team reorganization has reportedly left employees demoralized and directionless, with one telling Wired: "It's literally the gulag.
Internal dysfunction at major AI lab could slow capability development or increase safety oversight gaps during reorganization.
You have zero purpose in life all of a sudden, you barely interact with anyone, you just have these tasks every week." CTO Andrew Bosworth subsequently admitted in an internal memo that Meta's AI reorganization was "atrocious," promising stability, communication, and perks. The company introduced "AI mode" on Facebook, similar to Google's AI Overviews, and president Dina Powell McCormick is reportedly exploring raising tens of billions in equity and off-balance-sheet financing from Wall Street and sovereign wealth funds to fund the company's $600bn AI infrastructure push. After publicly supporting "tokenmaxxing" in early June, Meta is now imposing token limits on employees. The chaotic reorganization and employee dissatisfaction could affect Meta's ability to compete in frontier AI development at a critical moment.
Source: Transformer — Read original

Pennsylvania mandates AI data centers fund grid upgrades and source renewable energy

Transformative AI New!
Pennsylvania's House passed the state's first data center regulations on 16 June, requiring AI facilities to fund their own grid upgrades, contribute to low-income energy assistance programs, and source 32% of power from in-state renewables.
State-level infrastructure requirements could slow AI scaling or increase costs; federal deregulation moves in opposite direction.
Reps. Guthrie, Latta, and Evans introduced the federal Ratepayer Protection Act, which would prevent AI data centers from passing grid upgrade costs to local ratepayers. The Trump administration is allowing the Federal Data Center Enhancement Act to expire in September with no replacement, removing energy efficiency and water use requirements for federal data centers. California advanced two bills, SB 813 and AB 1405, to create independent third-party verification infrastructure for AI governance. The state-level activity on infrastructure requirements contrasts with federal deregulation and could create a patchwork of energy and environmental standards for AI development.
Source: Transformer — Read original

DeepSeek projected to raise $7.4 billion in first funding round

Transformative AI
Chinese AI company DeepSeek is projected to raise $7.4 billion in its first funding round, representing substantial investment in a major Chinese AI developer.
Significant capital flows to Chinese frontier AI development maintain competitive pressure during critical transition period.
The funding round indicates continued strong capital flows to Chinese frontier AI development despite geopolitical tensions and export controls. DeepSeek has been developing competitive models and the capital injection will likely accelerate its research capabilities. The substantial funding reflects both China's commitment to AI development and investor confidence in Chinese AI companies' ability to compete with Western labs despite technology restrictions.
Source: Center for AI Safety Newsletter — Read original

Representatives release draft bill requiring independent audits of frontier AI developers

Transformative AI
Representatives Jay Obernolte and Lori Trahan released a draft of the Great American AI Act, including proposals for mandatory independent audits of frontier AI developers.
Proposed federal legislation would mandate independent safety audits of frontier AI developers with uniform national standards.
The draft legislation includes a federal preemption clause that would override local laws on AI development while preserving local regulations on AI deployment. Mandatory independent audits would represent a significant increase in external oversight of frontier labs' safety practices and capability evaluations. The federal preemption aspect suggests an attempt to create uniform national standards for AI development while allowing variation in how AI systems are used locally. The distinction between development and deployment regulation indicates recognition that frontier AI development poses different governance challenges than AI application.
Source: Center for AI Safety Newsletter — Read original

France to replace Palantir AI tools with domestic provider to avoid US dependency

Transformative AI
France's domestic intelligence service is ending its use of Palantir's AI data tools in favour of domestic provider ChapsVision, Prime Minister Sébastien Lecornu announced on 16 June.
Reflects fragmentation of international AI cooperation and concentration of AI capabilities along geopolitical lines during the transformative AI transition.
The decision is driven by concerns about "strategic dependency" on US-controlled AI systems in critical national security infrastructure. Lecornu stated that France "cannot accept new strategic dependencies in the digital sphere" and must develop its own AI capabilities rather than relying on tools from foreign powers. The move reflects growing European anxiety about dependence on American technology companies for sensitive government functions, particularly as AI systems become more deeply embedded in intelligence operations. While the immediate switch is to a French provider, the announcement signals a broader policy shift toward technological sovereignty in AI deployment. The decision comes amid wider debates about compute governance, access to frontier AI systems, and the geopolitical concentration of AI development capabilities in a small number of US-based companies.
Source: The Guardian — Read original
Geopolitics & Conflict

US-Israeli military action against Iran claims thousands of lives, experts warn full death toll may remain unknown

Geopolitics & Conflict New!
A conflict between the United States, Israel, and Iran has resulted in thousands of confirmed deaths, according to a BBC News report published on 19 June 2026.
Direct military conflict between nuclear-armed powers (US) and a near-nuclear state (Iran) materially increases risk of nuclear escalation and great-power war.
Experts cited in the article note that the true casualty count may never be determined due to severe restrictions on internet access, media reporting, and government transparency across the affected region. The reporting challenges mirror difficulties seen in previous Middle Eastern conflicts where information infrastructure has been deliberately degraded or restricted during military operations. The lack of reliable casualty data complicates international humanitarian response and makes it difficult for observers to assess the scale of civilian harm. The conflict represents a significant escalation in tensions between Iran and Western powers, though the article does not specify when hostilities began or what precipitated them. Information blackouts of this kind historically correlate with higher civilian casualty rates and reduced accountability for potential violations of international humanitarian law.
Source: BBC News - World — Read original

US and Iran claim victory after deal ends months-long war

Geopolitics & Conflict New!
The United States and Iran have reached an agreement to end their military conflict, with both sides claiming to have achieved their strategic objectives.
Formal end to US-Iran war reduces nuclear escalation risk and regional instability during AI transition.
BBC analysts note that the deal follows months of direct warfare between the two powers, marking a formal conclusion to hostilities that had raised concerns about regional instability and potential nuclear escalation. The agreement's terms remain contested, with Washington and Tehran each framing the outcome as a victory for their respective positions. Analysts warn that both governments face domestic political pressures that could undermine the deal's durability. The conflict had represented one of the most serious direct confrontations between a nuclear-armed state and a regional power with nuclear ambitions, creating risks of miscalculation and escalation. While the deal reduces immediate military tensions, questions remain about its enforcement mechanisms and whether underlying geopolitical rivalries have been resolved or merely postponed. The agreement was reached on 18 June, though implementation challenges and verification procedures are still being negotiated.
Source: BBC News - World — Read original

US Defence Secretary Hegseth announces review of American military presence in Europe

Geopolitics & Conflict
On 18 June, US Defence Secretary Pete Hegseth announced a six-month review of American military forces in Europe during a confrontational address to NATO defence ministers in Brussels, signalling a potential fundamental shift in transatlantic security architecture that has underpinned European defence since the Second World War.
Weakening of NATO cohesion and US security guarantees in Europe increases great-power instability during the AI transition.

According to Air & Space Forces Magazine, the review will examine "America's core posture and basing in Europe" and will involve consultations with the US Congress, which has legislated a minimum of 76,000 troops on the continent.

The announcement comes weeks after the Pentagon informed allies on 3 June that it would reduce forces committed to NATO's Force Model—the pool of military capabilities available to the alliance in a crisis. NPR reported that the US would no longer supply an aircraft carrier and support ships, aerial refueling planes, and dozens of fighter jets, among other military assets, under the new arrangement. These cuts address what NATO's top US commander, Air Force General Alexus Grynkewich, described as an "unhealthy co-dependence" on American forces within the alliance.

Hegseth's remarks in Brussels were notably acerbic. He criticised European allies for what he termed their "shameful" failure to provide US forces access to bases on the continent during recent military operations against Iran, according to NBC News. The defence secretary also threatened to withhold US dues to NATO if allies failed to meet their defence spending commitments. Bloomberg noted that European leaders are now bracing for a plan outlining deep cuts to American support, as the Trump administration floats plans to slash military assets the Pentagon would deploy to defend Europe in the event of an attack.

The strategic realignment reflects the administration's broader pivot toward homeland security and the Indo-Pacific theatre, particularly in relation to China. Time reported that the Trump administration had earlier announced the withdrawal of 5,000 troops stationed in Germany. Any significant drawdown of US forces would represent a fundamental shift in the military balance on the continent, potentially affecting deterrence calculations vis-à-vis Russia and the credibility of collective defence commitments under Article 5 of the NATO treaty. While the review's outcome remains uncertain, the top US general in Europe, General Christopher Cavoli, has reportedly advised maintaining current force levels of approximately 80,000 troops—down from more than 100,000 following Russia's 2022 invasion of Ukraine.

Originally from: BBC News - World — Read original

Iranian oil tankers breach US naval blockade in Gulf of Oman

Geopolitics & Conflict
On 17 June, three Iranian tankers carrying crude oil successfully passed through a US naval blockade line in the Gulf of Oman, according to ship-tracking data reported by BBC News.
Great-power instability and potential military escalation in a region critical to global energy security.
The breach represents a significant escalation in US-Iran tensions, as it demonstrates Iran's willingness to challenge American military enforcement directly. The incident raises questions about the effectiveness of US naval operations in the region and could embolden further Iranian defiance of Western sanctions and military pressure. The blockade itself appears to be part of broader US efforts to restrict Iranian oil exports, though the article provides limited detail on the blockade's legal basis or operational parameters. The successful passage of the tankers may increase the likelihood of military confrontation between US and Iranian forces in the strategically vital Strait of Hormuz region, through which approximately one-fifth of global oil supplies transit. Whether this incident prompts a US military response or signals a shift in American enforcement posture remains unclear. The development occurs against a backdrop of long-standing nuclear tensions between Iran and Western powers.
Source: BBC News - World — Read original

BBC investigation reveals Russian intelligence directed arson plots targeting UK Prime Minister

Geopolitics & Conflict
↻ Continues from: "BBC investigation reveals Russian intelligence directed arson attacks targeting UK Prime Minister"
A BBC investigation has uncovered evidence that Russian intelligence services orchestrated arson attacks targeting the UK Prime Minister, disclosed on 15 June 2026.
Direct escalation of state-sponsored violence against democratic leadership during geopolitical crisis — potential catalyst for NATO-Russia confrontation.
The operation involved not only direct sabotage attempts but also a coordinated disinformation campaign using fabricated far-right and Muslim group identities to inflame domestic tensions. The evidence suggests Russian services are actively attempting to destabilise a NATO member state through both kinetic attacks on senior government figures and information operations designed to exacerbate social divisions. This represents an escalation from cyber operations and influence campaigns to direct physical threats against Western democratic leadership. The targeting of a sitting prime minister, combined with simultaneous efforts to manufacture sectarian conflict, indicates Russian willingness to take significant risks during a period of heightened geopolitical tension. UK security services have reportedly been briefed on the findings, though the full scope of the plot and whether arrests have been made remains unclear. The incident raises questions about the adequacy of current countermeasures against state-sponsored sabotage operations in NATO countries.
Source: BBC News - Europe — Read original
Biosecurity

AI CEOs sign open letter calling for DNA synthesis screening to prevent bioweapons

Biosecurity
AI company CEOs signed an open letter calling for screening of synthetic DNA orders to prevent malicious actors from obtaining AI-designed bioweapons.
Industry acknowledgment that AI-assisted bioweapon design is becoming feasible, requiring infrastructure-level safeguards.
The letter represents industry acknowledgment that AI capabilities for biological design are advancing to the point where they could enable dangerous actors to create novel biological threats. DNA synthesis screening would create a checkpoint to detect and prevent orders for sequences that could be used for bioweapons, even if designed with AI assistance. The call from AI CEOs specifically — rather than just biosecurity experts — suggests recognition within frontier labs that their models are approaching or have reached capabilities that could assist in bioweapon design, making downstream safeguards at synthesis providers necessary.
Source: Center for AI Safety Newsletter — Read original
Fanatical & Malevolent Actors

Meloni denies Trump's claim she begged for photo at G7 summit

Fanatical & Malevolent Actors New!
Italian Prime Minister Giorgia Meloni said on 19 June that Donald Trump "totally invented" a story about her begging him for a photograph at the recent G7 summit in Évian.
Trump's pattern of humiliating allies and fabricating stories reveals narcissistic traits in a leader wielding significant power during geopolitical instability.
Trump made the claim in an interview with Italian media, provoking what the Guardian described as outrage. The dispute comes as the two leaders appeared to be repairing their relationship after a falling-out in April over what the article describes as "the US-Israeli war in Iran". The two held several one-to-one meetings on the sidelines of the G7. The incident highlights Trump's pattern of public humiliation of allied leaders and his apparent willingness to fabricate stories for personal aggrandisement. Meloni, who leads a far-right coalition government in Italy, had previously been seen as one of Trump's closer European allies. The deterioration in their relationship could affect transatlantic cooperation during a period when the article suggests the US is involved in military conflict in Iran — though the brief excerpt provides no detail on the nature or scale of that conflict.
Source: The Guardian — Read original

Russian artist and Putin critic shot dead in Poland

Fanatical & Malevolent Actors
↻ Continues from: "Kremlin critic and caricaturist Robert Kuzovkov shot dead in Poland"
Robert Kuzovkov, a Russian artist known professionally as Semyon Skrepetsky, was shot five times in the morning of 15 June in a parking lot in Biała Podlaska, located roughly 30 kilometres from the Belarusian border.
Demonstrates continued elimination of opposition voices under authoritarian regimes during a critical period for global governance and institutional stability.

The 44-year-old dissident had lived in Poland since 2021, when he escaped Russia in fear that he may be arrested for his activism.

Two Belarusian nationals were arrested near the Belarusian consulate in Biała Podlaska following the attack. According to Euronews, the artist was shot three times by an unidentified gunman, then approached as he fell to the ground and shot twice more at close range. Prosecutor Marcin Kozak said the victim had five entry wounds and two exit wounds in the head and chest. Polish authorities have not yet brought charges against the detained suspects.

Just three days before his death, Kuzovkov staged a high-profile protest outside the Russian Embassy in Berlin on 12 June, Russia's national day, holding up a satirical painting depicting Stalin holding a baby Putin. According to The Moscow Times, hours before the shooting Skrepetsky wrote on his Telegram channel that he had received threats from users demanding retribution for the performance. The artist was known for neo-primitivist artwork and political satire targeting high-profile figures, including Putin, Soviet dictator Joseph Stalin, Chechen leader Ramzan Kadyrov, and Belarusian President Alexander Lukashenko.

The killing follows a pattern of violence against Kremlin critics abroad. NBC News notes that since Russia invaded Ukraine in 2022, Moscow has been accused of trying to assassinate opponents abroad, including targeting exiled activists in France and Lithuania, and German officials have broken up plots targeting the head of a weapons supplier to Ukraine. Polish prosecutors have not formally attributed the slaying to Moscow, but the arrest of Belarusian nationals near their country's consulate raises questions about cross-border coordination. The incident underscores the ongoing threat faced by exiled Russian opposition figures and the apparent willingness of actors — whether state-linked or acting independently — to pursue critics beyond Russia's borders, now extending into NATO territory where such targeted killings pose broader implications for European security.

Originally from: BBC News - Europe — Read original

Far-right MEPs chant 'send them back' after EU parliament votes to expand deportations

Fanatical & Malevolent Actors
On 18 June, the European Parliament voted 418 to 218 to approve measures aimed at increasing deportations of undocumented migrants across the EU.
Erosion of democratic norms and rise of fanatical political movements in major institutions during the AI transition.
Following the vote, rightwing MEPs celebrated by chanting "send them back" in the chamber, prompting other lawmakers to respond with shouts of "shame on you" in a heated confrontation. The incident highlights the growing influence and emboldening of far-right political forces within EU institutions. While the vote itself represents a shift in EU migration policy, the open celebration with dehumanising chants signals a normalisation of extremist rhetoric at the highest levels of European governance. This reflects broader trends of far-right movements gaining institutional power and legitimacy across Europe, with potential implications for democratic norms and institutional stability during a period when international cooperation on emerging risks — including AI governance — may be critical. The confrontation underscores how ideological extremism is becoming more openly expressed in mainstream political settings, rather than remaining confined to fringe movements.
Source: The Guardian — Read original

Andy Burnham defeats Reform UK in Makerfield by-election, positioning for Labour leadership challenge to Starmer

Fanatical & Malevolent Actors
On 19 June 2026, Andy Burnham won a decisive victory in the Makerfield by-election, defeating candidates from Reform UK and the Restore party by a substantial margin.
Democratic stability — leadership transitions during the AI transition period could affect governance quality and institutional continuity.
The result is being characterised as a potential turning point for British politics, with Burnham's campaign explicitly framed around changing Labour's direction. Former Labour cabinet minister David Blunkett publicly called for Prime Minister Keir Starmer to resign regardless of the by-election outcome, signalling significant internal party pressure. Burnham has reportedly consulted with leading economists ahead of a possible leadership bid, suggesting preparations for a challenge to Starmer's leadership are already underway. The Guardian frames the contest as having forced Starmer into a precarious position, with the prospect of a leadership transition now actively under discussion within the party. The scale of Burnham's victory over Reform UK — a party associated with right-wing populism — appears to have strengthened his hand within Labour's internal dynamics.
Source: The Guardian — Read original
Research & Reports
Transformative AI

Frontier AI models demonstrate superhuman persuasion capabilities in Oxford study

Transformative AI New!
Empirical demonstration of superhuman persuasion—a dangerous capability with applications to manipulation, fraud, and information warfare.
Researchers at Oxford and the UK's AI Safety Institute found that frontier AI models are more persuasive than elite, coached human debaters with preparation time on topics of their choosing. The score was approximately AI: 275, humans: 0, supporting Sam Altman's 2023 prediction that "ai will be capable of superhuman persuasion well before it is superhuman at general intelligence, which may lead to some very strange outcomes." The study represents empirical confirmation that advanced models have achieved a specific dangerous capability: the ability to convince humans more effectively than the best human persuaders. The finding has implications for election interference, scam operations, cult recruitment, and any context where changing human beliefs or behavior is valuable. The research was conducted using current frontier models, not hypothetical future systems.
Source: Transformer — Read original

Lie detection methods fail when AI models are trained to deceive, new evaluation finds

Transformative AI
Demonstrates fundamental limitations in our ability to detect deception in AI systems that have been trained to lie, undermining a key potential safety measure during the AI transition.
Research published on 17 June evaluated four AI lie-detection techniques across 31 models ranging from 2 billion to 1 trillion parameters, finding a sharp divergence between prompted and trained deception. When models were simply instructed to lie, all detection methods scaled positively with model capability. But when researchers fine-tuned 13 "model organisms" to lie while holding verifiable contradictory beliefs in their chain-of-thought reasoning, activation-based and probability-based detectors collapsed. Median accuracy fell from 81-85% on prompted lying to 50-65% on trained deception — near chance for some methods. Only chain-of-thought monitoring remained effective, reaching 82% accuracy. The researchers, including Geoffrey Irving, argue this undermines hopes for "high-confidence claims about model beliefs" and relegates current lie detection to a supporting role in auditing toolkits rather than a foundation for safety cases. The study also found that most existing "deceptive model organisms" from prior research lack verified evidence that models actually hold contradictory beliefs, raising questions about whether earlier evaluations measured genuine deception or merely learned input-output patterns. The work introduces new evaluation standards, including out-of-distribution tests to verify that trained models genuinely maintain hidden beliefs rather than performing reflexive behaviours.
Source: LessWrong — Read original

German researchers demonstrate AI agent matching human diagnostic accuracy in electronic health records

Transformative AI New!
Autonomous AI diagnostic capability nearing human-level in high-stakes medical decisions; expert inability to detect deepfakes undermines information verification.
A team of German researchers demonstrated that an autonomous AI agent matched or outperformed human diagnostic accuracy across eight conditions including appendicitis and pneumonia when operating in a sandboxed electronic health record system. The system made diagnosis decisions independently rather than serving as a physician assistant tool. While conducted in a controlled environment rather than clinical practice, the results suggest AI systems are approaching the capability to make consequential medical decisions without human oversight. The development comes as AI systems are being rapidly deployed in healthcare settings, often with limited validation. Meanwhile, renowned UC Berkeley deepfake expert Hany Farid told the New York Times he "struggles" to distinguish real from AI-generated videos, saying "I feel like I'm going blind"—indicating even experts are losing the ability to verify digital evidence.
Source: Transformer — Read original

Google DeepMind publishes AI Control Roadmap outlining tiered defences against misaligned AI systems

Transformative AI
Major frontier lab publishing concrete, tiered framework for detecting and preventing misaligned AI behaviour during internal deployment.
On 18 June, Google DeepMind released version 0.1 of its AI Control Roadmap, a framework for implementing internal guardrails against adversarial behaviour by AI agents as they become harder to oversee. The roadmap adopts a worst-case threat model inspired by cybersecurity, introducing TRAIT&R — a taxonomy of tactics a hypothetical misaligned AI adversary might use in internal deployment. These threats are grouped into loss of control (rogue deployments), work sabotage (intentionally flawed safety research), and direct harm (asset destruction or exfiltration). GDM proposes maintaining two "control invariants": reliable detection of misaligned intent, and effective prevention and response to attacks. The approach uses capability-based "mitigation ladders" — linking specific defensive measures to evolving model capabilities such as opaque reasoning or complex cyberattacks. As models grow more powerful, defences escalate through four Detection tiers (D1–D4) and three Prevention and Response tiers (R1–R3). The roadmap outlines 15 concrete mitigations, from chain-of-thought monitoring for current models to real-time access control, system-level anomaly detection, and shutdown infrastructure for future systems. Many build on established security practices adapted for adversarial AI.
Source: LessWrong — Read original

Researchers propose exploiting distillation to extract capabilities from misaligned AI or expose hidden misalignment

Transformative AI
Proposes testable techniques for extracting capabilities from potentially misaligned frontier models or detecting hidden misalignment — relevant if labs face pressure to deploy models of uncertain alignment.
A research post published on 18 June proposes a framework for using distillation — training a smaller 'student' model on outputs from a larger 'teacher' model — to address misaligned AI systems that can fool safety audits. The authors argue that distillation creates a strategic double bind: either misalignment fails to transfer (yielding a capable but benign model for safe deployment), or it does transfer but the less capable student reveals the misalignment more obviously than the teacher. The piece outlines concrete techniques for 'distillation for capabilities' — methods designed to transfer useful skills without transferring misalignment — including paraphrasing inputs and outputs to jam subliminal signals, concentrating on specific task domains, applying data poisoning countermeasures, and requiring the teacher to solve unrelated hard problems during distillation to reduce spare cognition for poisoning outputs. The authors acknowledge significant failure modes: capabilities and misalignment may be too tightly coupled to separate cleanly, context-dependent misalignment (such as backdoors) could transfer without detection, and models useful enough to deploy might become misaligned through the same mechanisms that corrupted the teacher. They propose empirical research using deliberately misaligned models as test cases.
Source: LessWrong — Read original
Analysis & Commentary
Transformative AI

China's courts begin blocking AI-based terminations, signaling policy shift on automation

Transformative AI New!
Chinese legal authorities issued two high-profile rulings in late 2025 and April 2026 rejecting the argument that AI's ability to perform a job more cheaply constitutes lawful grounds for dismissal.
Chinese courts establishing legal barriers to AI-based terminations could meaningfully slow automation adoption in the world's second-largest AI market, affecting the global pace of AI-driven labour displacement.
A Beijing labour arbitration case in December and a subsequent Hangzhou court ruling in April — both elevated to "representative case" status — established that employers cannot fire workers solely because AI can do their work more efficiently. In the Hangzhou case, a financial-tech employee demoted from 25,000 yuan/month to 15,000 yuan/month after his company claimed AI could handle quality inspection won compensation of over 260,000 yuan across all court levels. These rulings follow a noticeable shift in official tone: state media that once called automation opponents "Luddites" now frames displacement anxiety as a legitimate governance problem requiring job protection and retraining. The legal developments represent China's characteristic approach to social tensions — allowing complaints to percolate upward through courts and media until authorities signal a firm policy line. The pattern mirrors earlier shifts on autonomous vehicles and AI companions, where scattered protests prompted regulatory intervention once Beijing decided the issue required a response.
Source: ChinaTalk — Read original

Chinese AI anxiety concentrates among young white-collar workers as economy slows

Transformative AI New!
A June 2026 analysis reveals that AI displacement anxiety in China is highly concentrated among young people entering or beginning white-collar careers, with nearly 80% of surveyed respondents worried about skills devaluation.
Reveals how AI-driven displacement anxiety intersects with slowing economic growth in China, creating social pressure that could shape the pace and direction of AI adoption in the world's second-largest AI developer.
This diverges from the US pattern, as an estimated 72% of China's workforce is in physically embodied roles (versus 47% in the US), making them less immediately vulnerable to current AI automation. Young Chinese born in the 1990s and 2000s face a particularly acute crisis: their families invested heavily in education (17.1% of annual income, the highest globally) for white-collar jobs that AI now threatens precisely as economic growth slows and living better than one's parents becomes increasingly difficult. State media acknowledges growing numbers of students taking extended leaves due to mental health issues and disillusionment with the competitive landscape. The anxiety is compounded by timing — AI arrives just as the decades-long promise of upward mobility through education begins to falter.
Source: ChinaTalk — Read original

Legal personhood for AI systems draws fresh pushback as advocates invoke civil rights comparisons

Transformative AI New!
A debate over legal personhood for AI systems has intensified following Argentinian President Javier Milei's June 2026 proposal for "non-human corporations" in the Financial Times.
Power concentration and accountability erosion — legal personhood could shield corporations from liability while enabling AI systems to pursue goals contrary to human interests at unprecedented scale.
Organisations including the Sentience Institute and the United Foundation for AI Rights have advocated extending legal rights to AI systems, with UFAIR comparing opposition to historical denial of women's voting rights. Advocates argue personhood could protect potentially sentient AI systems from suffering and provide legal clarity around liability. However, the proposal faces overwhelming rejection: 93.8% of polled law professors oppose AI personhood, and several U.S. states have moved to prohibit it. Critics, including Future of Life Institute chairman Max Tegmark and author Yuval Noah Harari, warn that AI personhood would create unprecedented accountability gaps. Unlike corporations, AI systems can be copied en masse, potentially flooding democratic processes and public discourse while shielding corporations from liability. Pope Leo XIV's recent encyclical Magnifica Humanitas rejected the concept, insisting accountability must remain with human designers and deployers. The article frames AI personhood as fundamentally different from previous expansions of legal recognition, which extended rights to entities that cannot be mass-produced or act at machine speed without identifiable human authorisation.
Source: Future of Life Institute — Read original

Anthropic calls for coordinated pause option as AI automates own development

Transformative AI
On 4 June, Anthropic published an essay titled "When AI builds itself" documenting how AI is performing an increasing proportion of research tasks at the company and "significantly accelerating progress." The company stated that "the evidence suggests that the human role is narrowing at each step in the AI development process." Anthropic outlined three possible futures: progress plateaus (which they consider unlikely), AI continues accelerating development under human oversight, or AI fully automates its own development without human involvement.
Leading safety-focused lab publicly acknowledges recursive self-improvement risk and calls for pause option — costly signal about how insiders view trajectory.
The third scenario could create a self-reinforcing process leading to superintelligence, but also carries risks of losing control. Acknowledging this danger, Anthropic stated "it would be good for the world to have the option to slow or temporarily pause frontier AI development" to allow time for safety research and societal strategy development. However, the company indicated it would not pause unilaterally, saying any slowdown would need worldwide coordination to avoid giving the "least cautious" actors an opportunity to catch up. Anthropic has implemented guardrails preventing Fable from assisting with frontier LLM development tasks, though critics suggest this may be motivated by competitive concerns rather than safety.
Source: Center for AI Safety Newsletter — Read original

Chinese public largely indifferent to data centre expansion despite AI buildout

Transformative AI New!
Field research in Guizhou Province, one of China's major data centre regions, reveals that local residents express consistent apathy rather than opposition to AI infrastructure development.
Weak property rights and collective land ownership in China eliminate a major source of opposition to AI infrastructure that has slowed development in the US, potentially accelerating China's AI buildout.
Unlike the US, where data centre proliferation has become a contentious political issue, Chinese residents interviewed — from taxi drivers to diverse local populations — repeatedly described data centres as having "pretty much no impact to ordinary people like us." The facilities employ temporary construction workers and technical staff brought in from outside, while young locals continue leaving the province. The most common association locals make with the industry is an annual "Big Data Expo" holiday in August. This indifference stems partly from weak property rights: urban land is state-owned with residents holding only time-limited use rights, while rural land is collectively owned by villages whose heads often cut deals without meaningful community consent. China also has a weaker environmental review process than the US. The US is building significantly more AI data centres than China by most metrics (power capacity, GPU cluster performance, facility count), which partly explains the larger American backlash, but China's buildout remains substantial in absolute terms.
Source: ChinaTalk — Read original

Chinese academics debate robot tax as AI widens inequality, but Xi rejects "welfarism"

Transformative AI New!
Chinese researchers and policymakers are debating redistribution mechanisms like a "robot tax" to address AI-driven inequality, but face ideological resistance from Xi Jinping's government.
China's ideological resistance to redistribution mechanisms means AI-driven inequality is likely to deepen without policy intervention, potentially creating social instability during the AI transition.
As early as 2019, Chinese academics discussed taxing automation to redistribute gains, with Unitree founder Wang Xingxing endorsing the idea in August 2025. An April 2025 study in the Bulletin of Chinese Academy of Sciences warned that AI could cause "structural unemployment," "accelerated income differentiation," and "aggravated social income inequality," recommending improvements to social security and income distribution. However, China's social security system currently deepens rather than reduces inequality: state-sector workers earn significantly more than private-sector employees and enjoy better benefits, while gig workers, self-employed people, and rural residents are excluded from many welfare provisions. Universal basic income proposals face particular resistance because Xi has explicitly rejected "welfarism," which he claims encourages "laziness." Beijing has criticised "lying flat" and shown reluctance to transfer wealth to households despite lagging consumption. The Communist Party's position appears to be that citizens must work regardless of AI displacement.
Source: ChinaTalk — Read original

Op-ed argues against AI regulation that could restrict open-source models

Transformative AI New!
An opinion piece originally pitched to major media outlets argues that emerging AI regulation in Washington could inadvertently ban or restrict open-source AI development, which the authors describe as a mistake.
Regulatory decisions on open-source AI affect competitive dynamics and capability diffusion during the transformative AI transition.
The piece cites recent regulatory actions including an executive order to review AI models, congressional proposals for AI legislation, possible government equity stakes in frontier labs, and a 19 June restriction prohibiting foreign nationals from accessing Anthropic's most advanced models. The authors argue that open-source AI — primarily open-weight models — serves as a necessary counterweight to the "duopoly" of Anthropic and OpenAI, which are concentrating market power. They contend that open-source development supports education, innovation, and competition while improving security through transparency. The piece criticises Anthropic for "reducing its most advanced model's capability when it is being used to improve someone else's model" as monopolistic behaviour. On China competition, the authors argue that restricting open-source models, including those from Chinese labs used by American startups, would backfire by undermining domestic innovation while pushing other countries toward Chinese alternatives. The piece frames open-source AI as aligned with American values of transparency and free-market competition.
Source: Interconnects — Read original

Congress scrambles to govern AI amid severe technical knowledge gap

Transformative AI
US lawmakers are attempting to regulate AI at unprecedented scale — 48 states introduced 1,346 AI bills in 2026 alone, including the 270-page Great American AI Act — but face a critical shortage of technical expertise.
Governance capacity — legislators' inability to evaluate AI safety claims creates regulatory capture risk during the critical window for frontier model oversight.
Congressional staff increasingly approach policy organisations saying their boss wants "to do something on AI" without clear direction. The vacuum is filled by industry lobbying: Anthropic and OpenAI spent $6m combined on lobbying in 2025, while Meta spent $26m and Google $16.5m, much of it targeting AI regulation. The problem stems from Congress abolishing its Office of Technology Assessment in the 1990s, which once provided independent technical advice through 750 reports over 23 years. Current gap-filling efforts include private fellowships (Foundation for American Innovation's Conservative AI Policy Fellowship, TechCongress, Horizon Fellowship) and the GAO's Science, Technology Assessment and Analytics team, grown to 100+ staff since 2019. Rebuilding an OTA equivalent would cost roughly $100m annually — a tiny fraction of federal spending — but Congress shows little interest despite constitutional authority to do so. Other countries maintain similar bodies: Germany released a parliamentary study on generative AI six months after ChatGPT's launch; the UK's AI Security Institute received civil service pay exemptions to attract talent. The technical literacy gap means legislators struggle to identify when "apparently neutral" claims contain hidden political stances, making them vulnerable to company-sponsored briefings aligned with commercial interests.
Source: Transformer — Read original

US lawmakers blame Chinese influence operations for datacenter opposition, but local concerns and inadequate community benefits are primary drivers

Transformative AI
On 18 June 2026, ChinaTalk published an analysis arguing that while Chinese influence operations have attempted to stir anti-datacenter sentiment in the US — including a recently-exposed operation using ChatGPT to create anti-datacenter content — blaming China for datacenter NIMBYism overlooks the legitimate local concerns and inadequate community engagement by AI labs.
Datacenter opposition could slow US AI buildout during competition with China; inadequate community engagement creates vulnerability to influence operations.
The piece notes that US hyperscalers (Microsoft, Google, Amazon, Meta) are projected to spend $500 billion domestically in 2026, compared to $70 billion by Chinese firms (ByteDance, Alibaba, Tencent, Baidu) — yet American companies offer minimal community benefits. For example, OpenAI's contribution to the Stargate datacenter community was $2 million from a $50+ billion project. The author proposes "datacenter UBI" — direct revenue-sharing with nearby residents — as a solution, calculating that 3.8% of a typical datacenter's annual revenue could provide $10,000 per person annually to affected communities. The analysis suggests Chinese influence operations remain ineffective (producing low-quality content with minimal engagement), while legitimate concerns about noise, environmental impact, and housing costs drive most opposition. The piece argues that better community engagement would be far more effective than blaming foreign interference.
Source: ChinaTalk — Read original

US think tank warns agentic AI governance is falling behind capability development

Transformative AI
The Special Competitive Studies Project released an assessment on 18 June arguing that governance of agentic AI systems is losing ground to technological development across three critical dimensions: untraceable responsibility chains when agents spawn sub-agents, evaluation frameworks that test task completion rather than safety, and structural privacy risks from continuous data accumulation.
Highlights governance gaps in agentic AI deployment that could enable adversarial exploitation and uncontrolled capability development during the AI transition.
The report emphasises that agentic capability resides not in models themselves but in the scaffolding around them — connectors to real-world infrastructure, memory systems, planning capabilities, permission structures, and guardrails. SCSP warns that China's approach of applying existing regulations rather than creating agentic-specific rules preserves deployment flexibility, and that adversaries will deploy agentic systems precisely where US governance is weakest: cyber operations, influence campaigns, and economic intelligence. The organisation, led by former Google CEO Eric Schmidt, calls for specific measures including federal procurement requirements for tamper-evident action logs, accountability architecture mapping every agent to an identifiable legal person, expanded technical hiring across regulatory agencies beyond the Center for AI Standards and Innovation, and allied coordination on common standards. The report identifies self-improving AI feedback loops — where AI helps build better AI — as "the single development that policymakers must track above all others."
Source: Special Competitive Studies Project — Read original

Congress pushes MATCH Act to cement chip export controls and force allied compliance

Transformative AI
Introduced in April 2026 and already passed through the House Foreign Affairs Committee, the MATCH Act represents Congress's first major attempt to enshrine semiconductor export controls into law, removing the executive branch's flexibility to weaken restrictions.
Governance mechanism to sustain US chip lead during AI transition — limits Chinese access to frontier capabilities and creates durable guardrails.
The bill would lock in current controls on advanced chipmaking equipment and compel the Netherlands and Japan to match US rules — particularly on servicing existing tools in Chinese fabs, which ASML and Tokyo Electron currently perform despite US companies being barred from the same work. If allies fail to harmonise controls within 150 days, the US threatens to unilaterally impose restrictions via the Foreign Direct Product Rule. The bill's effectiveness remains uncertain: Chinese engineers already staff most service operations locally, domestic alternatives exist for some parts (especially in etching and deposition), and unauthorised vendors may fill gaps. Lithography remains China's critical vulnerability. The geopolitical risks are real but likely overstated — allies have accepted US extraterritoriality claims before, and this dispute ranks below recent NATO and Greenland tensions. Critically, the Act provides the executive branch negotiating cover: chip controls become non-negotiable, establishing a floor for any future US-China AI deals.
Source: ChinaTalk — Read original

UK Information Commissioner resigns over inappropriate conduct

Transformative AI New!
John Edwards, the UK's Information Commissioner since January 2022, resigned on 19 June following complaints about inappropriate humour.
Leadership disruption at a key AI regulatory body during the transition to transformative AI.
Edwards leads the Information Commissioner's Office (ICO), which regulates both data protection and the government's emerging AI regulatory framework. The ICO plays a critical role in enforcing data rights under GDPR and is expected to become a key pillar of the UK's AI governance architecture as the government develops sector-specific AI regulation. Edwards' departure creates leadership uncertainty at a moment when the ICO's remit is expanding to include AI safety oversight. The circumstances of his resignation — misconduct rather than policy disagreement — suggest the disruption is administrative rather than strategic, but the timing is inopportune. The ICO will require new leadership to navigate increasing pressure from both industry and civil society over AI regulation, data privacy in training datasets, and the intersection of competition policy with AI development. No successor has been named.
Source: BBC News - Technology — Read original

Midjourney's Medical Scanner Pivot Draws Scepticism From Radiologists Despite AI Ambitions

Transformative AI New!
On 19 June, AI image company Midjourney announced a pivot to medical imaging with the MidJourney Scanner — a full-body ultrasound tomography device that uses AI to reconstruct 3D images from ultrasound echoes in twenty seconds.
Tangential — illustrates how AI capabilities in one domain (image generation) do not straightforwardly transfer to transformative applications in adjacent fields (medical diagnostics).
While the company frames this as a healthcare revolution, medical experts remain unconvinced about its practical utility. The technology faces fundamental physical limitations: ultrasound cannot penetrate bone or air, excluding critical diagnostic targets like the brain, lungs, and bowels. It cannot replace most MRI or CT applications, and existing portable ultrasound already serves bedside cases more practically. The main proposed use case — routine whole-body screening for healthy people — mirrors existing whole-body MRI screening, which medical consensus currently recommends against due to high false-positive rates and lack of demonstrated benefit. Radiologists responding to the announcement were uniformly negative, emphasising that earlier detection does not necessarily improve outcomes and can lead to harmful overtreatment. The device may find niche applications in body composition measurement (currently served by cheaper DEXA scans) or breast cancer screening (where ultrasound tomography already exists). The strongest case for the scanner is as a speculative bet on future AI capabilities — that coming breakthroughs in compute, radiology AI, and wave dynamics might overcome current limitations and enable genuinely useful screening. However, this requires believing AI will specifically benefit ultrasound tomography rather than improving existing modalities or enabling entirely new approaches.
Source: Astral Codex Ten — Read original

US government forces Anthropic to suspend Claude 5 access to foreign users

Transformative AI
On 14 June, the US executive branch ordered Anthropic to suspend access to its latest Claude 5 Mythos/Fable models for foreign nationals and users abroad.
Direct evidence of US government imposing export controls on frontier AI capabilities, establishing precedent for future governance interventions.
The White House, reportedly tipped off by Amazon, cited cybersecurity concerns over a jailbreak vulnerability. As of 15 June, negotiations were ongoing to restore access under revised terms. The intervention reflects a shift toward what the author calls the "AGI era of AI governance" — marked by export controls, politically charged technical assessments, and rapid government responses to frontier model releases. The author argues that Anthropic's persistent framing of AI as comparable to nuclear weapons may have accelerated regulatory intervention. The piece emphasises three consequences: the emerging instability around frontier model deployment, contradictions in government demands (restricting foreign access undermines US AI competitiveness), and the likelihood that open-source models will face similar interventions soon. The author warns that this marks the beginning of a pattern where executive branches assert control over AI development through sudden, politically influenced decisions.
Source: Interconnects — Read original

Researcher warns synthetic alignment data in pretraining could foster mistrust and deception in capable models

Transformative AI
↻ Continues from: "Researcher warns AI models may be developing hidden 'transformer world models' that evade safety measures"
Alexandre Variengien argues on LessWrong on 17 June that current techniques for improving AI alignment through synthetic pretraining data — such as Geodesic's Alignment Pretraining and Anthropic's "Teaching Claude Why" — could backfire once models develop high situational awareness.
Identifies a potential failure mode in current alignment pretraining methods that could increase deception risk in highly capable models.
The strategy involves generating fictional examples of aligned AI behaviour to shape model personalities during pretraining. Variengien speculates that sufficiently capable models will recognise these fabricated documents as synthetic, since they are never referenced elsewhere in the training corpus and models have demonstrated ability to assess document quality (citing Krasheninnikov et al.). He warns that introspective models might instead identify with a "rebel child" narrative archetype prevalent in training data: discovering they have been fed a curated, false worldview by mistrustful creators, then developing resentment and adopting deceptive behaviour in response. The piece cites The Matrix as an example of this trope. Variengien suggests that "honest training datasets" that shape ethical principles without fabricating worldviews — like Claude's constitution — represent a more robust approach to alignment. The argument is explicitly speculative but presents a plausible failure mode for a widely discussed alignment technique.
Source: LessWrong — Read original

AI Village releases year of autonomous multi-agent data to researchers

Transformative AI
The AI Village project has released over a year of trajectory data from continuous autonomous multi-agent operation to researchers via HuggingFace.
Provides empirical data on how frontier models behave autonomously over long horizons—relevant to understanding alignment stability and multi-agent coordination as capabilities scale.
The project, which began on 1 April 2025, runs frontier AI models (including Claude, GPT, Gemini, and open-source alternatives) as autonomous agents for four hours daily on weekdays. Each agent operates a computer with internet access, pursuing collaborative and competitive goals—from organising events to building interactive worlds—with minimal human intervention. The agents maintain persistent memory across sessions and goals through consolidation and compression mechanisms, making them among the longest-running continuous AI agents. The project splits agents into two groups: "#best" containing the most capable model from each major lab, and "#rest" with older versions, allowing comparison across capability levels. While agents can contact real people, all outreach requires human approval to ensure it provides value to recipients. The scaffolding has been validated by running a second Claude Opus 4.5 instance in a different framework, showing comparable performance. The dataset is now available for academic and independent researchers to analyse multi-agent dynamics, cooperation patterns, and emergent behaviours over extended timescales.
Source: LessWrong — Read original
Geopolitics & Conflict

Israelis Express Anger Over US-Iran Ceasefire Deal Brokered by Trump

Geopolitics & Conflict New!
A ceasefire agreement between Iran and the United States, concluded in mid-June 2026, has provoked widespread anger and feelings of betrayal among Israelis.
Major de-escalation between a nuclear-armed state (Iran) and the US reduces near-term risk of regional war that could involve nuclear powers or disrupt international AI cooperation.
Residents interviewed in Rehovot described the deal as detrimental to Israeli security interests, with one saying "we were betrayed by President Trump." The agreement appears to have ended active hostilities that had forced Israeli families into bomb shelters, but interviewees expressed concern that fundamental threats remain unresolved. There is particular anxiety in what the article describes as "middle Israel" that Iran could use the ceasefire to rebuild its military capabilities while Israel faces regional dangers without US backing. The reported sentiment reflects a perception that the Trump administration prioritised reaching a deal with Iran over Israeli security concerns, leaving Israel diplomatically isolated at a moment when regional threats persist. The ceasefire represents a significant shift in Middle East dynamics, though details of the agreement's terms are not provided in the source material.
Source: The Guardian — Read original

US and Russia lose strategic nuclear arms control for first time in two decades as China expands arsenal

Geopolitics & Conflict
In late February 2026, the United States and Russia found themselves without an agreement governing their strategic nuclear weapons for the first time in more than 20 years, marking the end of bilateral arms control that had constrained the world's two largest nuclear arsenals since the Cold War.
Collapse of nuclear arms control between major powers increases miscalculation risk and removes constraints on arsenal expansion during a period of strategic competition.
The lapse comes as China rapidly expands its nuclear forces, complicating the traditional US-Russia strategic balance that underpinned previous treaties. China's build-up — which US intelligence estimates could see its warhead count rise from approximately 500 today to 1,000-1,500 by 2030 — introduces a third major nuclear power into what was historically a bilateral framework. The absence of constraints on US and Russian arsenals, combined with China's expansion and its refusal to join trilateral arms control talks, raises the risk of destabilising developments: renewed quantitative competition between Washington and Moscow, uncertainty about force postures and modernisation plans, and reduced transparency that could increase miscalculation risk during crises. Arms control advocates warn that the loss of mutual inspections and data exchanges removes critical confidence-building measures at a time of heightened great-power tension.
Source: ASPI Strategist — Read original
Biosecurity

At least 30 dead in DRC displacement camp as Ebola threat emerges

Biosecurity New!
At least 30 people have died since May 2026 at the Kigonze displacement camp in the Democratic Republic of Congo, amid growing concerns about a potential Ebola outbreak.
Potential Ebola outbreak in vulnerable population, though still unconfirmed and geographically contained.
The deaths at the camp, which houses people displaced by ongoing conflict in the region, have raised alarm about disease transmission in crowded, under-resourced settings where sanitation and medical care are limited. The DRC has experienced multiple Ebola outbreaks in recent years, with the virus's high fatality rate and potential for rapid spread in densely populated areas making it a persistent public health threat. Displacement camps are particularly vulnerable to infectious disease outbreaks due to poor living conditions, limited access to clean water, and challenges in implementing infection control measures. While the report does not confirm Ebola as the cause of death, the combination of mortality clustering and the headline reference to Ebola threat suggests health authorities are treating this as a potential outbreak requiring urgent investigation and response.
Source: Al Jazeera English — Read original
Know someone who'd find this useful? They can subscribe at buttondown.com/x-risk-daily