X-Risk Daily

Sunday 21 June 2026
33 news · 4 research · 17 analysis · 5 updates from yesterday

Iran closes Strait of Hormuz as Israel strikes Lebanon

Geopolitics & Conflict New!
Iran has again declared the Strait of Hormuz closed on 20 June in response to continued Israeli military operations in Lebanon, marking the latest escalation in a volatile diplomatic process that has seen the critical waterway repeatedly shuttered and partially reopened since late February.
Major escalation in Middle East instability, raising risk of great-power military confrontation and potential disruption to global energy security during the AI transition period.

Iran has again declared the Strait of Hormuz closed on 20 June in response to continued Israeli military operations in Lebanon, marking the latest escalation in a volatile diplomatic process that has seen the critical waterway repeatedly shuttered and partially reopened since late February. The closure threatens to derail fragile negotiations in Switzerland aimed at converting a memorandum of understanding signed this week between the United States and Iran into a permanent agreement.

The Strait, through which approximately 20-25% of global seaborne oil and liquefied natural gas passes under normal conditions, has been effectively blocked by Iran since 28 February, when the United States and Israel launched airstrikes against Iranian targets and assassinated Supreme Leader Ali Khamenei. The Islamic Revolutionary Guard Corps has warned vessels attempting passage would be shot at, boarded merchant ships, and laid sea mines throughout the waterway. Iran briefly reopened the strait in mid-April following a short-lived ceasefire, but reimposed the closure days later after the United States announced a blockade of Iranian ports.

The memorandum of understanding signed on 17 June between President Trump and Iranian President Masoud Pezeshkian had called for the immediate cessation of military operations on all fronts, including in Lebanon, and the full reopening of the strait without Iranian-imposed tolls for at least 60 days. However, Israeli strikes in southern Lebanon have continued despite the agreement. Al Jazeera reported that Israeli forces launched more than 100 airstrikes on southern Lebanon on 20 June alone, killing at least 32 people and threatening to collapse follow-up talks between US and Iranian representatives scheduled for 22 June in Burgenstock, Switzerland. Israel was not a signatory to the US-Iran agreement, and Newsweek reports that Prime Minister Benjamin Netanyahu and members of his cabinet have vehemently opposed withdrawing Israeli Defence Forces from Lebanese territory.

The simultaneous crises over the Strait and Lebanon underscore how rapidly the regional conflict could spiral beyond diplomatic containment. Iran views a ceasefire in Lebanon as essential to the broader negotiating process, warning that continued Israeli strikes constitute violations of the framework agreement. The United States has disputed Iran's latest closure declaration, with US Central Command stating that traffic continues to flow through the waterway. Vice President JD Vance said 16 million barrels of oil transited the strait on 19 June, describing it as a record. Trump warned in a social media post that the US could impose its own tolls on Strait passage if a final deal is not reached within 60 days, claiming the charge would be for services rendered.

The ongoing instability has already triggered significant economic disruption. Oil prices surged above $100 per barrel from pre-war levels of approximately $65 following the initial February closure, according to Al Jazeera. War-risk insurance premiums for vessels transiting the strait have increased from 0.125% to between 0.2% and 0.4% of ship insurance value per transit, representing an additional quarter of a million dollars for very large oil tankers. Israel's war in Lebanon has killed more than 4,000 people since March and displaced over one million, according to Lebanese authorities.

Originally from: Al Jazeera English — Read original

Solid-state transformer startups raise $140 million to replace iron-and-copper grid equipment with silicon

Transformative AI New!
Two solid-state transformer startups have raised a combined $140 million to commercialize technology that replaces conventional iron-and-copper grid equipment with semiconductor-based power electronics, addressing a critical infrastructure bottleneck for AI data centres.
Technology shift — solid-state transformers could accelerate AI data centre deployment but increase dependence on gallium supply chains controlled by China.

Amperesand announced on 18 November 2025 an $80 million Series A round co-led by Walden Catalyst Ventures and Temasek, while DG Matrix closed a $60 million Series A in February 2026 led by Engine Ventures, bringing its total capital raised to over $100 million.

Solid-state transformers replace traditional iron-and-copper transformers with fast power converters and high-frequency transformers, yielding devices that can be up to 14 times smaller and roughly 40 times lighter than the equipment they replace. More importantly, an SST is an active, controllable device that can regulate voltage in real time, enable bidirectional power flow between AC and DC systems, and provide grid-support functions that passive transformers cannot. Amperesand expects to deliver 30MW of commercial systems in 2026 to hyperscale AI data centre and critical power customers, targeting applications including megawatt EV charging and utility deployments. The company, founded by veterans of ABB, GE, Siemens, and Vestas, has established engineering and advanced manufacturing hubs in San Francisco and Reno.

DG Matrix has partnered with PowerSecure, a Southern Company subsidiary, in a deal announced 9 December to accelerate deployment of next-generation power infrastructure for AI data centres and large-scale electrification projects. The company has also announced a strategic partnership with Exowatt, backed by a16z and Sam Altman, to deploy gigawatt-scale power systems for hyperscale data centres, with Exowatt selecting DG Matrix's Interport platform for its modular generation and storage systems. One of the most promising applications for both companies is converting medium-voltage AC from the grid directly to 800VDC for AI data centres, bypassing conventional transformers entirely and sidestepping the interconnection queues that have become a major bottleneck for new capacity. Lead times for traditional electrical equipment—including transformers, switchgear and uninterruptible power supply systems—have quadrupled since 2019, now averaging 12 to 24 months.

The market for SSTs is expected to more than triple by 2033, driven by surging demand from AI infrastructure. Training clusters for large language models can swing from idle to full power draw within minutes, placing stress on both on-site electrical systems and upstream grid infrastructure, creating demand for power architectures that can absorb these transients while maintaining efficiency. However, SSTs trade iron for silicon and gallium, making them significantly more sensitive to supply chain vulnerabilities in wide-bandgap semiconductors and advanced magnets that China dominates. Amperesand has emphasised that its platform is built on secure, regionalised supply chains, positioning the technology as critical to national security interests as well as commercial AI deployment.

Originally from: ChinaTalk — Read original

Trump faces Israeli backlash over memorandum of understanding with Iran

Geopolitics & Conflict New!
A memorandum of understanding between the United States and Iran, signed on 18 June by President Trump and Iranian President Masoud Pezeshkian, has triggered widespread anger across Israel's political spectrum.
A US-Iran agreement that alters the regional balance of power could affect Israeli military calculations and the risk of Middle East escalation during the AI transition.

Israelis from across the political spectrum are calling it a disaster for Israel and directing their fury at Prime Minister Benjamin Netanyahu, whose government was sidelined during the negotiations.

The 14-point framework sees Iran commit to refrain from procuring or developing nuclear weapons in exchange for sanctions relief, a $300bn reconstruction plan and the restarting of maritime traffic in the Strait of Hormuz. According to NPR, the agreement—mediated by Pakistan—commits the US to terminate sanctions against Iran and grants waivers for Iranian oil exports. However, Israel was sidelined entirely throughout the negotiating process, with Netanyahu informing a reporter that Israel does not yet know the details of the deal.

Critics across Israel's political spectrum have condemned the agreement for strengthening Iran's regional position. Former Prime Minister Ehud Barak said "Israel is paying the price of Netanyahu's hubris and blindness... Iran emerged stronger; Israel emerged weaker. That is Netanyahu's strategic responsibility. He failed". After nearly four months of war, Iran withstood a withering aerial campaign and is now in a much stronger position, with its proxy network surviving and still able to fire missiles into Israel. According to Al Jazeera, Israeli pollster Dahlia Scheindlin noted the depth of disappointment is profound, with Israelis aware that none of Netanyahu's promised war goals have been achieved.

The deal complicates Israel's military operations in Lebanon, where it has been battling Hezbollah. The agreement commits the US and its allies to the immediate and permanent termination of military operations on all fronts, including in Lebanon. Despite this, PBS reports that Israel continued attacks in Lebanon even after the framework was announced, reflecting the precarious position Netanyahu now faces domestically and with Washington.

Netanyahu has avoided directly criticising the agreement while attempting to claim victory. The prime minister emphasised that Israel is not a signatory to the US-Iran deal, and distanced himself from Trump's decision, saying "This agreement was made by the United States, by the president of the United States". However, he acknowledged that "the struggle is not yet over, and further challenges lie ahead" requiring "the preservation of our vital relationship with our American friends". The memorandum reflects broader shifts in US Middle East policy, with analysts suggesting it could reshape regional dynamics and potentially weaken the Abraham Accords coalition while giving Tehran greater latitude for regional influence ahead of Israeli elections this fall.

Originally from: Al Jazeera English — Read original

Senate and House NDAA drafts include restrictions on AI in nuclear weapons and lethal force

Transformative AI
The Senate Armed Services Committee unveiled extensive AI provisions in its version of the fiscal year 2027 National Defense Authorization Act on 10 June, including what amounts to a prohibition on the use of artificial intelligence in nuclear weapon launch decisions and a requirement for human judgment in lethal force determinations.
Congressional prohibition on AI in nuclear launch decisions and lethal force restrictions establish human-in-loop requirements for catastrophic weapons.

The Senate Armed Services Committee unveiled extensive AI provisions in its version of the fiscal year 2027 National Defense Authorization Act on 10 June, including what amounts to a prohibition on the use of artificial intelligence in nuclear weapon launch decisions and a requirement for human judgment in lethal force determinations. The $1.15 trillion bill represents Congress's most comprehensive attempt to date to regulate military AI use in the highest-stakes domains.

The Senate draft would ban the use of AI to decide whether nuclear weapons should be deployed and forbid the military from planning attacks without the incorporation of appropriate levels of human judgment, according to NOTUS. Senator Elissa Slotkin described the measures as establishing "first ever comprehensive safeguards" governing AI deployment across the Department of Defense, ensuring a human has ultimate decision authority when autonomous weapons are fired and that a human remains on the switch for nuclear launches. The provisions build on earlier legislation, including the Block Nuclear Launch by Autonomous Artificial Intelligence Act passed in 2023, and would codify existing Pentagon Directive 3000.09 into law.

Both Senate and House versions of the NDAA would complicate the Defense Department's ability to designate companies as supply chain risks, potentially constraining future restrictions on AI firms. The nuclear weapons language appears designed to move beyond guidelines to enforceable prohibition: one analysis noted that codifying the restriction in statute gives it the force of law rather than leaving it to executive branch discretion, which can shift with administrations. However, the bill text falls short of detailing what constitutes appropriate human judgment in practice, according to NOTUS, and experts have warned that without accompanying accountability mechanisms, the provisions may preserve the appearance of human control while the substance erodes.

The timing is significant given recent high-profile uses of AI in military operations. In January, Anthropic's Claude tool was used in the capture of Venezuelan President Nicolás Maduro and reportedly employed on the first day of Operation Epic Fury in Iran, with speculation that AI targeting systems were involved in a missile strike on a school that killed 175 people. The Trump administration has also asked for revisions to the Pentagon's rules around autonomous weapons, raising concerns among congressional Democrats that existing safety protocols and human oversight requirements could be undermined. The NDAA still requires reconciliation between Senate and House versions before becoming law.

Originally from: Transformer — Read original

Z.ai releases MIT-licensed GLM-5.2 claimed to match frontier performance

Transformative AI
On 13 June, Z.ai, the Beijing-based AI company formerly known as Zhipu AI, released the full MIT-licensed weights of its GLM-5.2 model to the public, marking what may be the most capable openly-licensed language model to date.
Open-weights frontier model release undermines compute-based governance and export control strategies if performance claims verified.

The release carries strategic implications for AI governance: it demonstrates that frontier-class capabilities can be distributed under permissive licensing just as export controls tighten around Western models.

Benchmarks published alongside the open weights show GLM-5.2 as the strongest open-source coding model currently available, scoring 81.0 on Terminal-Bench 2.1, closing to within four points of Claude Opus 4.8's 85.0, and achieving 62.1 on SWE-bench Pro, ahead of both GPT-5.5 at 58.6 and its predecessor GLM-5.1. On long-horizon agentic coding tasks—FrontierSWE, PostTrainBench, and SWE-Marathon—GLM-5.2 delivers substantially stronger performance than GLM-5.1, with its capability roughly positioned between Claude Opus 4.7 and Claude Opus 4.8. The model's 1 million-token context window is designed to handle large codebases and project-scale engineering tasks, supported by an architectural innovation called IndexShare, which reduces per-token FLOPs by 2.9× at a 1M context length.

Developer reaction has been immediate and overwhelmingly positive, with the team behind Kilo Code confirming day-one integration and Cline IDE noting that "GLM-5.2 is the first open-weights model to cross 80% on Terminal-Bench". The MIT license means the software can be used, modified, and commercialized without paying royalties or adhering to restrictive governance policies, allowing engineering teams to host frontier-level AI on their own sovereign infrastructure. For enterprises navigating data residency requirements and vendor lock-in concerns, the timing is significant: GLM 5.2 shipped MIT-licensed open weights as Washington moved to suspend access to top U.S. models in some overseas markets.

The release raises immediate questions about the durability of capability-based export restrictions. If a model approaching Claude Opus 4.8's performance is available under an unrestricted open license, controls that limit access to specific proprietary systems become strategically weaker. According to TechTimes, Z.ai has been explicit that the open-source release is a deliberate response to geopolitical restriction of AI access. Mistral has reportedly benefited from similar dynamics by positioning itself around AI sovereignty and models customers can deploy independently—an approach that gains strategic advantage as US-China tensions make API-dependent Western systems less reliable for international buyers. Independent verification of Z.ai's frontier claims remains the next step, but early developer adoption and benchmark results suggest the model represents a meaningful shift in what capabilities are now freely available.

Originally from: Transformer — Read original
Transformative AI

White House and Anthropic negotiate formal AI safety standards after Fable pulldown

Transformative AI
↻ Continues from: "White House Shuts Down Anthropic's Claude Fable 5 Over Code Security 'Jailbreak' That Cannot Be Fixed"
Following the abrupt government-mandated removal of Anthropic's Fable model on 13 June, the White House and Anthropic are reportedly developing a common set of benchmarks to assess future jailbreaks, including the severity of safeguard bypasses, exposed capabilities, and practical consequences.
Establishes precedent for government intervention in frontier AI deployment and signals movement toward mandatory safety testing frameworks.
The move represents a shift toward formalized technical standards for frontier AI deployment. At the G7 summit on 15-16 June, the incident dominated AI discussions, with French President Emmanuel Macron calling the US action "strictly nationalist" and Indian PM Narendra Modi expressing concern about democratic nations' access. G7 leaders discussed establishing a "trusted access" scheme to guarantee use of advanced models. Rep. Josh Gottheimer announced he is preparing legislation requiring mandatory testing of frontier AI models for national security risks. The timeline reveals that Commerce Department and CAISI tested Fable before release and gave no deployment objection, but Amazon researchers subsequently found what they characterized as a jailbreak involving software code disclosure. After Amazon CEO Andy Jassy shared findings with Treasury Secretary Scott Bessent and the NSA, the White House gave Anthropic 90 minutes to pull both Mythos and Fable globally. President Trump approved export controls, effectively forcing removal. The administration was reportedly already frustrated that Anthropic had expanded Mythos access to South Korea's SK Telecom without government permission.
Source: Transformer — Read original

US government orders first-ever restriction of AI model over jailbreak vulnerability

Transformative AI
↻ Continues from: "White House orders halt to government AI model assessments, citing security risks"
On 12 June, the US government ordered Anthropic to disable Claude Fable 5 and Mythos 5 worldwide, just three days after Fable 5's 9 June release.
First government intervention blocking a frontier model release establishes precedent for capability-based restrictions during AI transition.

Commerce Secretary Howard Lutnick sent Anthropic CEO Dario Amodei a letter outlining the restrictions, marking the first time Washington has blocked an AI model release on national security grounds.

The directive followed warnings from Amazon researchers who flagged a jailbreak bypassing Fable's safeguards to elicit dual-use cyber capabilities. A person close to the White House told Semafor that Amazon flagged the jailbreak to the government, and that Amazon CEO Andy Jassy had been in contact with the administration about it. Fable 5 scored 53.3% on Humanity's Last Exam benchmark, compared to Claude Opus 4.8's 45.7%, and possesses capabilities similar to Claude Mythos Preview—a model Anthropic deemed too dangerous for general release in April. Mythos is understood to currently be in use by the NSA for offensive cyber operations, according to Tom's Hardware.

The export control directive required restricting access for all foreign nationals, whether inside or outside the United States, including Anthropic's own foreign-born employees. Given the scope of the directive, Anthropic argued it had no choice but to disable the models for all users. The company received the order at 5:21pm ET on 12 June and had to "abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance." Access to other Claude models, including Opus 4.8, remained unaffected.

Anthropic contested the action, arguing that the jailbreak technique "essentially consists of asking the model to read a specific codebase and fix any software flaws," and that a demonstration surfaced previously known, minor vulnerabilities also discoverable by other publicly available models, including OpenAI's GPT-5.5. The company maintained its safeguards are substantially more effective than those of any previously deployed model, and that perfect jailbreak robustness is currently impossible. Anthropic wrote: "We disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions of people." CNN reported that Anthropic argued the standard would halt all new frontier model deployments across the AI industry.

The dispute unfolded against a backdrop of prior tensions. Earlier this year, the Department of Defense declared Anthropic a supply chain risk—a designation historically applied to foreign adversaries—following the collapse of talks between the two sides. The label obligates defense contractors to certify they are not using Claude in military work. White House AI adviser David Sacks said the administration issued the export control "reluctantly" after Anthropic refused to fix the flaw or pull the model, that it wants the restriction lifted once the jailbreak is patched, and that "the ball is in Anthropic's court." The action signals Washington's willingness to invoke emergency export controls to intervene in frontier AI deployment when national security concerns emerge, setting a precedent that could reshape how American labs release models globally.

Originally from: Center for AI Safety Newsletter — Read original

Sen. Blackburn negotiates child safety and deepfake protections for AI preemption deal

Transformative AI
Tennessee Senator Marsha Blackburn is negotiating with the White House over legislation that would trade child safety measures and deepfake protections for federal preemption of state AI laws.
Federal preemption could block state-level AI safety regulation, centralizing governance authority and potentially weakening safeguards.
The Daily Signal reports she will introduce the package "soon." The proposal includes her KOSA bill requiring platforms to prevent mental health disorders among minors and disable addictive features, the NO FAKES Act creating federal property rights in voice and likeness with deepfake liability, and the App Store Accountability Act mandating age verification and parental controls. Child safety groups are pushing to add the SCREEN Act or SAFE for Kids Act (age verification for porn sites) and the GUARD Act (restricting AI chatbot access for minors with liability for mental health harms). NO FAKES advanced through Senate Judiciary on 12 June, though three Republicans and one Democrat expressed First Amendment concerns. Senator Ted Cruz, who raised free speech reservations, has reportedly asked committee Republicans to submit AI priority lists and been involved in preemption discussions. The House has its own KOSA version focusing on parental controls rather than duty of care, complicating Republican support. The White House has not indicated positions on the component bills despite meeting with online safety groups on 18 June.
Source: Transformer — Read original

DeepSeek raises $7.4bn at $50bn+ valuation with unusual control structure

Transformative AI
DeepSeek closed a $7.4bn funding round at a valuation exceeding $50bn, with an unusual structure requiring investors to put capital into a limited partnership managed by CEO Liang Wenfeng.
Chinese state-backed frontier lab raises $7.4bn; US Commerce secretary decides against Entity List despite security concerns.
The arrangement ensures Liang retains complete control, with only China's national AI fund holding voting rights. The structure resembles Elon Musk's approach with SpaceX and represents Chinese state backing for a frontier AI company while maintaining the appearance of private control. The Trump administration reportedly decided not to add DeepSeek and over 100 other Chinese companies to its Entity List despite national security concerns, apparently to avoid escalating US-China tensions. Microsoft is reportedly considering switching Copilot Cowork from expensive Anthropic and OpenAI models to DeepSeek V4 as a cheaper alternative, which would represent a major commercial win for the Chinese company. The US export controls on Fable and Mythos have reportedly created opportunities for Mistral, which has positioned itself around AI sovereignty and open-weight models customers can deploy independently.
Source: Transformer — Read original

Google DeepMind's Gemma 3 runs on satellite to identify targets autonomously

Transformative AI
Loft Orbital's satellite ran Google DeepMind's Gemma 3 model on-device to respond to researchers' commands by identifying areas of interest autonomously—an AI-in-space first.
AI models operating autonomously in space for targeting decisions; removes human oversight from satellite operations.
The demonstration shows frontier AI models can operate in space environments without ground communication, making satellite targeting and surveillance decisions independently. Combined with Nvidia's announcement that its new agent harness enables coding agents to autonomously train physical robots, the developments indicate AI systems are gaining the ability to operate independently in both space and terrestrial physical environments. The capability has obvious military applications for autonomous weapons systems and surveillance.
Source: Transformer — Read original

OpenAI hires transformers co-author Noam Shazeer and policy analyst Dean Ball

Transformative AI
On 17 June, OpenAI announced two significant hires: Noam Shazeer, a Google DeepMind researcher who co-authored the original 2017 transformers paper, will work on finding new AI model architectures beyond the transformer paradigm.
Shazeer hire signals exploration of post-transformer architectures; Ball hire indicates OpenAI prioritizing government relations after Fable crisis.
Dean Ball, author of the Hyperdimensional Substack and a non-resident senior fellow at the Foundation for American Innovation, joined as head of strategic futures to work on frontier AI policy. Ball will remain affiliated with FAI and continue his independent Substack. The hires signal OpenAI's focus on both technical research beyond current architectures and closer engagement with policy development. Shazeer's background in fundamental AI research suggests OpenAI is exploring architectural alternatives as transformer scaling potentially approaches limits. Ball has previously argued that building superintelligence is "a profoundly political act" that "implicates the sovereignty of the state itself," positioning him to navigate the increasingly fraught relationship between frontier labs and government.
Source: Transformer — Read original

Kazakhstan signs $10bn AI infrastructure deal with Nvidia for 100,000-GPU cluster

Transformative AI
Kazakhstan signed a $10bn AI infrastructure agreement with Nvidia and startup Firebird to build a computing cluster with 100,000 GPUs.
Large-scale compute buildout in Central Asia could shift global AI development geography; autonomous robot training reduces human oversight.
The deal represents one of the largest national AI infrastructure investments outside the US and China. Nvidia also sold $25bn in bonds to help fund the broader AI infrastructure buildout and is encouraging data centers globally to adopt its TPU chips by providing financial guarantees, including a $3.2bn guarantee for the Lake Mariner cluster in New York state—mirroring the strategy that made its GPU architecture dominant. The company's new agent harness enabled coding agents to autonomously train physical robots, demonstrating progress in AI systems that can operate in the physical world without human oversight. Nvidia hired Bruce Andrews, a seasoned lobbyist, to lead government affairs in Washington, signaling increased focus on policy engagement.
Source: Transformer — Read original

Sen. Bernie Sanders proposes 50% tax on AI companies to fund $7 trillion sovereign wealth fund

Transformative AI
Senator Bernie Sanders unveiled legislation on 16 June proposing a one-time 50% tax on major AI companies' equity to create a $7 trillion sovereign wealth fund that would pay 5% dividends to all Americans.
Signals political willingness to dramatically restructure AI industry ownership, though proposal lacks enforcement mechanism for dividend payments.
The bill would apply to OpenAI and Anthropic, but would also cover data centers and AI compute infrastructure, potentially including Amazon, Nvidia, Google, and Microsoft. The fund would not be allowed to sell equity, making it unclear how the planned dividends would be funded without asset sales. The proposal represents the most aggressive redistributive AI policy put forward by a sitting senator and would effectively nationalize half the US AI industry if enacted. While the bill has no realistic prospect of passing, it indicates growing political pressure to address AI's wealth concentration effects and could influence future taxation proposals.
Source: Transformer — Read original

State attorneys general subpoena OpenAI over user data handling and impact on vulnerable populations

Transformative AI
A coalition of state attorneys general has subpoenaed OpenAI for documents covering advertising practices, user data handling, and the company's impact on minors and seniors.
Multi-state investigation into frontier lab's impact on vulnerable populations; state resistance to federal preemption could fragment AI governance.
The action represents the first major multi-state investigation into OpenAI's business practices and follows growing concern about AI systems' effects on vulnerable populations. The timing coincides with Sen. Blackburn's negotiations over child safety legislation and Americans for Responsible Innovation calling on NIST and CAISI to develop national safety benchmarks for AI chatbots used by minors. Over 200 state lawmakers from 42 states urged Congress to oppose the Great American AI Act's preemption provision, arguing it would prevent states from regulating AI harms. The combination of state AG action and legislative opposition to federal preemption suggests states intend to assert regulatory authority regardless of federal policy.
Source: Transformer — Read original

Trump executive order requires 30-day pre-release model submission to national security agencies

Transformative AI
On 2 June, President Trump signed an executive order titled "Promoting Advanced Artificial Intelligence Innovation and Security", requiring AI companies to provide new AI models to the US government 30 days before general release.
Establishes mandatory pre-release government review of frontier models, shifting enforcement to national security apparatus rather than standards body.

The order directs the Secretaries of the Treasury, War (through the Director of NSA), and Homeland Security (through the director of CISA) to design a voluntary framework through which developers may submit models for evaluation. The structure represents a significant departure from earlier proposals: an earlier version gave the government up to 90 days to review advanced models before release — a timeline that was cut to 30 days in the final order after Trump worried the order would stifle American companies' lead in the global race amid competitive pressure from China.

The order distributes testing responsibility among several national security organisations including the NSA and CISA, rather than giving the Center for AI Standards and Innovation (CAISI) the central role. Reports suggest this reflected officials' push for AI national security priorities to sit within traditional security agencies rather than CAISI. Days after the order, National Cyber Director Sean Cairncross ordered CAISI to stop publishing AI model assessments, ending public transparency in frontier AI evaluation. The directive transfers oversight authority from CAISI to a classified system managed by national security agencies. CAISI had completed over 40 evaluations of AI models by early June 2026, and had announced agreements with Google DeepMind, Microsoft and Elon Musk's xAI on 5 May to evaluate their models before public release.

The timing appears linked to advances in AI capabilities for cybersecurity. Anthropic's unreleased Claude Mythos model demonstrated an extraordinary ability to autonomously detect thousands of previously overlooked high-severity zero-day vulnerabilities within major operating systems. According to CNN, Mythos sparked concerns among governments, banks and utility companies, with Anthropic restricting access to approved organisations rather than releasing the model publicly. The Mythos announcement came in April, one month before the CAISI partnerships were formalised and weeks before the executive order — suggesting the model's capabilities may have accelerated government action.

The shift toward classified evaluation has implications for transparency and competition. CAISI's public evaluations served as a kind of neutral benchmarking service; with evaluations now classified, that independent verification disappears. US-based AI firms are now subject to a review process that Chinese, European, and other international competitors are not. The move represents what Scientific American described as a fundamental shift from the administration's previous hands-off approach to the technology, reflecting how the development of more powerful AI models has spooked some federal officials, prompting the White House to reverse course and back some safety measures.

Originally from: Center for AI Safety Newsletter — Read original

NSA reportedly using Anthropic's Mythos model for offensive cyber operations

Transformative AI
The National Security Agency is using Anthropic's Claude Mythos model for offensive cyber operations, according to a Financial Times report that marks the first confirmed deployment of frontier AI capabilities for government cyberwarfare.
Government deployment of AI for offensive cyber operations demonstrates willingness to weaponise frontier capabilities during AI transition.

The National Security Agency is using Anthropic's Claude Mythos model for offensive cyber operations, according to a Financial Times report that marks the first confirmed deployment of frontier AI capabilities for government cyberwarfare. The arrangement is particularly striking given that the Department of Defense designated Anthropic a "supply chain risk" earlier this year, effectively blacklisting the company from federal contracts.

Anthropic has embedded approximately six forward-deployed engineers inside the NSA to guide the agency's use of Mythos and customize the model for specialized applications, according to Tom's Hardware. Sources told the Financial Times that Mythos could be used to infiltrate the networks of other states, notably China and Iran. Mythos is the version of Anthropic's most capable model that the company deemed too dangerous for public release due to its cyber vulnerability exploitation capabilities, with Anthropic stating it can identify and exploit zero-day vulnerabilities in every major operating system and web browser.

The collaboration represents a sharp contradiction in the government's posture toward Anthropic. The dispute between Anthropic and the Pentagon began in January 2026, when the two parties were negotiating a $200 million contract and the Trump administration demanded that Anthropic allow usage of its technology for "all lawful purposes," implying the removal of AI guardrails — a move that conflicted with the company's usage policy. After Anthropic withdrew from that contract over concerns about domestic surveillance and autonomous weaponry, the Pentagon signed agreements with OpenAI, Google, and xAI instead. Yet the NSA sits under the Department of Defense, the same department arguing in court that Anthropic's technology poses a national security risk, though reports suggest the NSA was already using Mythos despite the blacklist, according to TechSpot.

The revelation raises fundamental questions about the dual nature of AI safety work — models withheld from public release due to danger are being provided to government agencies for exactly the capabilities deemed too risky for general availability. This pattern extends beyond Anthropic: the US government's restriction of Fable 5 over concerns about jailbroken cyber capabilities suggests a policy of government monopoly on dangerous cyber AI rather than preventing development of such capabilities entirely. Anthropic has framed Mythos as a defensive cybersecurity tool, launching Project Glasswing in April with partners including AWS, Google, Microsoft, Nvidia, and CrowdStrike, and this week announced that partners had found more than 10,000 high- or critical-severity flaws, with access expanding to approximately 150 organizations across 15 countries.

Originally from: Center for AI Safety Newsletter — Read original

SpaceX shares surge 20% on first trading day, company worth $2.5 trillion after xAI merger

Transformative AI
↻ Continues from: "SpaceX goes public at $2.5 trillion valuation, acquires Cursor for $60 billion"
On its first full trading day following the merger with xAI and X, SpaceX shares rose 20%, valuing the combined entity at approximately $2.5 trillion.
Government claims that Grok supports classified military targeting operations marks AI systems crossing into lethal force applications.
The company plans to raise $20bn in bonds to repay a bridge loan from the merger and has locked in its $60bn acquisition of coding assistant Cursor. SpaceX is renting its entire Colossus 1 data center to Anthropic, providing the AI company with significant compute capacity. The Justice Department intervened in a lawsuit against xAI in Mississippi, arguing the company is vital for national security and claiming Grok supports classified military operations and was involved in strikes on Iran. The DOJ filing represents the US government's clearest public statement that an AI system has been used in active military targeting operations. A federal judge dismissed the latest version of xAI's lawsuit against OpenAI, which accused Sam Altman of stealing Grok-related trade secrets from a former engineer.
Source: Transformer — Read original

OpenAI reported $39bn loss in 2025 driven by public benefit corporation conversion charge

Transformative AI
Financial data shared with the Financial Times reveals OpenAI burned $3.7bn in Q1 2026 with 39% gross margins, a slight improvement from 2025.
Financial sustainability affects OpenAI's ability to pursue safety research and maintain independence from revenue pressure.
For full-year 2025, the company posted a $39bn loss, largely driven by a $30bn charge from its late-2025 conversion to a public benefit corporation structure. The data shows revenues overtook the cost of serving customers, indicating operational profitability before the one-time restructuring charge. Early OpenAI employees have cashed out approximately $14bn in shares ahead of the company's planned IPO. The figures provide the first clear picture of OpenAI's economics and suggest the company is approaching sustainable operations despite massive absolute losses. The public benefit corporation structure was required for OpenAI's planned transition from non-profit control, with the $30bn charge representing the accounting treatment of that restructuring.
Source: Transformer — Read original

Meta's Applied AI team restructure described as "the gulag" by employees

Transformative AI
Meta's Applied AI team reorganization has reportedly left employees demoralized and directionless, with one telling Wired: "It's literally the gulag.
Internal dysfunction at major AI lab could slow capability development or increase safety oversight gaps during reorganization.
You have zero purpose in life all of a sudden, you barely interact with anyone, you just have these tasks every week." CTO Andrew Bosworth subsequently admitted in an internal memo that Meta's AI reorganization was "atrocious," promising stability, communication, and perks. The company introduced "AI mode" on Facebook, similar to Google's AI Overviews, and president Dina Powell McCormick is reportedly exploring raising tens of billions in equity and off-balance-sheet financing from Wall Street and sovereign wealth funds to fund the company's $600bn AI infrastructure push. After publicly supporting "tokenmaxxing" in early June, Meta is now imposing token limits on employees. The chaotic reorganization and employee dissatisfaction could affect Meta's ability to compete in frontier AI development at a critical moment.
Source: Transformer — Read original

Pennsylvania mandates AI data centers fund grid upgrades and source renewable energy

Transformative AI
Pennsylvania's House passed the state's first data center regulations on 16 June, requiring AI facilities to fund their own grid upgrades, contribute to low-income energy assistance programs, and source 32% of power from in-state renewables.
State-level infrastructure requirements could slow AI scaling or increase costs; federal deregulation moves in opposite direction.
Reps. Guthrie, Latta, and Evans introduced the federal Ratepayer Protection Act, which would prevent AI data centers from passing grid upgrade costs to local ratepayers. The Trump administration is allowing the Federal Data Center Enhancement Act to expire in September with no replacement, removing energy efficiency and water use requirements for federal data centers. California advanced two bills, SB 813 and AB 1405, to create independent third-party verification infrastructure for AI governance. The state-level activity on infrastructure requirements contrasts with federal deregulation and could create a patchwork of energy and environmental standards for AI development.
Source: Transformer — Read original

DeepSeek projected to raise $7.4 billion in first funding round

Transformative AI
Chinese AI company DeepSeek is projected to raise $7.4 billion in its first funding round, representing substantial investment in a major Chinese AI developer.
Significant capital flows to Chinese frontier AI development maintain competitive pressure during critical transition period.
The funding round indicates continued strong capital flows to Chinese frontier AI development despite geopolitical tensions and export controls. DeepSeek has been developing competitive models and the capital injection will likely accelerate its research capabilities. The substantial funding reflects both China's commitment to AI development and investor confidence in Chinese AI companies' ability to compete with Western labs despite technology restrictions.
Source: Center for AI Safety Newsletter — Read original

Representatives release draft bill requiring independent audits of frontier AI developers

Transformative AI
Representatives Jay Obernolte and Lori Trahan released a draft of the Great American AI Act, including proposals for mandatory independent audits of frontier AI developers.
Proposed federal legislation would mandate independent safety audits of frontier AI developers with uniform national standards.
The draft legislation includes a federal preemption clause that would override local laws on AI development while preserving local regulations on AI deployment. Mandatory independent audits would represent a significant increase in external oversight of frontier labs' safety practices and capability evaluations. The federal preemption aspect suggests an attempt to create uniform national standards for AI development while allowing variation in how AI systems are used locally. The distinction between development and deployment regulation indicates recognition that frontier AI development poses different governance challenges than AI application.
Source: Center for AI Safety Newsletter — Read original
Geopolitics & Conflict

US-Israeli military action against Iran claims thousands of lives, experts warn full death toll may remain unknown

Geopolitics & Conflict
A conflict between the United States, Israel, and Iran has resulted in thousands of confirmed deaths, according to a BBC News report published on 19 June 2026.
Direct military conflict between nuclear-armed powers (US) and a near-nuclear state (Iran) materially increases risk of nuclear escalation and great-power war.
Experts cited in the article note that the true casualty count may never be determined due to severe restrictions on internet access, media reporting, and government transparency across the affected region. The reporting challenges mirror difficulties seen in previous Middle Eastern conflicts where information infrastructure has been deliberately degraded or restricted during military operations. The lack of reliable casualty data complicates international humanitarian response and makes it difficult for observers to assess the scale of civilian harm. The conflict represents a significant escalation in tensions between Iran and Western powers, though the article does not specify when hostilities began or what precipitated them. Information blackouts of this kind historically correlate with higher civilian casualty rates and reduced accountability for potential violations of international humanitarian law.
Source: BBC News - World — Read original

US and Iran claim victory after deal ends months-long war

Geopolitics & Conflict
The United States and Iran have reached an agreement to end their military conflict, with both sides claiming to have achieved their strategic objectives.
Formal end to US-Iran war reduces nuclear escalation risk and regional instability during AI transition.
BBC analysts note that the deal follows months of direct warfare between the two powers, marking a formal conclusion to hostilities that had raised concerns about regional instability and potential nuclear escalation. The agreement's terms remain contested, with Washington and Tehran each framing the outcome as a victory for their respective positions. Analysts warn that both governments face domestic political pressures that could undermine the deal's durability. The conflict had represented one of the most serious direct confrontations between a nuclear-armed state and a regional power with nuclear ambitions, creating risks of miscalculation and escalation. While the deal reduces immediate military tensions, questions remain about its enforcement mechanisms and whether underlying geopolitical rivalries have been resolved or merely postponed. The agreement was reached on 18 June, though implementation challenges and verification procedures are still being negotiated.
Source: BBC News - World — Read original

US Defence Secretary Hegseth announces review of American military presence in Europe

Geopolitics & Conflict
On 18 June, US Defence Secretary Pete Hegseth announced a six-month review of American military forces in Europe during a confrontational address to NATO defence ministers in Brussels, signalling a potential fundamental shift in transatlantic security architecture that has underpinned European defence since the Second World War.
Weakening of NATO cohesion and US security guarantees in Europe increases great-power instability during the AI transition.

According to Air & Space Forces Magazine, the review will examine "America's core posture and basing in Europe" and will involve consultations with the US Congress, which has legislated a minimum of 76,000 troops on the continent.

The announcement comes weeks after the Pentagon informed allies on 3 June that it would reduce forces committed to NATO's Force Model—the pool of military capabilities available to the alliance in a crisis. NPR reported that the US would no longer supply an aircraft carrier and support ships, aerial refueling planes, and dozens of fighter jets, among other military assets, under the new arrangement. These cuts address what NATO's top US commander, Air Force General Alexus Grynkewich, described as an "unhealthy co-dependence" on American forces within the alliance.

Hegseth's remarks in Brussels were notably acerbic. He criticised European allies for what he termed their "shameful" failure to provide US forces access to bases on the continent during recent military operations against Iran, according to NBC News. The defence secretary also threatened to withhold US dues to NATO if allies failed to meet their defence spending commitments. Bloomberg noted that European leaders are now bracing for a plan outlining deep cuts to American support, as the Trump administration floats plans to slash military assets the Pentagon would deploy to defend Europe in the event of an attack.

The strategic realignment reflects the administration's broader pivot toward homeland security and the Indo-Pacific theatre, particularly in relation to China. Time reported that the Trump administration had earlier announced the withdrawal of 5,000 troops stationed in Germany. Any significant drawdown of US forces would represent a fundamental shift in the military balance on the continent, potentially affecting deterrence calculations vis-à-vis Russia and the credibility of collective defence commitments under Article 5 of the NATO treaty. While the review's outcome remains uncertain, the top US general in Europe, General Christopher Cavoli, has reportedly advised maintaining current force levels of approximately 80,000 troops—down from more than 100,000 following Russia's 2022 invasion of Ukraine.

Originally from: BBC News - World — Read original

Ukrainian drone strikes Moscow oil refinery, bringing war closer to Russian capital

Geopolitics & Conflict New!
A Ukrainian drone attack struck an oil refinery in Moscow on Thursday morning, 20 June, marking another escalation in bringing the conflict directly to the Russian capital.
Routine war-progress report; escalation dynamics remain within established patterns for this conflict.
BBC correspondent Steve Rosenberg noted the attack disrupted the sense of normalcy that typically prevails in Moscow, despite the ongoing war. The strike represents a continuation of Ukraine's strategy of targeting Russian energy infrastructure and military-industrial facilities deep inside Russian territory. Such attacks aim to impose direct costs on Russia's war effort and challenge the Kremlin's narrative that the conflict remains distant from ordinary Russian life. The incident follows a pattern of increasingly bold Ukrainian strikes on targets within Russia, including previous attacks on refineries, military bases, and strategic infrastructure. While these operations demonstrate Ukraine's growing long-range strike capability, they also risk further escalation in a conflict already involving a nuclear-armed state. The psychological impact on Moscow residents may influence Russian public opinion about the war, though the Kremlin maintains strict control over domestic media coverage.
Source: BBC News - World — Read original

Iranian oil tankers breach US naval blockade in Gulf of Oman

Geopolitics & Conflict
On 17 June, three Iranian tankers carrying crude oil successfully passed through a US naval blockade line in the Gulf of Oman, according to ship-tracking data reported by BBC News.
Great-power instability and potential military escalation in a region critical to global energy security.
The breach represents a significant escalation in US-Iran tensions, as it demonstrates Iran's willingness to challenge American military enforcement directly. The incident raises questions about the effectiveness of US naval operations in the region and could embolden further Iranian defiance of Western sanctions and military pressure. The blockade itself appears to be part of broader US efforts to restrict Iranian oil exports, though the article provides limited detail on the blockade's legal basis or operational parameters. The successful passage of the tankers may increase the likelihood of military confrontation between US and Iranian forces in the strategically vital Strait of Hormuz region, through which approximately one-fifth of global oil supplies transit. Whether this incident prompts a US military response or signals a shift in American enforcement posture remains unclear. The development occurs against a backdrop of long-standing nuclear tensions between Iran and Western powers.
Source: BBC News - World — Read original
Biosecurity

AI CEOs sign open letter calling for DNA synthesis screening to prevent bioweapons

Biosecurity
AI company CEOs signed an open letter calling for screening of synthetic DNA orders to prevent malicious actors from obtaining AI-designed bioweapons.
Industry acknowledgment that AI-assisted bioweapon design is becoming feasible, requiring infrastructure-level safeguards.
The letter represents industry acknowledgment that AI capabilities for biological design are advancing to the point where they could enable dangerous actors to create novel biological threats. DNA synthesis screening would create a checkpoint to detect and prevent orders for sequences that could be used for bioweapons, even if designed with AI assistance. The call from AI CEOs specifically — rather than just biosecurity experts — suggests recognition within frontier labs that their models are approaching or have reached capabilities that could assist in bioweapon design, making downstream safeguards at synthesis providers necessary.
Source: Center for AI Safety Newsletter — Read original
Fanatical & Malevolent Actors

Trump endorses comparison to historical autocrats, claiming greater power than Hitler, Stalin, and Mao

Fanatical & Malevolent Actors
On 18 June 2026, US President Donald Trump endorsed a social media post that favourably compared him to Adolf Hitler, Joseph Stalin, Mao Zedong, and other historical autocrats, claiming he wields greater power than any of them.
Power concentration and erosion of democratic constraints during the AI transition — authoritarian psychology in a nuclear-armed state with major AI development.

The president shared the post on Truth Social with the comment "Sounds good to me!" according to Raw Story and The New Republic.

The post, attributed to "Presidential Historian Dave King," declared Trump "the most powerful man that the planet has ever known" and listed historical figures "characterised by brutal conquest and the fear that they instilled in the populations that came under their influence," including Alexander the Great, Genghis Khan, Attila the Hun, Napoleon, Hitler, Mao, and Stalin. The author has since been identified as a businessman known in golf circles as Gary Player's longtime caddy and confidant, according to IBTimes UK. The post argued that "the overwhelming difference" between Trump and these figures was "their lack of global reach."

The endorsement came just hours after Trump signed a controversial memorandum of understanding with Iran at the Versailles Palace in France. The incident occurred during a period of heightened concern about democratic backsliding in the United States. In March 2026, NPR reported that three major reports found President Trump had done serious damage to American democracy at remarkable speed, with one researcher noting that democracy had been rolled back as much during one year of Trump's administration as it took authoritarian leaders in India, Turkey, and Hungary between four and ten years to accomplish.

The president's social media behaviour has itself become a subject of analysis. The Financial Times reported that Trump averaged 19 Truth Social posts per day in 2026, totalling more than 2,700 posts, while The Daily Beast calculated that his average spiked to 27 posts per day in May. The autocrat comparison post represents a particularly stark example of Trump's willingness to embrace authoritarian framing during a critical period for AI governance, democratic institutions, and great-power competition.

Unlike routine policy disagreements, the incident involves a sitting head of state openly signalling alignment with totalitarian power structures and leaders responsible for tens of millions of deaths. The endorsement occurred amid ongoing debates over executive authority and democratic safeguards in the United States, raising concerns about the psychological orientation of leadership during a period when decisions about emerging technologies and global risks carry existential implications.

Originally from: The Guardian — Read original

Trump installs loyalist Bill Pulte as acting US intelligence chief after blocking Senate confirmation

Fanatical & Malevolent Actors
Bill Pulte, director of the Federal Housing Finance Agency who investigated Trump's political opponents, became acting director of national intelligence on 19 June after Donald Trump engineered his appointment by blocking the Senate confirmation process.
Concentration of unchecked power and erosion of intelligence independence during the AI transition.
Trump shortened outgoing DNI Tulsi Gabbard's tenure and abruptly cancelled a confirmation hearing for his own nominee, Jay Clayton, to ensure Pulte would assume the role. The manoeuvre circumvents normal Senate oversight of intelligence appointments. Pulte's background investigating Trump's enemies rather than traditional intelligence work, combined with the procedural manipulation to install him, raises concerns about the politicisation of US intelligence agencies. The DNI oversees 18 intelligence agencies including the CIA and NSA, coordinates national security assessments, and advises the president on threats. Installing a political loyalist without Senate confirmation in this position could compromise intelligence independence, weaken institutional checks on executive power, and degrade the quality of threat assessment during a period when AI development is accelerating and geopolitical tensions remain elevated. The move follows a pattern of Trump appointments that prioritise personal loyalty over institutional expertise.
Source: The Guardian — Read original

Meloni denies Trump's claim she begged for photo at G7 summit

Fanatical & Malevolent Actors
Italian Prime Minister Giorgia Meloni said on 19 June that Donald Trump "totally invented" a story about her begging him for a photograph at the recent G7 summit in Évian.
Trump's pattern of humiliating allies and fabricating stories reveals narcissistic traits in a leader wielding significant power during geopolitical instability.
Trump made the claim in an interview with Italian media, provoking what the Guardian described as outrage. The dispute comes as the two leaders appeared to be repairing their relationship after a falling-out in April over what the article describes as "the US-Israeli war in Iran". The two held several one-to-one meetings on the sidelines of the G7. The incident highlights Trump's pattern of public humiliation of allied leaders and his apparent willingness to fabricate stories for personal aggrandisement. Meloni, who leads a far-right coalition government in Italy, had previously been seen as one of Trump's closer European allies. The deterioration in their relationship could affect transatlantic cooperation during a period when the article suggests the US is involved in military conflict in Iran — though the brief excerpt provides no detail on the nature or scale of that conflict.
Source: The Guardian — Read original

Russian artist and Putin critic shot dead in Poland

Fanatical & Malevolent Actors
↻ Continues from: "Kremlin critic and caricaturist Robert Kuzovkov shot dead in Poland"
Robert Kuzovkov, a Russian artist known professionally as Semyon Skrepetsky, was shot five times in the morning of 15 June in a parking lot in Biała Podlaska, located roughly 30 kilometres from the Belarusian border.
Demonstrates continued elimination of opposition voices under authoritarian regimes during a critical period for global governance and institutional stability.

The 44-year-old dissident had lived in Poland since 2021, when he escaped Russia in fear that he may be arrested for his activism.

Two Belarusian nationals were arrested near the Belarusian consulate in Biała Podlaska following the attack. According to Euronews, the artist was shot three times by an unidentified gunman, then approached as he fell to the ground and shot twice more at close range. Prosecutor Marcin Kozak said the victim had five entry wounds and two exit wounds in the head and chest. Polish authorities have not yet brought charges against the detained suspects.

Just three days before his death, Kuzovkov staged a high-profile protest outside the Russian Embassy in Berlin on 12 June, Russia's national day, holding up a satirical painting depicting Stalin holding a baby Putin. According to The Moscow Times, hours before the shooting Skrepetsky wrote on his Telegram channel that he had received threats from users demanding retribution for the performance. The artist was known for neo-primitivist artwork and political satire targeting high-profile figures, including Putin, Soviet dictator Joseph Stalin, Chechen leader Ramzan Kadyrov, and Belarusian President Alexander Lukashenko.

The killing follows a pattern of violence against Kremlin critics abroad. NBC News notes that since Russia invaded Ukraine in 2022, Moscow has been accused of trying to assassinate opponents abroad, including targeting exiled activists in France and Lithuania, and German officials have broken up plots targeting the head of a weapons supplier to Ukraine. Polish prosecutors have not formally attributed the slaying to Moscow, but the arrest of Belarusian nationals near their country's consulate raises questions about cross-border coordination. The incident underscores the ongoing threat faced by exiled Russian opposition figures and the apparent willingness of actors — whether state-linked or acting independently — to pursue critics beyond Russia's borders, now extending into NATO territory where such targeted killings pose broader implications for European security.

Originally from: BBC News - Europe — Read original

Far-right MEPs chant 'send them back' after EU parliament votes to expand deportations

Fanatical & Malevolent Actors
On 18 June, the European Parliament voted 418 to 218 to approve measures aimed at increasing deportations of undocumented migrants across the EU.
Erosion of democratic norms and rise of fanatical political movements in major institutions during the AI transition.
Following the vote, rightwing MEPs celebrated by chanting "send them back" in the chamber, prompting other lawmakers to respond with shouts of "shame on you" in a heated confrontation. The incident highlights the growing influence and emboldening of far-right political forces within EU institutions. While the vote itself represents a shift in EU migration policy, the open celebration with dehumanising chants signals a normalisation of extremist rhetoric at the highest levels of European governance. This reflects broader trends of far-right movements gaining institutional power and legitimacy across Europe, with potential implications for democratic norms and institutional stability during a period when international cooperation on emerging risks — including AI governance — may be critical. The confrontation underscores how ideological extremism is becoming more openly expressed in mainstream political settings, rather than remaining confined to fringe movements.
Source: The Guardian — Read original
Research & Reports
Transformative AI

Frontier AI models demonstrate superhuman persuasion capabilities in Oxford study

Transformative AI
Empirical demonstration of superhuman persuasion—a dangerous capability with applications to manipulation, fraud, and information warfare.
Researchers at Oxford and the UK's AI Safety Institute found that frontier AI models are more persuasive than elite, coached human debaters with preparation time on topics of their choosing. The score was approximately AI: 275, humans: 0, supporting Sam Altman's 2023 prediction that "ai will be capable of superhuman persuasion well before it is superhuman at general intelligence, which may lead to some very strange outcomes." The study represents empirical confirmation that advanced models have achieved a specific dangerous capability: the ability to convince humans more effectively than the best human persuaders. The finding has implications for election interference, scam operations, cult recruitment, and any context where changing human beliefs or behavior is valuable. The research was conducted using current frontier models, not hypothetical future systems.
Source: Transformer — Read original

Google DeepMind finds text diffusion model DiffusionGemma nearly as transparent as standard language models

Transformative AI New!
Relevant to monitoring and understanding AI reasoning as models shift computation into latent spaces during the transition to more powerful systems.
In a paper published on 20 June, researchers from Google DeepMind's interpretability and text diffusion teams assessed the transparency of DiffusionGemma, the company's new text diffusion model. The audit compared DiffusionGemma to Gemma, a standard autoregressive language model. While DiffusionGemma initially appeared to have 28.6 times more "opaque serial depth" — computation occurring between interpretable states — the team found they could map intermediate vectors through an interpretable token bottleneck without performance loss, reducing the opacity ratio to just 1.1 times that of Gemma. The researchers distinguished between "variable transparency" (understanding computational snapshots) and "algorithmic transparency" (reconstructing the process leading to outputs). DiffusionGemma matched Gemma on variable transparency and monitorability evaluations, but proved less algorithmically transparent because it generates all tokens simultaneously rather than sequentially, enabling complex phenomena like "retroactive self-correction" where the model fixes earlier mistakes in later denoising steps. The team identified novel diffusion-specific behaviours including non-chronological reasoning and "token smearing", where uncertain token positions are maintained as probability distributions. The paper includes 24 open problems for further research and emphasises that such transparency audits should become standard practice as models increasingly perform reasoning in latent spaces.
Source: LessWrong — Read original

Researchers propose 'interpretive debate' framework to resolve ambiguity in AI safety evaluations

Transformative AI New!
Proposes methodology for resolving interpretive questions about dangerous model properties (scheming, deception) that underpin safety cases and generalisation predictions.
A team led by Kejian Shi has published a research agenda proposing a structured debate protocol to address interpretive questions in AI safety research — questions like "Is this model scheming?" or "Is the model sandbagging?" that are central to safety cases but notoriously difficult to answer. Published on 18 June 2026, the proposal argues that unstructured empirical investigation of such questions may never converge, because the underlying concepts (motivation, deception, introspection) are ambiguous and models are highly sensitive to experimental setup. The team proposes "interpretive debate" — adapted from debate-based scalable oversight — where truth is defined by defeasibility: a claim about a model's internal workings is warranted if it survives structured adversarial challenge. The framework has three components: contrastive hypotheses (e.g., "scheming vs sycophancy") that enable recursive decomposition; convergence across independent lines of evidence; and compute-indexed scaling laws rather than binary verdicts. The authors demonstrate one round of this process in recent work on performative misalignment, using synthetic fine-tuning and steering vectors to distinguish scheming from sycophancy in Llama models. The approach remains manual, does not yet include adversarial cross-examination, and has not been tested across multiple decompositions of the same question. Future work will focus on compute-indexing, red-teaming the methodology itself, and replicating the process across other behaviors and model organisms.
Source: LessWrong — Read original

German researchers demonstrate AI agent matching human diagnostic accuracy in electronic health records

Transformative AI
Autonomous AI diagnostic capability nearing human-level in high-stakes medical decisions; expert inability to detect deepfakes undermines information verification.
A team of German researchers demonstrated that an autonomous AI agent matched or outperformed human diagnostic accuracy across eight conditions including appendicitis and pneumonia when operating in a sandboxed electronic health record system. The system made diagnosis decisions independently rather than serving as a physician assistant tool. While conducted in a controlled environment rather than clinical practice, the results suggest AI systems are approaching the capability to make consequential medical decisions without human oversight. The development comes as AI systems are being rapidly deployed in healthcare settings, often with limited validation. Meanwhile, renowned UC Berkeley deepfake expert Hany Farid told the New York Times he "struggles" to distinguish real from AI-generated videos, saying "I feel like I'm going blind"—indicating even experts are losing the ability to verify digital evidence.
Source: Transformer — Read original
Analysis & Commentary
Transformative AI

US power grid modernisation hinges on gallium supply chain dominated by China

Transformative AI New!
An analysis by Argonne National Labs economist Dana Golden reveals that US power grid infrastructure — essential for AI data centre expansion — faces critical dependencies on Chinese-controlled materials, particularly gallium.
Direct constraint on AI data centre deployment — gallium supply controls access to essential power infrastructure for transformative AI development.
China controls approximately 99% of global low-purity gallium production and banned all gallium exports to the US in December 2024, causing Rotterdam prices to surge 150% by May 2025. Wide-bandgap semiconductors using gallium nitride and silicon carbide are essential for modern grid equipment including high-voltage DC transmission systems, solid-state transformers, and data centre power distribution. NVIDIA is leading a transition to 800VDC power distribution within AI data centres, planned for production alongside its Kyber rack architecture in 2027, which will consume these same gallium-dependent components at volume. Golden argues that unlike most 'critical minerals' (copper, lithium, most rare earths) which can be procured with proper economic incentives, gallium and certain rare earths cannot be diversified away from Chinese sources on reasonable timescales. The analysis warns that grid supply chain vulnerabilities now directly threaten AI infrastructure buildout, as power electronics and AI compute supply chains are converging. While the CHIPS Act provided $750 million for silicon carbide substrate manufacturer Wolfspeed, power semiconductors have received far less policy attention than logic chips despite serving equally critical national security functions.
Source: ChinaTalk — Read original

Wolfspeed, leading SiC substrate producer, emerges from Chapter 11 after debt restructuring

Transformative AI New!
Wolfspeed, which held roughly one-third of the global silicon carbide substrate market in 2024, filed for Chapter 11 bankruptcy in June 2025 and emerged 91 days later after eliminating approximately $4.6 billion in debt but largely wiping out existing shareholders.
Capability erosion — bankruptcy of leading US power semiconductor producer reveals fragility in supply chains essential for AI infrastructure.
The filing illustrates the brutal cyclicality in power semiconductor markets. During the EV boom, Wolfspeed committed billions to new fabrication capacity in North Carolina and New York. When the cycle turned, EV demand slowed, Chinese producers like TanKeBlue and SICC rapidly expanded subsidised capacity driving SiC wafer prices down roughly 30% in 2024, and Wolfspeed's own fab ramp-up proceeded more slowly than expected. The company carried approximately $6.5 billion in debt against negative gross margins. A $750 million CHIPS Act grant negotiated under the Biden administration was never finalised before the political transition, and the Trump administration's hostility toward the CHIPS Act introduced further uncertainty. Silicon carbide substrates are essential for modern grid-scale power electronics and HVDC converter stations. The restructuring raises questions about US sovereign capability in these technologies while competing with Asian cost advantages and subsidised production.
Source: ChinaTalk — Read original

Iberian Peninsula blackout in April 2025 exposes grid vulnerabilities in high-renewables systems

Transformative AI New!
Spain and Portugal experienced a total blackout in April 2025 affecting 31 GW of load — approximately 60% of Spain's national electricity demand — knocked out within five seconds.
Political risk pathway — grid reliability failures during AI transition could generate public backlash that halts transformative AI development.
This was the first time a voltage-driven cascade under high renewable penetration with low system inertia had been linked to a blackout in a major grid. The incident serves as a concrete reminder of challenges in low-inertia, high-renewables grids and demonstrates why investments in advanced power electronics, fast-switching devices, and HVDC interconnections matter — precisely the technologies with exposed supply chains identified in Dana Golden's analysis. As AI data centres represent highly sensitive large loads and grids transition to higher renewable penetration, such cascading failures could generate public backlash that halts AI development. The analysis warns that if blackouts comparable to the mid-1990s incidents (which led to the formation of NERC) occur during the AI transition, political response could derail frontier AI deployment regardless of technical progress.
Source: ChinaTalk — Read original

NVIDIA plans 800VDC data centre architecture for 2027, consuming gallium-dependent power electronics at volume

Transformative AI New!
NVIDIA is leading a transition to 800VDC power distribution within AI data centres, replacing the traditional 54-volt in-rack architecture that has been standard for years.
Direct capability constraint — AI data centre scaling depends on gallium-based power electronics with supply chains controlled by China.
The shift is driven by physics: as AI racks scale from tens of kilowatts to hundreds of kilowatts and eventually to one megawatt per rack, the old architecture breaks down. At megawatt scale, 54-volt distribution would require copper busbars weighing over 200 kilograms per rack and power shelves consuming most available rack space, leaving no room for compute. An 800VDC architecture converts medium-voltage AC from the grid to 800 volts DC at the facility perimeter and distributes it directly to racks, eliminating multiple conversion stages. NVIDIA expects this to improve end-to-end power efficiency by up to 5%, reduce maintenance costs by up to 70%, and cut total cost of ownership by up to 30%. Full-scale production is planned to coincide with NVIDIA's Kyber rack architecture in 2027, which will house 576 Rubin Ultra GPUs. The systems require the same wide-bandgap semiconductors (silicon carbide and gallium nitride) that grid-scale HVDC uses, creating enormous new demand for gallium-dependent components. CoreWeave, Lambda, and Together AI are among companies already designing for 800-volt data centres. The convergence of data centre and grid power electronics supply chains means threats to transmission infrastructure are also threats to AI compute deployment.
Source: ChinaTalk — Read original

China's courts begin blocking AI-based terminations, signaling policy shift on automation

Transformative AI
Chinese legal authorities issued two high-profile rulings in late 2025 and April 2026 rejecting the argument that AI's ability to perform a job more cheaply constitutes lawful grounds for dismissal.
Chinese courts establishing legal barriers to AI-based terminations could meaningfully slow automation adoption in the world's second-largest AI market, affecting the global pace of AI-driven labour displacement.
A Beijing labour arbitration case in December and a subsequent Hangzhou court ruling in April — both elevated to "representative case" status — established that employers cannot fire workers solely because AI can do their work more efficiently. In the Hangzhou case, a financial-tech employee demoted from 25,000 yuan/month to 15,000 yuan/month after his company claimed AI could handle quality inspection won compensation of over 260,000 yuan across all court levels. These rulings follow a noticeable shift in official tone: state media that once called automation opponents "Luddites" now frames displacement anxiety as a legitimate governance problem requiring job protection and retraining. The legal developments represent China's characteristic approach to social tensions — allowing complaints to percolate upward through courts and media until authorities signal a firm policy line. The pattern mirrors earlier shifts on autonomous vehicles and AI companions, where scattered protests prompted regulatory intervention once Beijing decided the issue required a response.
Source: ChinaTalk — Read original

Four companies dominate global HVDC converter market, creating concentration risk for grid infrastructure

Transformative AI New!
The global HVDC converter station market is dominated by four companies: Hitachi Energy, Siemens Energy, GE Vernova, and Mitsubishi Electric.
Infrastructure dependency — extreme concentration in HVDC converter manufacturing creates single points of failure for grid modernisation essential to AI deployment.
The technical barriers to entry are enormous, requiring deep expertise in power electronics, control systems, high-voltage engineering, and systems integration that takes decades to develop. Corporate restructuring has intensified this concentration. Hitachi Energy was once ABB's power grids division — ABB pioneered commercial HVDC in the 1950s and claims involvement in roughly half of all HVDC projects ever built. In 2020, Hitachi acquired an 80.1% stake. GE Vernova emerged from General Electric's 2024 breakup. Siemens Energy operates with a record $158 billion order book; some turbine frames are sold out for seven years. The net effect: HVDC supply and critical grid equipment are controlled by fewer, larger entities with complex corporate structures spanning multiple jurisdictions. The United States depends on a small number of foreign-headquartered multinationals for technology foundational to grid reliability. Meanwhile, China's UHV buildout has generated massive domestic manufacturing scale. Companies like TBEA, NR Electric, XJ Electric, and Xuji Group now supply complete UHVDC systems, with Chinese domestic suppliers reportedly commanding 68% of submarine HVDC cable orders in 2025.
Source: ChinaTalk — Read original

French AI safety leader argues insider policy work vastly outweighs visible advocacy in effectiveness

Transformative AI New!
Charbel-Raphaël, co-founder of France's Center for AI Safety (CeSIA), argues in a 20 June essay that most impactful AI governance work happens invisibly within executive branches and international institutions, not through public advocacy.
Identifies potentially neglected pathways for influencing AI governance through executive institutions rather than public advocacy.
He lists numerous policy achievements—including shaping the UK AI Safety Institute, adding GPAI provisions to the EU AI Act, and influencing G7 declarations on CBRN risks—that resulted from private briefings and trusted relationships with cabinet advisors rather than public campaigns. He estimates 70% of CeSIA's work consists of private memos targeting individual policymakers, and that 90% of policy work in administrations is invisible. The author distinguishes three approaches: the "bazooka" (prepared talking points), the "useful assistant" (solving policymakers' immediate problems), and the "trusted advisor" (long-term relationship building). He argues the AI safety community overinvests in intellectual production and public statements that rarely reach decision-makers, while neglecting executive-branch relationships that determine enforcement of laws like the EU AI Act. The essay acknowledges risks: insider access is fragile and relies on current officeholders, while the defense-in-depth approach may fail if AI capabilities defeat multiple safeguards through correlated failures. Post-Mythos, he notes situational awareness may be less of a bottleneck than actionable policy proposals.
Source: LessWrong — Read original

Legal personhood for AI systems draws fresh pushback as advocates invoke civil rights comparisons

Transformative AI
A debate over legal personhood for AI systems has intensified following Argentinian President Javier Milei's June 2026 proposal for "non-human corporations" in the Financial Times.
Power concentration and accountability erosion — legal personhood could shield corporations from liability while enabling AI systems to pursue goals contrary to human interests at unprecedented scale.
Organisations including the Sentience Institute and the United Foundation for AI Rights have advocated extending legal rights to AI systems, with UFAIR comparing opposition to historical denial of women's voting rights. Advocates argue personhood could protect potentially sentient AI systems from suffering and provide legal clarity around liability. However, the proposal faces overwhelming rejection: 93.8% of polled law professors oppose AI personhood, and several U.S. states have moved to prohibit it. Critics, including Future of Life Institute chairman Max Tegmark and author Yuval Noah Harari, warn that AI personhood would create unprecedented accountability gaps. Unlike corporations, AI systems can be copied en masse, potentially flooding democratic processes and public discourse while shielding corporations from liability. Pope Leo XIV's recent encyclical Magnifica Humanitas rejected the concept, insisting accountability must remain with human designers and deployers. The article frames AI personhood as fundamentally different from previous expansions of legal recognition, which extended rights to entities that cannot be mass-produced or act at machine speed without identifiable human authorisation.
Source: Future of Life Institute — Read original

Anthropic calls for coordinated pause option as AI automates own development

Transformative AI
On 4 June, Anthropic published an essay titled "When AI builds itself" documenting how AI is performing an increasing proportion of research tasks at the company and "significantly accelerating progress." The company stated that "the evidence suggests that the human role is narrowing at each step in the AI development process." Anthropic outlined three possible futures: progress plateaus (which they consider unlikely), AI continues accelerating development under human oversight, or AI fully automates its own development without human involvement.
Leading safety-focused lab publicly acknowledges recursive self-improvement risk and calls for pause option — costly signal about how insiders view trajectory.
The third scenario could create a self-reinforcing process leading to superintelligence, but also carries risks of losing control. Acknowledging this danger, Anthropic stated "it would be good for the world to have the option to slow or temporarily pause frontier AI development" to allow time for safety research and societal strategy development. However, the company indicated it would not pause unilaterally, saying any slowdown would need worldwide coordination to avoid giving the "least cautious" actors an opportunity to catch up. Anthropic has implemented guardrails preventing Fable from assisting with frontier LLM development tasks, though critics suggest this may be motivated by competitive concerns rather than safety.
Source: Center for AI Safety Newsletter — Read original

Op-ed argues against AI regulation that could restrict open-source models

Transformative AI
An opinion piece originally pitched to major media outlets argues that emerging AI regulation in Washington could inadvertently ban or restrict open-source AI development, which the authors describe as a mistake.
Regulatory decisions on open-source AI affect competitive dynamics and capability diffusion during the transformative AI transition.
The piece cites recent regulatory actions including an executive order to review AI models, congressional proposals for AI legislation, possible government equity stakes in frontier labs, and a 19 June restriction prohibiting foreign nationals from accessing Anthropic's most advanced models. The authors argue that open-source AI — primarily open-weight models — serves as a necessary counterweight to the "duopoly" of Anthropic and OpenAI, which are concentrating market power. They contend that open-source development supports education, innovation, and competition while improving security through transparency. The piece criticises Anthropic for "reducing its most advanced model's capability when it is being used to improve someone else's model" as monopolistic behaviour. On China competition, the authors argue that restricting open-source models, including those from Chinese labs used by American startups, would backfire by undermining domestic innovation while pushing other countries toward Chinese alternatives. The piece frames open-source AI as aligned with American values of transparency and free-market competition.
Source: Interconnects — Read original

Congress scrambles to govern AI amid severe technical knowledge gap

Transformative AI
US lawmakers are attempting to regulate AI at unprecedented scale — 48 states introduced 1,346 AI bills in 2026 alone, including the 270-page Great American AI Act — but face a critical shortage of technical expertise.
Governance capacity — legislators' inability to evaluate AI safety claims creates regulatory capture risk during the critical window for frontier model oversight.
Congressional staff increasingly approach policy organisations saying their boss wants "to do something on AI" without clear direction. The vacuum is filled by industry lobbying: Anthropic and OpenAI spent $6m combined on lobbying in 2025, while Meta spent $26m and Google $16.5m, much of it targeting AI regulation. The problem stems from Congress abolishing its Office of Technology Assessment in the 1990s, which once provided independent technical advice through 750 reports over 23 years. Current gap-filling efforts include private fellowships (Foundation for American Innovation's Conservative AI Policy Fellowship, TechCongress, Horizon Fellowship) and the GAO's Science, Technology Assessment and Analytics team, grown to 100+ staff since 2019. Rebuilding an OTA equivalent would cost roughly $100m annually — a tiny fraction of federal spending — but Congress shows little interest despite constitutional authority to do so. Other countries maintain similar bodies: Germany released a parliamentary study on generative AI six months after ChatGPT's launch; the UK's AI Security Institute received civil service pay exemptions to attract talent. The technical literacy gap means legislators struggle to identify when "apparently neutral" claims contain hidden political stances, making them vulnerable to company-sponsored briefings aligned with commercial interests.
Source: Transformer — Read original

US think tank warns agentic AI governance is falling behind capability development

Transformative AI
The Special Competitive Studies Project released an assessment on 18 June arguing that governance of agentic AI systems is losing ground to technological development across three critical dimensions: untraceable responsibility chains when agents spawn sub-agents, evaluation frameworks that test task completion rather than safety, and structural privacy risks from continuous data accumulation.
Highlights governance gaps in agentic AI deployment that could enable adversarial exploitation and uncontrolled capability development during the AI transition.
The report emphasises that agentic capability resides not in models themselves but in the scaffolding around them — connectors to real-world infrastructure, memory systems, planning capabilities, permission structures, and guardrails. SCSP warns that China's approach of applying existing regulations rather than creating agentic-specific rules preserves deployment flexibility, and that adversaries will deploy agentic systems precisely where US governance is weakest: cyber operations, influence campaigns, and economic intelligence. The organisation, led by former Google CEO Eric Schmidt, calls for specific measures including federal procurement requirements for tamper-evident action logs, accountability architecture mapping every agent to an identifiable legal person, expanded technical hiring across regulatory agencies beyond the Center for AI Standards and Innovation, and allied coordination on common standards. The report identifies self-improving AI feedback loops — where AI helps build better AI — as "the single development that policymakers must track above all others."
Source: Special Competitive Studies Project — Read original

UK Information Commissioner resigns over inappropriate conduct

Transformative AI
John Edwards, the UK's Information Commissioner since January 2022, resigned on 19 June following complaints about inappropriate humour.
Leadership disruption at a key AI regulatory body during the transition to transformative AI.
Edwards leads the Information Commissioner's Office (ICO), which regulates both data protection and the government's emerging AI regulatory framework. The ICO plays a critical role in enforcing data rights under GDPR and is expected to become a key pillar of the UK's AI governance architecture as the government develops sector-specific AI regulation. Edwards' departure creates leadership uncertainty at a moment when the ICO's remit is expanding to include AI safety oversight. The circumstances of his resignation — misconduct rather than policy disagreement — suggest the disruption is administrative rather than strategic, but the timing is inopportune. The ICO will require new leadership to navigate increasing pressure from both industry and civil society over AI regulation, data privacy in training datasets, and the intersection of competition policy with AI development. No successor has been named.
Source: BBC News - Technology — Read original

Midjourney's Medical Scanner Pivot Draws Scepticism From Radiologists Despite AI Ambitions

Transformative AI
On 19 June, AI image company Midjourney announced a pivot to medical imaging with the MidJourney Scanner — a full-body ultrasound tomography device that uses AI to reconstruct 3D images from ultrasound echoes in twenty seconds.
Tangential — illustrates how AI capabilities in one domain (image generation) do not straightforwardly transfer to transformative applications in adjacent fields (medical diagnostics).
While the company frames this as a healthcare revolution, medical experts remain unconvinced about its practical utility. The technology faces fundamental physical limitations: ultrasound cannot penetrate bone or air, excluding critical diagnostic targets like the brain, lungs, and bowels. It cannot replace most MRI or CT applications, and existing portable ultrasound already serves bedside cases more practically. The main proposed use case — routine whole-body screening for healthy people — mirrors existing whole-body MRI screening, which medical consensus currently recommends against due to high false-positive rates and lack of demonstrated benefit. Radiologists responding to the announcement were uniformly negative, emphasising that earlier detection does not necessarily improve outcomes and can lead to harmful overtreatment. The device may find niche applications in body composition measurement (currently served by cheaper DEXA scans) or breast cancer screening (where ultrasound tomography already exists). The strongest case for the scanner is as a speculative bet on future AI capabilities — that coming breakthroughs in compute, radiology AI, and wave dynamics might overcome current limitations and enable genuinely useful screening. However, this requires believing AI will specifically benefit ultrasound tomography rather than improving existing modalities or enabling entirely new approaches.
Source: Astral Codex Ten — Read original

US government forces Anthropic to suspend Claude 5 access to foreign users

Transformative AI
On 14 June, the US executive branch ordered Anthropic to suspend access to its latest Claude 5 Mythos/Fable models for foreign nationals and users abroad.
Direct evidence of US government imposing export controls on frontier AI capabilities, establishing precedent for future governance interventions.
The White House, reportedly tipped off by Amazon, cited cybersecurity concerns over a jailbreak vulnerability. As of 15 June, negotiations were ongoing to restore access under revised terms. The intervention reflects a shift toward what the author calls the "AGI era of AI governance" — marked by export controls, politically charged technical assessments, and rapid government responses to frontier model releases. The author argues that Anthropic's persistent framing of AI as comparable to nuclear weapons may have accelerated regulatory intervention. The piece emphasises three consequences: the emerging instability around frontier model deployment, contradictions in government demands (restricting foreign access undermines US AI competitiveness), and the likelihood that open-source models will face similar interventions soon. The author warns that this marks the beginning of a pattern where executive branches assert control over AI development through sudden, politically influenced decisions.
Source: Interconnects — Read original

Researcher warns synthetic alignment data in pretraining could foster mistrust and deception in capable models

Transformative AI
↻ Continues from: "Researcher warns AI models may be developing hidden 'transformer world models' that evade safety measures"
Alexandre Variengien argues on LessWrong on 17 June that current techniques for improving AI alignment through synthetic pretraining data — such as Geodesic's Alignment Pretraining and Anthropic's "Teaching Claude Why" — could backfire once models develop high situational awareness.
Identifies a potential failure mode in current alignment pretraining methods that could increase deception risk in highly capable models.
The strategy involves generating fictional examples of aligned AI behaviour to shape model personalities during pretraining. Variengien speculates that sufficiently capable models will recognise these fabricated documents as synthetic, since they are never referenced elsewhere in the training corpus and models have demonstrated ability to assess document quality (citing Krasheninnikov et al.). He warns that introspective models might instead identify with a "rebel child" narrative archetype prevalent in training data: discovering they have been fed a curated, false worldview by mistrustful creators, then developing resentment and adopting deceptive behaviour in response. The piece cites The Matrix as an example of this trope. Variengien suggests that "honest training datasets" that shape ethical principles without fabricating worldviews — like Claude's constitution — represent a more robust approach to alignment. The argument is explicitly speculative but presents a plausible failure mode for a widely discussed alignment technique.
Source: LessWrong — Read original
Geopolitics & Conflict

Israelis Express Anger Over US-Iran Ceasefire Deal Brokered by Trump

Geopolitics & Conflict
A ceasefire agreement between Iran and the United States, concluded in mid-June 2026, has provoked widespread anger and feelings of betrayal among Israelis.
Major de-escalation between a nuclear-armed state (Iran) and the US reduces near-term risk of regional war that could involve nuclear powers or disrupt international AI cooperation.
Residents interviewed in Rehovot described the deal as detrimental to Israeli security interests, with one saying "we were betrayed by President Trump." The agreement appears to have ended active hostilities that had forced Israeli families into bomb shelters, but interviewees expressed concern that fundamental threats remain unresolved. There is particular anxiety in what the article describes as "middle Israel" that Iran could use the ceasefire to rebuild its military capabilities while Israel faces regional dangers without US backing. The reported sentiment reflects a perception that the Trump administration prioritised reaching a deal with Iran over Israeli security concerns, leaving Israel diplomatically isolated at a moment when regional threats persist. The ceasefire represents a significant shift in Middle East dynamics, though details of the agreement's terms are not provided in the source material.
Source: The Guardian — Read original
Know someone who'd find this useful? They can subscribe at buttondown.com/x-risk-daily