Anthropic shipped Claude Opus 4.8 today, posting 88.6% on SWE-bench Verified — the industry's gold-standard coding benchmark — and 74.6% on Terminal-Bench 2.1, placing it firmly at the frontier of agentic coding capability. The release lands on the same day Anthropic filed confidentially for an IPO, making it a banner Monday for the company.
Google's Gemini 3.5 Flash reached GA on June 1, billed as "frontier-level intelligence at 4x the speed" of comparable models, priced at $1.50/$9 per million tokens with a 1M token context window. In a significant housekeeping move, Google simultaneously retired the entire Gemini 2.0 model family, making 3.x the new baseline across all products and APIs.
Meta's newly formed Superintelligence Labs — built around the $14B deal to bring in Scale AI's Alexandr Wang — has released its first model, Muse Spark (internal codename: Avocado). The debut signals Meta's intent to compete on frontier closed-model capability, backed by $115–135B in planned 2026 AI capex.
Anthropic filed confidentially with the SEC for a U.S. IPO at a $965B post-money valuation — more than double its $380B February figure — projecting $10.9B in Q2 revenue and its first profitable quarter. A Pentagon supply-chain dispute adds legal complexity, but the financial story here is unmistakable.
Microsoft will unveil MAI-Image 2.5 (two variants, adds image editing), MAI-Voice 2 (15 new languages, emotional range including angry/confused/embarrassed), and MAI-Transcribe 1.5 at Build 2026 in San Francisco on June 2. While Google ships today, Microsoft ships tomorrow — the model cadence is genuinely relentless right now.
Cognition closed a $1B round led by Lux Capital, General Catalyst, and 8VC, reaching a $26B valuation up from $10.2B just eight months ago. Devin now writes ~90% of Cognition's own code; ARR sits at $492M annualized growing 50% month over month, with enterprise customers including NASA, Goldman Sachs, and Mercedes-Benz.
GitHub Copilot flipped to AI-credit billing on June 1, metering most features by token consumption and API rates. Code completions and Next Edit Suggestions remain unmetered, but all agentic features and chat now tick a counter. This is an immediate, meaningful cost-model shift for every enterprise development team using Copilot at scale.
OpenAI released its Frontier Governance Framework last week, outlining standards for third-party trustworthy model evaluations and safety testing. The document arrived alongside Gartner naming OpenAI a Leader in enterprise coding agents — a pairing that reads as both a policy signal and a sales move, likely timed ahead of competitive pressure from Anthropic's IPO narrative.
At Google I/O on May 19, Sundar Pichai revealed Gemini monthly active users hit 900 million — up from 400M a year ago — and cut Gemini Ultra pricing from $250 to $200/month. DeepMind also struck an $80–90M licensing deal to absorb 20+ Contextual AI researchers including CEO Douwe Kiela, a notable talent consolidation as model competition intensifies.
Elon Musk's xAI held partnership talks with French AI lab Mistral in a bid to form a third major pole in the market outside OpenAI and Anthropic. Mistral is simultaneously moving aggressively on its own infrastructure with $830M in debt financing, a proprietary chip design program, and a dedicated data center — all under a European AI sovereignty banner.