Kimi K2.7-Code: open-source coding model with better token efficiency
HN · Hugging Face Models
· Jun 12, 2026
Kimi K2.7-Code ships as an open coding model with stronger token efficiency, drawing major developer attention.
Claude Fable 5 and Claude Mythos 5
Anthropic News
· Jun 9, 2026
Anthropic released Claude Fable 5, making a Mythos-class model publicly available for general use.
Anthropic’s New Model Targets Power Users But Cuts Off AI Rivals
The Information
· Jun 10, 2026
Anthropic released Claude Fable 5, a stronger model for coding, knowledge work, and spatial reasoning with restricted cybersecurity handling.
DiffusionGemma: 4x faster text generation
Google DeepMind
· Jun 10, 2026
Google DeepMind released DiffusionGemma, a diffusion-based text model claiming 4x faster generation than comparable autoregressive approaches.
DiffusionGemma
Simon Willison
· Jun 10, 2026
Google released DiffusionGemma, an Apache-licensed open-weight Gemma model using diffusion-style text generation for much faster decoding.
MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling
HF Daily Papers
· Jun 11, 2026
MiniMax reports gold-medal-level performance on IMO and USAMO proof tasks using its M3 proof model with MaxProof test-time scaling.
Introducing Gemma 4 12B: a unified, encoder-free multimodal model
Google DeepMind
· Jun 9, 2026
Google DeepMind released Gemma 4 12B, a unified multimodal model that removes separate encoders for cross-modal processing.
Fluid, natural voice translation with Gemini 3.5 Live Translate
Google DeepMind
· Jun 9, 2026
Google launched near real-time natural speech translation across AI Studio, Translate, and Meet using Gemini 3.5 Live Translate.
Google releases Gemini 3.5 Live Translate, which it says can deliver "near real-time speech-to-speech translation in over 70 languages" (The Keyword)
Techmeme
· Jun 9, 2026
Google released a Gemini 3.5 speech translation capability for near real-time conversations across more than 70 languages.
Anthropic researchers say Mythos Preview can now turn publicly disclosed software vulnerabilities, or N-days, into working exploits in hours instead of weeks (Sam Sabin/Axios)
Techmeme
· Jun 8, 2026
Anthropic’s Mythos Preview can reportedly convert disclosed vulnerabilities into working exploits within hours, advancing AI-assisted cyber offense capabilities.
Decart’s new world model can simulate hours of photorealistic driving — with some caveats
TechCrunch AI
· Jun 10, 2026
Decart released Oasis 3, a real-time world model API for generating photorealistic driving simulations for autonomous vehicle testing.
Google and Nvidia are helping Apple with Apple Foundation Model Cloud Pro, which Apple says is comparable to Gemini frontier models and runs on Nvidia GPUs (Kif Leswing/CNBC)
Techmeme
· Jun 9, 2026
Apple announced a cloud-hosted foundation model built with Google and Nvidia support, claiming Gemini-level frontier capability for Siri workloads.
Kwai Keye-VL-2.0 Technical Report
HF Daily Papers
· Jun 9, 2026
Kwai released Keye-VL-2.0-30B-A3B, an open-source multimodal MoE model targeting hour-long video understanding with 256K context.
i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models
HF Daily Papers
· Jun 9, 2026
i1 releases a fully open text-to-image model recipe with weights, data, code, and large-scale training ablations.
Microsoft Build 2026 recap, from Windows to Copilot, all AI
TestingCatalog
· Jun 3, 2026
Microsoft introduced seven MAI models spanning coding, reasoning, image, and voice, including MAI-Thinking-1 in private preview on Foundry.
Introducing new capabilities to GPT-Rosalind
OpenAI Blog
· Jun 3, 2026
OpenAI upgraded GPT-Rosalind with stronger biology, chemistry, genomics, and lab workflow capabilities for life sciences research.
Alibaba releases Qwen3.7-Plus, a multimodal proprietary model with a 1M-token context window, costing $2 per 1M tokens, 60% less than text-only Qwen3.7-Max (Carl Franzen/VentureBeat)
Techmeme
· Jun 3, 2026
Alibaba released Qwen3.7-Plus, a proprietary multimodal model with a 1M-token context window, priced at $2 per 1M tokens.
Microsoft debuts MAI-Thinking-1, its first advanced reasoning model, trained "from the ground up on clean data, without distillation from third-party models" (Jay Peters/The Verge)
Techmeme
· Jun 2, 2026
Microsoft debuted MAI-Thinking-1, its first advanced reasoning model trained from scratch without third-party distillation.
Microsoft Unveils New Homegrown AI, OpenClaw-Inspired Agents for Businesses
The Information
· Jun 2, 2026
Microsoft introduced in-house AI models and enterprise agent-building tools for automating tasks over company data.
Nvidia launches Nemotron 3 Ultra, a 550B-parameter MoE open model; Artificial Analysis: it's the smartest open US model but trails the Chinese model Kimi K2.6 (Maximilian Schreiner/The Decoder)
Techmeme
· Jun 2, 2026
Nvidia released Nemotron 3 Ultra, a 550B-parameter open MoE model that Artificial Analysis rates as the strongest open U.S. model, though it trails Kimi K2.6.
Cosmos 3: Omnimodal World Models for Physical AI
HF Daily Papers
· Jun 1, 2026
Cosmos 3 introduces omnimodal world models for physical AI, unifying language, image, video, audio, and action generation.
Anthropic Releases New Flagship AI Model
The Information
· May 28, 2026
Anthropic released Claude Opus 4.8 with improved coding, financial analysis, and uncertainty-reporting performance.
Introducing Claude Opus 4.8
Anthropic News
· May 28, 2026
Anthropic released Claude Opus 4.8 with improved coding, agentic task performance, and long-running work consistency.
The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence
HF Daily Papers
· May 26, 2026
MiniMax introduced M2, a 230B-parameter MoE model series with sparse activation and agent-focused training infrastructure.
Quoting SpaceX S-1
Simon Willison
· May 20, 2026
Anthropic signed a $1.25B/month cloud services agreement with SpaceX for compute access across COLOSSUS and COLOSSUS II through May 2029, while SpaceX trains Grok 5 on the same infrastructure. The deal marks a significant shift that makes SpaceX a major AI compute provider.
Gemini 3.5 Flash
HN · Frontpage AI
· May 19, 2026
Google released Gemini 3.5 Flash, a new Gemini model drawing broad developer discussion.
Qwen3.7-Max: The Agent Frontier
HN · Agents
· May 20, 2026
Qwen3.7-Max was released as a new agent-focused frontier model, drawing substantial early developer discussion.
OpenAI claims it solved an 80-year-old math problem — for real this time
TechCrunch AI
· May 20, 2026
OpenAI says a reasoning model disproved a 1946 geometry conjecture, with outside mathematicians validating the result.
Introducing Gemini Omni
Google DeepMind
· May 17, 2026
Google DeepMind announces Gemini Omni, a new multimodal model that reasons across text, images, audio, and video through simple conversation.