Model Releases

Latest Model Releases on Megadose. AI news ranked, decayed, deduped.

29 recent items

  1. Kimi K2.7-Code: open-source coding model with better token efficiency
    HN · Hugging Face Models
    Kimi K2.7-Code ships as an open coding model with stronger token efficiency, drawing major developer attention.
  2. Claude Fable 5 and Claude Mythos 5
    Anthropic News
    Anthropic released Claude Fable 5, making a Mythos-class model publicly available for general use.
  3. Anthropic’s New Model Targets Power Users But Cuts Off AI Rivals
    The Information
    Anthropic released Claude Fable 5, a stronger model for coding, knowledge work, and spatial reasoning with restricted cybersecurity handling.
  4. DiffusionGemma: 4x faster text generation
    Google DeepMind
    Google DeepMind released DiffusionGemma, a diffusion-based text model claiming 4x faster generation than comparable autoregressive approaches.
  5. DiffusionGemma
    Simon Willison
    Google released DiffusionGemma, an Apache-licensed open-weight Gemma model using diffusion-style text generation for much faster decoding.
  6. MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling
    HF Daily Papers
    MiniMax's M3 proof model and MaxProof test-time scaling report gold-medal-level performance on IMO and USAMO proof tasks.
  7. Introducing Gemma 4 12B: a unified, encoder-free multimodal model
    Google DeepMind
    Google DeepMind released Gemma 4 12B, a unified multimodal model that removes separate encoders for cross-modal processing.
  8. Fluid, natural voice translation with Gemini 3.5 Live Translate
    Google DeepMind
    Google launched near real-time natural speech translation across AI Studio, Translate, and Meet using Gemini 3.5 Live Translate.
  9. Google releases Gemini 3.5 Live Translate, which it says can deliver "near real-time speech-to-speech translation in over 70 languages" (The Keyword)
    Techmeme
    Google released a Gemini 3.5 speech translation capability for near real-time conversations across more than 70 languages.
  10. Anthropic researchers say Mythos Preview can now turn publicly disclosed software vulnerabilities, or N-days, into working exploits in hours instead of weeks (Sam Sabin/Axios)
    Techmeme
    Anthropic’s Mythos Preview can reportedly convert disclosed vulnerabilities into working exploits within hours, advancing AI-assisted cyber offense capabilities.
  11. Decart’s new world model can simulate hours of photorealistic driving — with some caveats
    TechCrunch AI
    Decart released Oasis 3, a real-time world model API for generating photorealistic driving simulations for autonomous vehicle testing.
  12. Google and Nvidia are helping Apple with Apple Foundation Model Cloud Pro, which Apple says is comparable to Gemini frontier models and runs on Nvidia GPUs (Kif Leswing/CNBC)
    Techmeme
    Apple announced a cloud-hosted foundation model built with Google and Nvidia support, claiming Gemini-level frontier capability for Siri workloads.
  13. Kwai Keye-VL-2.0 Technical Report
    HF Daily Papers
    Kwai released Keye-VL-2.0-30B-A3B, an open-source multimodal MoE model targeting hour-long video understanding with 256K context.
  14. i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models
    HF Daily Papers
    i1 releases a fully open text-to-image model recipe with weights, data, code, and large-scale training ablations.
  15. Microsoft Build 2026 recap, from Windows to Copilot, all AI
    TestingCatalog
    Microsoft introduced seven MAI models spanning coding, reasoning, image, and voice, including MAI-Thinking-1 in private preview on Foundry.
  16. Introducing new capabilities to GPT-Rosalind
    OpenAI Blog
    OpenAI upgraded GPT-Rosalind with stronger biology, chemistry, genomics, and lab workflow capabilities for life sciences research.
  17. Alibaba releases Qwen3.7-Plus, a multimodal proprietary model with a 1M-token context window, costing $2 per 1M tokens, 60% less than text-only Qwen3.7-Max (Carl Franzen/VentureBeat)
    Techmeme
    Alibaba released a proprietary multimodal Qwen model with a 1M-token context window and aggressive low-cost pricing.
  18. Microsoft debuts MAI-Thinking-1, its first advanced reasoning model, trained "from the ground up on clean data, without distillation from third-party models" (Jay Peters/The Verge)
    Techmeme
    Microsoft debuted MAI-Thinking-1, its first advanced reasoning model trained from scratch without third-party distillation.
  19. Microsoft Unveils New Homegrown AI, OpenClaw-Inspired Agents for Businesses
    The Information
    Microsoft introduced in-house AI models and enterprise agent-building tools for automating tasks over company data.
  20. Nvidia launches Nemotron 3 Ultra, a 550B-parameter MoE open model; Artificial Analysis: it's the smartest open US model but trails the Chinese model Kimi K2.6 (Maximilian Schreiner/The Decoder)
    Techmeme
    Nvidia released Nemotron 3 Ultra, a 550B-parameter open MoE model that Artificial Analysis rates as the smartest open U.S. model, though it trails the Chinese model Kimi K2.6.
  21. Cosmos 3: Omnimodal World Models for Physical AI
    HF Daily Papers
    Cosmos 3 introduces omnimodal world models for physical AI, unifying language, image, video, audio, and action generation.
  22. Anthropic Releases New Flagship AI Model
    The Information
    Anthropic released Claude Opus 4.8 with improved coding, financial analysis, and uncertainty-reporting performance.
  23. Introducing Claude Opus 4.8
    Anthropic News
    Anthropic released Claude Opus 4.8 with improved coding, agentic task performance, and long-running work consistency.
  24. The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence
    HF Daily Papers
    MiniMax introduced M2, a 230B-parameter MoE model series with sparse activation and agent-focused training infrastructure.
  25. Quoting SpaceX S-1
    Simon Willison
    Anthropic signed a $1.25B/month cloud services agreement with SpaceX for compute access across COLOSSUS and COLOSSUS II through May 2029, while SpaceX trains Grok 5 on the same infrastructure. The deal makes SpaceX a major AI compute provider.
  26. Gemini 3.5 Flash
    HN · Frontpage AI
    Google released Gemini 3.5 Flash, a new Gemini model drawing broad developer discussion.
  27. Qwen3.7-Max: The Agent Frontier
    HN · Agents
    Qwen3.7-Max was released as a new agent-focused frontier model, drawing substantial early developer discussion.
  28. OpenAI claims it solved an 80-year-old math problem — for real this time
    TechCrunch AI
    OpenAI says a reasoning model disproved a 1946 geometry conjecture, with outside mathematicians validating the result.
  29. Introducing Gemini Omni
    Google DeepMind
    Google DeepMind announces Gemini Omni, a new multimodal model that reasons across text, images, audio, and video through simple conversation.