Model Releases

Latest Model Releases on Megadose. AI news ranked, decayed, deduped.

31 recent items

  1. xAI launches Grok 4.3, featuring "always-on reasoning", 1M token context window, and low API pricing, and releases a voice cloning suite called Custom Voices (Carl Franzen/VentureBeat)
    Techmeme
    xAI released Grok 4.3 with always-on reasoning, 1M token context, and aggressive API pricing alongside Custom Voices, a new voice cloning suite.
  2. Cybersecurity analysis: GPT-5.5 reaches a similar level of performance as Mythos Preview and is the second model to solve a multi-step cyberattack simulation (AI Security Institute)
    Techmeme
    AI Security Institute's evaluation finds GPT-5.5 matches Mythos Preview in cyber capabilities, becoming the second model to complete a multi-step cyberattack simulation.
  3. SenseTime releases SenseNova U1 models on HuggingFace
    TestingCatalog
    SenseTime released SenseNova-U1, open multimodal models unifying image understanding and generation using a novel architecture without visual encoders or VAEs, now available on HuggingFace.
  4. Anthropic unveils BioMysteryBench to test Claude's bioinformatics skills against human experts, and says Mythos solved ~30% of 23 questions that stumped experts (Anthropic)
    Techmeme
    Anthropic released BioMysteryBench, a benchmark for evaluating Claude's bioinformatics capabilities against human experts, with Mythos solving ~30% of questions that stumped specialists.
  5. Standard Intelligence, which is developing computer use AI models, raised $75M led by Sequoia and Spark at a $500M post-money valuation (Rocket Drew/The Information)
    Techmeme
    Standard Intelligence raised $75M from Sequoia and Spark at a $500M valuation to develop computer-use AI models, joining the competitive landscape alongside Anthropic and OpenAI.
  6. Mistral AI unveils Medium 3.5 model and Work Mode for Le Chat
    TestingCatalog
    Mistral released Medium 3.5, a 128B model with a 256K context window, via CLI and Le Chat, supporting coding and vision tasks in sandboxed sessions with 4-GPU backing.
  7. Mistral Médium 3.5 is here
    r/LocalLLaMA
    Mistral released Medium 3.5 (128B) on HuggingFace, filling the mid-tier slot in its lineup and available for local deployment.
  8. US startup Poolside debuts its first open-weight model, Laguna XS.2, a 33B-A3B-parameter MoE model, and Laguna M.1, a proprietary 225B-A23B-parameter MoE model (Carl Franzen/VentureBeat)
    Techmeme
    Poolside released its first open-weight model, Laguna XS.2 (33B-A3B MoE), alongside proprietary Laguna M.1 (225B-A23B MoE), targeting local agentic coding applications.
  9. microsoft/VibeVoice
    Simon Willison
    Microsoft released VibeVoice, a Whisper-style open-source speech-to-text model with built-in speaker diarization, MIT licensed and available via MLX for local Mac inference.
  10. Microsoft Presents "TRELLIS.2": An Open-Source, 4B-Parameter, Image-To-3D Model Producing Up To 1536³ PBR Textured Assets, Built On Native 3D VAEs With 16× Spatial Compression, Delivering Efficient, Scalable, High-Fidelity Asset Generation.
    r/LocalLLaMA
    Microsoft released TRELLIS.2, a 4B-parameter open-source image-to-3D model with native 3D VAEs achieving 16× spatial compression to generate high-fidelity PBR-textured assets up to 1536³ resolution.
  11. Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence
    HF Daily Papers
    NVIDIA releases Nemotron 3 Nano Omni, the first model in their series to natively support audio alongside text, images, and video, built on their efficient 30B-A3B backbone.
  12. DeepSeek Launches New-Generation V4 Models
    The Information
    DeepSeek releases V4, its new generation of open-source AI models with enhanced reasoning and coding capabilities, the first major release since January's R1 sensation.
  13. Introducing GPT-5.5
    OpenAI Blog
    OpenAI released GPT-5.5, its most capable model yet, optimized for agentic coding, computer use, and long-context scientific research with improved autonomy.
  14. xAI launches Grok Voice Think Fast 1.0 for voice agents
    TestingCatalog
    xAI launches Grok Voice Think Fast 1.0, a real-time voice model for business workflow automation via API.
  15. Tencent releases Hy3-preview, its first AI model developed under former OpenAI researcher Yao Shunyu; the model features 295B parameters, down from HY2's 400B (Vincent Chow/South China Morning Post)
    Techmeme
    Tencent releases Hy3-preview, a 295B-parameter flagship model and the company's first developed under former OpenAI researcher Yao Shunyu.
  16. Introducing ChatGPT Images 2.0
    OpenAI Blog
    OpenAI releases ChatGPT Images 2.0, a major image generation upgrade with superior text rendering, multilingual prompts, and advanced visual reasoning integrated into ChatGPT.
  17. Alibaba launches Qwen3.6-27B, an open-weight dense model with 27B parameters, saying it surpasses Qwen3.5-397B-A17B on major coding benchmarks (Qwen)
    Techmeme
    Alibaba releases Qwen3.6-27B, an open-weight dense model that claims to outperform the larger MoE Qwen3.5-397B-A17B on major coding benchmarks.
  18. Introducing OpenAI Privacy Filter
    OpenAI Blog
    OpenAI released an open-weight PII detection and redaction model with state-of-the-art accuracy, available for download and integration.
  19. China’s Moonshot AI Launches New Coding Model Ahead of DeepSeek’s V4 Release
    The Information
    Moonshot AI released Kimi K2.6, a coding-specialized model positioned ahead of DeepSeek's anticipated V4 launch, intensifying competition among Chinese AI labs.
  20. Google now offers two research agents: Deep Research, replacing its December preview release, and Deep Research Max, both available via Gemini API paid tiers (The Keyword)
    Techmeme
    Google launches two research agents via Gemini API paid tiers: Deep Research, which replaces the December preview, and the higher-tier Deep Research Max.
  21. OpenAI Takes Aim at Google with New Image Model
    The Information
    OpenAI is testing a new image generation model (reportedly 'gpt-image-2') producing photorealistic results nearly indistinguishable from real images, positioned as competition for Google.
  22. Qwen3.5-Omni Technical Report
    HF Daily Papers
    Qwen3.5-Omni is a multimodal omni-model scaling to hundreds of billions of parameters with 256K context, achieving SOTA on 215 audio/visual tasks and surpassing Gemini-3.1 Pro in audio tasks.
  23. Google’s New Model Makes Robotic Brains Slightly Smarter
    The Information
    Google DeepMind released Gemini Robotics-ER-1.6, a vision-language model for robotics that shows incremental improvements, particularly with multi-camera setups.
  24. Introducing GPT-Rosalind for life sciences research
    OpenAI Blog
    OpenAI released GPT-Rosalind, a domain-specific frontier reasoning model for life sciences covering drug discovery, genomics, and protein reasoning, now in research preview with Moderna and Amgen as launch partners.
  25. Alibaba unveils Qwen3.6-35B-A3B, an open-weight MoE model with 35B total and 3B active parameters, saying it rivals larger dense models in agentic coding tasks (Qwen)
    Techmeme
    Alibaba released Qwen3.6-35B-A3B, an open-weight MoE model with 35B total/3B active parameters that claims to match larger dense models on agentic coding tasks.
  26. Anthropic releases Claude Opus 4.7, saying it is a "notable improvement" on Opus 4.6 in advanced software engineering and comes with a new "xhigh" effort level (Anthropic)
    Techmeme
    Anthropic releases Claude Opus 4.7 with notable improvements in advanced software engineering and a new 'xhigh' effort level for extended reasoning.
  27. Alibaba's new Token Hub unit releases Happy Oyster, a new AI world model that can create 3D environments, interactive videos, films, video content, and games (Luz Ding/Bloomberg)
    Techmeme
    Alibaba's Token Hub unit releases Happy Oyster, a world model capable of generating 3D environments, interactive videos, films, and games, signaling Alibaba's push into AI-generated virtual worlds.
  28. Physical Intelligence, a hot robotics startup, says its new robot brain can figure out tasks it was never taught
    TechCrunch AI
    Physical Intelligence released π0.7, a general-purpose robot brain enabling robots to perform tasks never explicitly taught, marking early progress toward generalist manipulation.
  29. Seedance 2.0: Advancing Video Generation for World Complexity
    HF Daily Papers
    Seedance 2.0 is a native multi-modal audio-video generation model supporting text, image, audio, and video inputs, released in China with SOTA-quality generation.
  30. [AINews] Meta Superintelligence Labs announces Muse Spark, first frontier model on their completely new stack
    Latent Space
    Meta Superintelligence Labs announced Muse Spark, their first frontier model built on a completely new tech stack.
  31. Gemma 4: Byte for byte, the most capable open models
    Google DeepMind
    Google DeepMind releases Gemma 4, its most capable open model family for advanced reasoning and agentic workflows.