AI Security Institute's evaluation finds GPT-5.5 matches Mythos Preview in cyber capabilities, becoming the second model to complete a multi-step cyberattack simulation.
SenseTime released SenseNova-U1, open multimodal models unifying image understanding and generation using a novel architecture without visual encoders or VAEs, now available on HuggingFace.
Anthropic released BioMysteryBench, a benchmark for evaluating Claude's bioinformatics capabilities against human experts, with Mythos solving ~30% of questions that stumped specialists.
Standard Intelligence raised $75M from Sequoia and Spark at a $500M valuation to develop computer-use AI models, joining the competitive landscape alongside Anthropic and OpenAI.
Microsoft released VibeVoice, a Whisper-style open-source speech-to-text model with built-in speaker diarization, MIT licensed and available via MLX for local Mac inference.
Microsoft released TRELLIS.2, a 4B-parameter open-source image-to-3D model with native 3D VAEs achieving 16× spatial compression to generate high-fidelity PBR-textured assets up to 1536³ resolution.
NVIDIA releases Nemotron 3 Nano Omni, the first model in their series to natively support audio alongside text, images, and video, built on their efficient 30B-A3B backbone.
DeepSeek releases V4, its new generation open-source AI models with enhanced reasoning and coding capabilities, the first major release since January's R1 sensation.
OpenAI released GPT-5.5, its most capable model yet, optimized for agentic coding, computer use, and long-context scientific research with improved autonomy.
Tencent releases Hy3-preview, a 295B-parameter flagship model led by former OpenAI researcher Yao Shunyu, marking the company's first model under his direction.
OpenAI releases ChatGPT Images 2.0, a major image generation upgrade with superior text rendering, multilingual prompts, and advanced visual reasoning integrated into ChatGPT.
Moonshot AI released Kimi K2.6, a coding-specialized model positioned ahead of DeepSeek's anticipated V4 launch, intensifying competition among Chinese AI labs.
Google launches two research agent tiers (Deep Research, Deep Research Max) via Gemini API paid tiers, replacing the December preview with a higher-tier option.
OpenAI is testing a new image generation model (reportedly 'gpt-image-2') producing photorealistic results nearly indistinguishable from real images, positioned as competition for Google.
Qwen3.5-Omni is a multimodal omni-model scaling to hundreds of billions of parameters with 256k context, achieving SOTA on 215 audio/visual tasks and surpassing Gemini-3.1 Pro in audio tasks.
Google DeepMind released Gemini Robotics-ER-1.6, a vision-language model for robotics that shows incremental improvements, particularly with multi-camera setups.
OpenAI released GPT-Rosalind, a domain-specific frontier reasoning model for life sciences covering drug discovery, genomics, and protein reasoning, now in research preview with Moderna and Amgen as launch partners.
Alibaba released Qwen3.6-35B-A3B, an open-weight MoE model with 35B total/3B active parameters that claims to match larger dense models on agentic coding tasks.
Alibaba's Token Hub division releases Happy Oyster, a world model capable of generating 3D environments, interactive videos, films, and games—signaling Alibaba's push into AI-generated virtual worlds.
Physical Intelligence released π0.7, a general-purpose robot brain enabling robots to perform tasks never explicitly taught, marking early progress toward generalist manipulation.
Seedance 2.0 is a native multi-modal audio-video generation model supporting text, image, audio, and video inputs, released in China with SOTA-quality generation.