Kimi K2.7-Code: open-source coding model with better token efficiency
HN · Hugging Face Models
· Jun 12, 2026
Kimi K2.7-Code ships as an open coding model with stronger token efficiency, drawing major developer attention.
Xiaomi releases MiMo Code V0.1.0, an open-source AI coding assistant that it says outperforms Claude Code on agentic coding and software engineering benchmarks (Carl Franzen/VentureBeat)
Techmeme
· Jun 12, 2026
Xiaomi released MiMo Code V0.1.0, an open-source coding assistant claiming strong results on long-horizon software engineering benchmarks.
DiffusionGemma
Simon Willison
· Jun 10, 2026
Google released DiffusionGemma, an Apache-licensed open-weight Gemma model using diffusion-style text generation for much faster decoding.
ORCA: A Platform for Open-Source Dexterity Research
ArXiv · AI/CL/LG
· Jun 12, 2026
ORCA releases an open-source dexterity stack for robot hand control, simulation, teleoperation, and retargeting.
Google introduces DiffusionGemma, an experimental 26B-parameter open model that uses text diffusion for faster text generation compared to autoregressive models (The Keyword)
Techmeme
· Jun 10, 2026
Google released DiffusionGemma, a 26B open text-diffusion model aimed at faster generation than autoregressive systems.
Open Reproduction of DeepSeek-R1
HN · GitHub AI
· Jun 11, 2026
An open-source effort to reproduce DeepSeek-R1 gives researchers a public path to inspect and rebuild its reasoning pipeline.
Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation
HF Daily Papers
· Jun 10, 2026
Pythagoras-Prover releases open-source Lean theorem-proving models and training data focused on lower compute budgets.
RedAct: Redacting Agent Capability Traces for Procedural Skill Protection
HF Daily Papers
· Jun 10, 2026
RedAct introduces CapTraceBench and a trace-redaction framework for protecting procedural skills while preserving agent auditability.
Kwai Keye-VL-2.0 Technical Report
HF Daily Papers
· Jun 9, 2026
Kwai released Keye-VL-2.0-30B-A3B, an open-source multimodal MoE model targeting hour-long video understanding with 256K context.
i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models
HF Daily Papers
· Jun 9, 2026
i1 releases a fully open text-to-image model recipe with weights, data, code, and large-scale training ablations.
Apache Burr: Build reliable AI agents and applications
HN · Agents
· Jun 10, 2026
Apache Burr is an open-source framework for building stateful AI agents and applications with more reliable execution paths.
AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis
ArXiv · AI/CL/LG
· Jun 8, 2026
AutoMegaKernel uses statically checked agent-generated schedules to compile Llama-family models into single persistent CUDA megakernels.
MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning
HF Daily Papers
· Jun 6, 2026
MuJoCo-Drones-Gym releases a Gymnasium-compatible multi-drone simulator with GPU-accelerated reinforcement learning support.
pg_durable: Microsoft open sources in-database durable execution
HN · GitHub AI
· Jun 5, 2026
Microsoft open sourced pg_durable, bringing durable execution primitives directly into PostgreSQL for workflow and agent infrastructure.
Google introduces Gemma 4 12B, a unified, encoder-free open multimodal model that can run locally on devices with 16GB of VRAM or unified memory (Carl Franzen/VentureBeat)
Techmeme
· Jun 3, 2026
Google released Gemma 4 12B, an open multimodal model designed to process audio and video locally on 16GB devices.
Anthropic's open-source framework for AI-powered vulnerability discovery
HN · Inference
· Jun 4, 2026
Anthropic released an open-source framework for AI-assisted vulnerability discovery, giving security researchers reusable infrastructure for automated bug-finding workflows.
Open Code Review – An AI-powered code review CLI tool
HN · GitHub AI
· Jun 5, 2026
Open Code Review is an AI-powered CLI for automated code review, drawing early developer attention on Hacker News.
KVarN: Native vLLM backend for KV-cache quantization by Huawei
HN · LLMs
· Jun 4, 2026
Huawei released KVarN, a native vLLM backend for KV-cache quantization aimed at reducing inference memory costs.
The next chapter in flood resilience: Open sourcing Google’s hydrology framework
Google Research Blog
· Jun 3, 2026
Google open-sourced its hydrology framework for flood resilience, making climate modeling infrastructure available to developers and researchers.
Nvidia launches Nemotron 3 Ultra, a 550B-parameter MoE open model; Artificial Analysis: it's the smartest open US model but trails the Chinese model Kimi K2.6 (Maximilian Schreiner/The Decoder)
Techmeme
· Jun 2, 2026
Nvidia released Nemotron 3 Ultra, a 550B-parameter open MoE model rated as the strongest open U.S. model.
Microsoft releases ASSERT, an open-source framework that lets developers generate and run AI behavior tests using natural-language descriptions (Ram Iyer/TechCrunch)
Techmeme
· Jun 2, 2026
Microsoft released ASSERT, an open-source framework for generating and running AI behavior regression tests from natural-language specs.
TinyFish Bigset turns text prompts into live datasets from web
TestingCatalog
· Jun 2, 2026
TinyFish launched Bigset, an open-source multi-agent system that turns plain-language prompts into self-refreshing web datasets.
Nvidia unveils Cosmos 3, an open physical AI foundation model, to help robots and autonomous cars better understand the real world with limited training data (Ina Fried/Axios)
Techmeme
· Jun 1, 2026
Nvidia released Cosmos 3 as an open physical AI foundation model for robotics and autonomous driving training.
Nvidia unveils Isaac GR00T, an open humanoid reference design powered by its Jetson Thor chip, combining a Unitree H2 Plus robot and Sharpa five-fingered hands (Stephen Nellis/Reuters)
Techmeme
· Jun 1, 2026
Nvidia unveiled Isaac GR00T as an open humanoid reference design built around Jetson Thor and commercial robot components.
Microsoft offers devs a better way to control AI agent behavior
TechCrunch AI
· Jun 2, 2026
Microsoft introduced Agent Control Specification, an open-source standard for portable policy files governing AI agent behavior.
OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents
HF Daily Papers
· Jun 1, 2026
OpenWebRL introduces an open framework for training visual web agents with online multi-turn reinforcement learning.
Mellum2 Technical Report
HF Daily Papers
· May 29, 2026
Mellum 2 is an open-weight 12B MoE model specialized for software engineering, coding agents, and tool use.
Introducing Search Toolkit
Mistral AI News
· May 28, 2026
Mistral introduced Search Toolkit, a composable framework for building production search pipelines in AI applications.
GPIC: A Giant Permissive Image Corpus for Visual Generation
ArXiv · AI/CL/LG
· May 28, 2026
GPIC releases a 100 million-image permissive corpus with benchmarks and baselines for visual generative modeling.
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models
HF Daily Papers
· May 28, 2026
minWM releases an open framework for converting video diffusion models into low-latency interactive world models.
A new dataset with more that 100M hi-quality, curated images, with captions and meta data! [P]
r/MachineLearning
· May 28, 2026
Jasper released MONET, an Apache-licensed 104.9 million image-text dataset with tooling and a paper for text-to-image training.
PrismML just released Binary and Ternary Bonsai Image 4B: 1-bit/ternary text-to-image diffusion transformers that can even run 100% locally in your browser on WebGPU.
r/LocalLLaMA
· May 26, 2026
PrismML released 1-bit and ternary 4B text-to-image diffusion models that can run locally in browser via WebGPU.
NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable)
r/LocalLLaMA
· May 25, 2026
NuExtract3 is a self-hostable 4B open VLM for OCR, Markdown conversion, and structured document extraction.
QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks
HF Daily Papers
· May 22, 2026
QUEST releases open 2B-35B deep research agent models trained on synthetic long-horizon search and report-generation tasks.
BeeLlama v0.2.0 – major DFlash update. Single RTX 3090: Qwen 3.6 27B up to 164 tps (4.40x), Gemma 4 31B up to 177.8 tps (4.93x). Prompt processing speed near baseline.
r/LocalLLaMA
· May 22, 2026
BeeLlama v0.2.0 adds a major DFlash update, speeding Qwen and Gemma inference on single-GPU consumer hardware.
Cohere releases Command A+, a sparse MoE open model built for agentic tasks, with 218B total and 25B active parameters, its first under the Apache 2.0 license (Carl Franzen/VentureBeat)
Techmeme
· May 21, 2026
Cohere releases Command A+, its first fully Apache 2.0 licensed open model: a sparse MoE with 218B total/25B active parameters featuring lossless quantization and native citations, targeting agentic workflows.
DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback
ArXiv · AI/CL/LG
· May 21, 2026
DeltaBox proposes DeltaState, an OS-level abstraction for millisecond-level sandbox checkpoint/rollback by exploiting high similarity between consecutive agent checkpoints.
HarnessAPI: A Skill-First Framework for Unified Streaming APIs and MCP Tools
ArXiv · AI/CL/LG
· May 21, 2026
HarnessAPI generates HTTP endpoints with SSE streaming, OpenAPI UI, and MCP tool registration from a single Pydantic-typed skill folder.
torchtune: PyTorch native post-training library
ArXiv · AI/CL/LG
· May 20, 2026
Meta releases torchtune, a PyTorch-native fine-tuning library emphasizing modularity and hackability over specialized recipes, covering LoRA, DPO, and RLVR workflows.
OpenComputer: Verifiable Software Worlds for Computer-Use Agents
HF Daily Papers
· May 19, 2026
OpenComputer provides a verifier-grounded framework for computer-use agents with 33 desktop apps and 1,000 machine-checkable tasks, including self-evolving verification and auditable partial-credit rewards.
Toto 2.0: Time Series Forecasting Enters the Scaling Era
ArXiv · AI/CL/LG
· May 19, 2026
Toto 2.0 releases five open-weights time series forecasting models (4M–2.5B parameters) under Apache 2.0, setting new SOTA on BOOM, GIFT-Eval, and TIME benchmarks.
VibeBench/VibeSearchBench
GitHub · LLM repos
· May 20, 2026
VibeSearchBench introduces a multi-turn search-agent benchmark with verifiable knowledge-graph evaluation and early GitHub traction.
bytedance released an open source model that attempts to do just about anything with only 3b parameters
r/LocalLLaMA
· May 19, 2026
ByteDance released an open-source 3B parameter model designed as a general-purpose model, marking the company's entry into the compact open-weights space.
Google’s AI Studio now lets anyone build Android apps in minutes
TechCrunch AI
· May 19, 2026
Google AI Studio now includes web-based tools that generate native Android apps in minutes, targeting AI-powered software development.
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL
HF Daily Papers
· May 18, 2026
EnvFactory automates executable environment synthesis and robust RL training for tool-use agents, generating realistic multi-turn interaction data without costly real-world APIs.
MementoGUI: Learning Agentic Multimodal Memory Control for Long-Horizon GUI Agents
HF Daily Papers
· May 18, 2026
MementoGUI adds a plug-in memory framework with a learned controller for online memory selection, compression, and retrieval to improve long-horizon GUI agent performance.
Introducing Google Antigravity 2.0
Google DeepMind
· May 17, 2026
Google launches Antigravity 2.0 with an updated desktop app and CLI tool for building agentic AI applications.