OSS & Tools

Latest OSS & Tools on Megadose. AI news ranked, decayed, deduped.

47 recent items

  1. Kimi K2.7-Code: open-source coding model with better token efficiency
    HN · Hugging Face Models
    Kimi K2.7-Code ships as an open coding model with stronger token efficiency, drawing major developer attention.
  2. Xiaomi releases MiMo Code V0.1.0, an open-source AI coding assistant that it says outperforms Claude Code on agentic coding and software engineering benchmarks (Carl Franzen/VentureBeat)
    Techmeme
    Xiaomi released MiMo Code V0.1.0, an open-source coding assistant claiming strong results on long-horizon software engineering benchmarks.
  3. DiffusionGemma
    Simon Willison
    Google released DiffusionGemma, an Apache-licensed open-weight Gemma model using diffusion-style text generation for much faster decoding.
  4. ORCA: A Platform for Open-Source Dexterity Research
    ArXiv · AI/CL/LG
    ORCA releases an open-source dexterity stack for robot hand control, simulation, teleoperation, and retargeting.
  5. Google introduces DiffusionGemma, an experimental 26B-parameter open model that uses text diffusion for faster text generation compared to autoregressive models (The Keyword)
    Techmeme
    Google released DiffusionGemma, a 26B open text-diffusion model aimed at faster generation than autoregressive systems.
  6. Open Reproduction of DeepSeek-R1
    HN · GitHub AI
    An open-source effort to reproduce DeepSeek-R1 gives researchers a public path to inspect and rebuild its reasoning pipeline.
  7. Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation
    HF Daily Papers
    Pythagoras-Prover releases open-source Lean theorem-proving models and training data focused on lower compute budgets.
  8. RedAct: Redacting Agent Capability Traces for Procedural Skill Protection
    HF Daily Papers
    RedAct introduces CapTraceBench and a trace-redaction framework for protecting procedural skills while preserving agent auditability.
  9. Kwai Keye-VL-2.0 Technical Report
    HF Daily Papers
    Kwai released Keye-VL-2.0-30B-A3B, an open-source multimodal MoE model targeting hour-long video understanding with 256K context.
  10. i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models
    HF Daily Papers
    i1 releases a fully open text-to-image model recipe with weights, data, code, and large-scale training ablations.
  11. Apache Burr: Build reliable AI agents and applications
    HN · Agents
    Apache Burr is an open-source framework for building stateful AI agents and applications with more reliable execution paths.
  12. AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis
    ArXiv · AI/CL/LG
    AutoMegaKernel uses statically checked agent-generated schedules to compile Llama-family models into single persistent CUDA megakernels.
  13. MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning
    HF Daily Papers
    MuJoCo-Drones-Gym releases a Gymnasium-compatible multi-drone simulator with GPU-accelerated reinforcement learning support.
  14. pg_durable: Microsoft open sources in-database durable execution
    HN · GitHub AI
    Microsoft open sourced pg_durable, bringing durable execution primitives directly into PostgreSQL for workflow and agent infrastructure.
  15. Google introduces Gemma 4 12B, a unified, encoder-free open multimodal model that can run locally on devices with 16GB of VRAM or unified memory (Carl Franzen/VentureBeat)
    Techmeme
    Google released Gemma 4 12B, an open multimodal model designed to process audio and video locally on 16GB devices.
  16. Anthropic's open-source framework for AI-powered vulnerability discovery
    HN · Inference
    Anthropic released an open-source framework for AI-assisted vulnerability discovery, giving security researchers reusable infrastructure for automated bug-finding workflows.
  17. Open Code Review – An AI-powered code review CLI tool
    HN · GitHub AI
    Open Code Review is an AI-powered CLI for automated code review, drawing early developer attention on Hacker News.
  18. KVarN: Native vLLM backend for KV-cache quantization by Huawei
    HN · LLMs
    Huawei released KVarN, a native vLLM backend for KV-cache quantization aimed at reducing inference memory costs.
  19. The next chapter in flood resilience: Open sourcing Google’s hydrology framework
    Google Research Blog
    Google open-sourced its hydrology framework for flood resilience, making climate modeling infrastructure available to developers and researchers.
  20. Nvidia launches Nemotron 3 Ultra, a 550B-parameter MoE open model; Artificial Analysis: it's the smartest open US model but trails the Chinese model Kimi K2.6 (Maximilian Schreiner/The Decoder)
    Techmeme
    Nvidia released Nemotron 3 Ultra, a 550B-parameter open MoE model that Artificial Analysis rates as the smartest open U.S. model, though it trails the Chinese model Kimi K2.6.
  21. Microsoft releases ASSERT, an open-source framework that lets developers generate and run AI behavior tests using natural-language descriptions (Ram Iyer/TechCrunch)
    Techmeme
    Microsoft released ASSERT, an open-source framework for generating and running AI behavior regression tests from natural-language specs.
  22. TinyFish Bigset turns text prompts into live datasets from web
    TestingCatalog
    TinyFish launched Bigset, an open-source multi-agent system that turns plain-language prompts into self-refreshing web datasets.
  23. Nvidia unveils Cosmos 3, an open physical AI foundation model, to help robots and autonomous cars better understand the real world with limited training data (Ina Fried/Axios)
    Techmeme
    Nvidia released Cosmos 3 as an open physical AI foundation model for robotics and autonomous driving training.
  24. Nvidia unveils Isaac GR00T, an open humanoid reference design powered by its Jetson Thor chip, combining a Unitree H2 Plus robot and Sharpa five-fingered hands (Stephen Nellis/Reuters)
    Techmeme
    Nvidia unveiled Isaac GR00T as an open humanoid reference design built around Jetson Thor and commercial robot components.
  25. Microsoft offers devs a better way to control AI agent behavior
    TechCrunch AI
    Microsoft introduced Agent Control Specification, an open-source standard for portable policy files governing AI agent behavior.
  26. OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents
    HF Daily Papers
    OpenWebRL introduces an open framework for training visual web agents with online multi-turn reinforcement learning.
  27. Mellum2 Technical Report
    HF Daily Papers
    Mellum 2 is an open-weight 12B MoE model specialized for software engineering, coding agents, and tool use.
  28. Introducing Search Toolkit
    Mistral AI News
    Mistral introduced Search Toolkit, a composable framework for building production search pipelines in AI applications.
  29. GPIC: A Giant Permissive Image Corpus for Visual Generation
    ArXiv · AI/CL/LG
    GPIC releases a 100 million-image permissive corpus with benchmarks and baselines for visual generative modeling.
  30. minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models
    HF Daily Papers
    minWM releases an open framework for converting video diffusion models into low-latency interactive world models.
  31. A new dataset with more than 100M high-quality, curated images, with captions and metadata! [P]
    r/MachineLearning
    Jasper released MONET, an Apache-licensed 104.9 million image-text dataset with tooling and a paper for text-to-image training.
  32. PrismML just released Binary and Ternary Bonsai Image 4B: 1-bit/ternary text-to-image diffusion transformers that can even run 100% locally in your browser on WebGPU.
    r/LocalLLaMA
    PrismML released 1-bit and ternary 4B text-to-image diffusion models that can run locally in the browser via WebGPU.
  33. NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable)
    r/LocalLLaMA
    NuExtract3 is a self-hostable 4B open VLM for OCR, Markdown conversion, and structured document extraction.
  34. QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks
    HF Daily Papers
    QUEST releases open 2B-35B deep research agent models trained on synthetic long-horizon search and report-generation tasks.
  35. BeeLlama v0.2.0 – major DFlash update. Single RTX 3090: Qwen 3.6 27B up to 164 tps (4.40x), Gemma 4 31B up to 177.8 tps (4.93x). Prompt processing speed near baseline.
    r/LocalLLaMA
    BeeLlama v0.2.0 adds a major DFlash update, speeding Qwen and Gemma inference on single-GPU consumer hardware.
  36. Cohere releases Command A+, a sparse MoE open model built for agentic tasks, with 218B total and 25B active parameters, its first under the Apache 2.0 license (Carl Franzen/VentureBeat)
    Techmeme
    Cohere released Command A+, its first open model under the Apache 2.0 license: a sparse MoE with 218B total and 25B active parameters, featuring lossless quantization and native citations and targeting agentic workflows.
  37. DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback
    ArXiv · AI/CL/LG
    DeltaBox proposes DeltaState, an OS-level abstraction for millisecond-level sandbox checkpoint/rollback by exploiting high similarity between consecutive agent checkpoints.
  38. HarnessAPI: A Skill-First Framework for Unified Streaming APIs and MCP Tools
    ArXiv · AI/CL/LG
    HarnessAPI generates HTTP endpoints with SSE streaming, OpenAPI UI, and MCP tool registration from a single Pydantic-typed skill folder.
  39. torchtune: PyTorch native post-training library
    ArXiv · AI/CL/LG
    Meta releases torchtune, a PyTorch-native fine-tuning library emphasizing modularity and hackability over specialized recipes, covering LoRA, DPO, and RLVR workflows.
  40. OpenComputer: Verifiable Software Worlds for Computer-Use Agents
    HF Daily Papers
    OpenComputer provides a verifier-grounded framework for computer-use agents with 33 desktop apps and 1,000 machine-checkable tasks, including self-evolving verification and auditable partial-credit rewards.
  41. Toto 2.0: Time Series Forecasting Enters the Scaling Era
    ArXiv · AI/CL/LG
    Toto 2.0 releases five open-weight time-series forecasting models (4M–2.5B parameters) under Apache 2.0, setting new state-of-the-art results on the BOOM, GIFT-Eval, and TIME benchmarks.
  42. VibeBench/VibeSearchBench
    GitHub · LLM repos
    VibeSearchBench introduces a multi-turn search-agent benchmark with verifiable knowledge-graph evaluation and early GitHub traction.
  43. ByteDance released an open-source model that attempts to do just about anything with only 3B parameters
    r/LocalLLaMA
    ByteDance released an open-source, general-purpose 3B-parameter model, marking the company's entry into the compact open-weights space.
  44. Google’s AI Studio now lets anyone build Android apps in minutes
    TechCrunch AI
    Google AI Studio now includes web-based tools that generate native Android apps in minutes, targeting AI-powered software development.
  45. EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL
    HF Daily Papers
    EnvFactory automates executable environment synthesis and robust RL training for tool-use agents, generating realistic multi-turn interaction data without costly real-world APIs.
  46. MementoGUI: Learning Agentic Multimodal Memory Control for Long-Horizon GUI Agents
    HF Daily Papers
    MementoGUI adds a plug-in memory framework with a learned controller for online memory selection, compression, and retrieval to improve long-horizon GUI agent performance.
  47. Introducing Google Antigravity 2.0
    Google DeepMind
    Google launches Antigravity 2.0 with an updated desktop app and CLI tool for building agentic AI applications.