Orchard: An Open-Source Agentic Modeling Framework

By Baolin Peng, Wenlin Yao, Qianhui Wu, Hao Cheng, Xiao Yu

HF Daily Papers · May 14, 2026

Orchard is an open-source framework for scalable agentic modeling. It is built around Orchard Env, a lightweight environment service that provides reusable primitives for sandbox lifecycle management across task domains, agent harnesses, and pipeline stages.

Categories: OSS & Tools, Research

Excerpt

Agentic modeling aims to transform LLMs into autonomous agents capable of solving complex tasks through planning, reasoning, tool use, and multi-turn interaction with environments. Despite major investment, open research remains constrained by infrastructure and training gaps. Many high-performing systems rely on proprietary codebases, models, or services, while most open-source frameworks focus on orchestration and evaluation rather than scalable agent training. We present Orchard, an open-source framework for scalable agentic modeling. At its core is Orchard Env, a lightweight environment service providing reusable primitives for sandbox lifecycle management across task domains, agent harnesses, and pipeline stages. On top of Orchard Env, we build three agentic modeling recipes. Orchard-SWE targets coding agents. We distill 107K trajectories from MiniMax-M2.5 and Qwen3.5-397B, introduce credit-assignment SFT to learn from productive segments of unresolved trajectories, and apply Balanced Adaptive Rollout for RL. Starting from Qwen3-30B-A3B-Thinking, Orchard-SWE achieves 64.3% on SWE-bench Verified after SFT and 67.5% after SFT+RL, setting a new state of the art among open-source models of comparable size. Orchard-GUI trains a 4B vision-language computer-use agent using only 0.4K distilled trajectories and 2.2K open-ended tasks. It achieves 74.1%, 67.0%, and 64.0% success rates on WebVoyager, Online-Mind2Web, and Deep
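
Illustration (not from the paper): the excerpt says credit-assignment SFT learns from productive segments of unresolved trajectories, but does not say how segments are identified or weighted. One plausible reading is a loss mask that keeps only tokens from segments judged productive. The sketch below assumes that reading; the Segment type, the productive flag, and the masked_sft_loss function are hypothetical names, not Orchard's API.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    """One slice of a trajectory (hypothetical structure, for illustration only)."""
    token_logprobs: List[float]  # log-probabilities the model assigns to the target tokens
    productive: bool             # assumed label: did this segment make progress?


def masked_sft_loss(segments: List[Segment]) -> float:
    """Mean negative log-likelihood over tokens in productive segments only.

    Tokens in unproductive segments are masked out, so they contribute
    nothing to the loss. Returns 0.0 when no segment is productive.
    """
    kept = [lp for seg in segments if seg.productive for lp in seg.token_logprobs]
    if not kept:
        return 0.0
    return -sum(kept) / len(kept)


# An unresolved trajectory with two productive segments and one dead end.
trajectory = [
    Segment(token_logprobs=[-0.5, -1.5], productive=True),
    Segment(token_logprobs=[-3.0, -3.0], productive=False),  # masked out
    Segment(token_logprobs=[-1.0], productive=True),
]
loss = masked_sft_loss(trajectory)
```

Here only the three tokens from productive segments are averaged, so the dead-end segment has no effect on the loss. In a real training loop the same effect is usually obtained by masking label positions before computing cross-entropy.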

Read at source: https://arxiv.org/abs/2605.15040