o3-mini

openai · Ranked across 4 benchmarks · best rank #12

Benchmark scores

BenchmarkCategoryRankScoreCaptured
Aider Polyglot code #12 60.4% 2025-01-31
SWE-bench Verified agents #23 42.4% 2025-02-14
OpenRouter · Weekly Usage usage #36 #155 2026-05-02
Chatbot Arena chat #38 1336 2026-04-30