o3-mini

openai · Ranked across 5 benchmarks · best rank #12

Benchmark scores

BenchmarkCategoryRankScoreCaptured
Aider Polyglot code #12 60.4% 2025-01-31
SWE-bench Verified agents #25 42.4% 2025-02-14
OpenRouter · Weekly Usage usage #38 #135 2026-06-09
Chatbot Arena chat #43 1337 2026-06-10
AA Coding Index code #50 17.9% 2026-06-15