o3

openai · Ranked across 5 benchmarks · best rank #2

Benchmark scores

BenchmarkCategoryRankScoreCaptured
Aider Polyglot code #2 84.9% 2025-06-28
METR Time Horizon agents #10 1h60m 2025-04-16
AA Coding Index code #20 38.4% 2026-06-15
Chatbot Arena chat #38 1409 2026-06-10
OpenRouter · Weekly Usage usage #46 #239 2026-06-09