GPT-4o

openai · Ranked across 5 benchmarks · best rank #8

Benchmark scores

BenchmarkCategoryRankScoreCaptured
METR Task Horizon (HCAST) agents #8 10m 2025-07-12
Chatbot Arena chat #17 1429 2026-04-30
Aider Polyglot code #19 45.3% 2025-03-29
SWE-bench Verified agents #27 38.8% 2024-10-28
OpenRouter · Weekly Usage usage #31 #102 2026-05-02