Introducing SimpleQA

OpenAI Blog ·

OpenAI released SimpleQA, a benchmark for measuring short factual question-answering accuracy across models.

Categories: Research

Excerpt

A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions.