Introducing SimpleQA
OpenAI released SimpleQA, a benchmark for measuring short factual question-answering accuracy across models.
Excerpt
A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions.
Read at source: https://openai.com/index/introducing-simpleqa