Evaluating AI’s ability to perform scientific research tasks

OpenAI Blog · Dec 16, 2025

OpenAI releases FrontierScience, a new benchmark evaluating AI reasoning capabilities across physics, chemistry, and biology to track progress toward genuine scientific research assistance.

Categories: Research

Excerpt

OpenAI introduces FrontierScience, a benchmark testing AI reasoning in physics, chemistry, and biology to measure progress toward real scientific research.

Read at source: https://openai.com/index/frontierscience