OpenAI launches GDPval to measure AI performance on real-world economic tasks
OpenAI has launched GDPval, a new benchmark that evaluates AI models on real-world tasks across 44 occupations. Early results show models like GPT-5 and Claude Opus 4.1 competing with industry experts.