Model of the Week
OpenAI's New Benchmark Measures the One Thing That Actually Matters: Can AI Do Your Job?
GPT-5.5 scores 84.9% on GDPVal — OpenAI's new benchmark spanning 44 real occupations across 9 industries — and the trajectory of that number tells a more uncomfortable story than any academic test score.
