Recent releases of OpenAI’s GPT-5.5 variants have lifted scores on Humanity’s Last Exam—a 2,500-question benchmark testing graduate-level expertise across math, sciences, and humanities—to the low-to-mid 40s percent range as of late April 2026, narrowing the gap with Google’s leading Gemini 3.1 Pro at roughly 45 percent. Competitive pressure from Anthropic’s Claude models and Meta’s Muse series continues to accelerate iteration cycles, while ongoing training runs and post-training optimizations remain key levers for further gains before the June 30 deadline. Traders are watching for any official announcements of new model weights or evaluation updates that could shift the leaderboard in the final weeks.
Eksperimental na AI-generated summary na nire-reference ang Polymarket data. Hindi ito trading advice at wala itong papel sa kung paano nire-resolve ang market na ito. · Na-updateOpenAI GPT score on Humanity’s Last Exam by June 30?
$23,154 Vol.
50%+
30%
$23,154 Vol.
50%+
30%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Binuksan ang Market: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Recent releases of OpenAI’s GPT-5.5 variants have lifted scores on Humanity’s Last Exam—a 2,500-question benchmark testing graduate-level expertise across math, sciences, and humanities—to the low-to-mid 40s percent range as of late April 2026, narrowing the gap with Google’s leading Gemini 3.1 Pro at roughly 45 percent. Competitive pressure from Anthropic’s Claude models and Meta’s Muse series continues to accelerate iteration cycles, while ongoing training runs and post-training optimizations remain key levers for further gains before the June 30 deadline. Traders are watching for any official announcements of new model weights or evaluation updates that could shift the leaderboard in the final weeks.
Eksperimental na AI-generated summary na nire-reference ang Polymarket data. Hindi ito trading advice at wala itong papel sa kung paano nire-resolve ang market na ito. · Na-update
Mag-ingat sa mga external link.
Mag-ingat sa mga external link.
Mga Madalas na Tanong