OpenAI’s April 2026 release of GPT-5.5 variants, scoring up to 44.3 percent on Humanity’s Last Exam, has narrowed the gap with Google’s leading Gemini 3.1 Pro Preview at 44.7 percent and lifted trader expectations for further gains by June 30. These models demonstrated stronger multi-step reasoning on the 2,500-question benchmark, which tests frontier expertise across sciences and humanities. Competitive pressure from Meta’s Muse Spark and Anthropic’s latest Opus releases continues to accelerate OpenAI’s internal iteration cycle, while historical patterns show frontier labs often deliver meaningful capability jumps every six to eight weeks. Key near-term catalysts include potential June model updates, expanded test-time compute techniques, and any official announcements on new training runs or post-training refinements that could push OpenAI models into the mid-40s or higher before the deadline.
基于Polymarket数据的AI实验性摘要。这不是交易建议,也不影响该市场的结算方式。 · 更新于$23,212 交易量
50%+
33%
$23,212 交易量
50%+
33%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
市场开放时间: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...OpenAI’s April 2026 release of GPT-5.5 variants, scoring up to 44.3 percent on Humanity’s Last Exam, has narrowed the gap with Google’s leading Gemini 3.1 Pro Preview at 44.7 percent and lifted trader expectations for further gains by June 30. These models demonstrated stronger multi-step reasoning on the 2,500-question benchmark, which tests frontier expertise across sciences and humanities. Competitive pressure from Meta’s Muse Spark and Anthropic’s latest Opus releases continues to accelerate OpenAI’s internal iteration cycle, while historical patterns show frontier labs often deliver meaningful capability jumps every six to eight weeks. Key near-term catalysts include potential June model updates, expanded test-time compute techniques, and any official announcements on new training runs or post-training refinements that could push OpenAI models into the mid-40s or higher before the deadline.
基于Polymarket数据的AI实验性摘要。这不是交易建议,也不影响该市场的结算方式。 · 更新于
警惕外部链接哦。
警惕外部链接哦。
常见问题