Anthropic’s April 2026 releases of Claude Opus 4.7 and the Mythos Preview have propelled its models to leading scores of 35–65% on Humanity’s Last Exam, a 2,500-question benchmark testing frontier-level reasoning across mathematics, science, and humanities. These results, often achieved with tool use and extended chain-of-thought, surpass prior Claude versions and edge out competitors such as OpenAI’s GPT-5 series and Google’s Gemini 3.1 Pro. Trader focus now centers on whether a publicly listed Claude variant can sustain or exceed these thresholds through June 30 amid ongoing evaluation updates and rapid model iterations. Potential catalysts include any Claude 5 preview announcements or benchmark refreshes at upcoming AI conferences, while discrepancies in prompting methods or leaderboard methodology remain key variables.
Experimental AI-generated summary using Polymarket data. This is not trading advice and plays no role in the resolution of this market. · Updated
Claude scores on Humanity’s Last Exam by June 30?
$283,400 Vol.
45%+: 18%
50%+: 9%
55%+: 4%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Market Opened: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...
Frequently Asked Questions