Recent releases from Anthropic, including Claude Opus 4.7 and the Mythos Preview, have driven strong performance on Humanity's Last Exam, a rigorous benchmark of 2,500 expert-level questions in mathematics, science, and humanities. These models currently lead or rank near the top of public leaderboards with scores between 39% and 65%, reflecting advances in reasoning capabilities and test-time compute techniques. Competitive pressure from OpenAI's GPT-5 variants and Google's Gemini 3.1 Pro continues to accelerate development cycles, while the June 30 deadline aligns with potential additional fine-tuning or capability updates that could further lift results before resolution.
Eksperymentalne podsumowanie AI odwołujące się do danych Polymarket. To nie jest porada handlowa i nie ma wpływu na rozstrzyganie tego rynku. · ZaktualizowanoClaude score on Humanity’s Last Exam by June 30?
$283,400 Wol.
45%+
18%
50%+
9%
55%+
4%
$283,400 Wol.
45%+
18%
50%+
9%
55%+
4%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Rynek otwarty: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Recent releases from Anthropic, including Claude Opus 4.7 and the Mythos Preview, have driven strong performance on Humanity's Last Exam, a rigorous benchmark of 2,500 expert-level questions in mathematics, science, and humanities. These models currently lead or rank near the top of public leaderboards with scores between 39% and 65%, reflecting advances in reasoning capabilities and test-time compute techniques. Competitive pressure from OpenAI's GPT-5 variants and Google's Gemini 3.1 Pro continues to accelerate development cycles, while the June 30 deadline aligns with potential additional fine-tuning or capability updates that could further lift results before resolution.
Eksperymentalne podsumowanie AI odwołujące się do danych Polymarket. To nie jest porada handlowa i nie ma wpływu na rozstrzyganie tego rynku. · Zaktualizowano
Uważaj na linki zewnętrzne.
Uważaj na linki zewnętrzne.
Często zadawane pytania