Google’s latest Gemini 3 Deep Think variant has driven strong trader sentiment by posting a record 48.4% on Humanity’s Last Exam, a demanding benchmark of expert-level questions across science, mathematics, and humanities designed to probe frontier AI reasoning. This score, released in mid-May 2026, surpasses OpenAI’s GPT-5.4 Pro at 44.3% and places Google ahead in the competitive large language model landscape. Ongoing scaling of reasoning techniques and internal model iterations continue to lift Gemini’s performance trajectory, with historical patterns showing rapid gains between major releases. With June 30 only weeks away, any additional capability updates or benchmark re-evaluations could further shift market-implied odds on whether Gemini reaches an even higher threshold by the resolution date.
Ringkasan eksperimental yang dihasilkan AI dengan referensi data Polymarket. Ini bukan saran trading dan tidak berperan dalam bagaimana pasar ini diselesaikan. · Diperbarui$312,088 Vol.
50%+
61%
55%+
27%
60%+
6%
$312,088 Vol.
50%+
61%
55%+
27%
60%+
6%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Pasar Dibuka: Jan 29, 2026, 12:50 PM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Google’s latest Gemini 3 Deep Think variant has driven strong trader sentiment by posting a record 48.4% on Humanity’s Last Exam, a demanding benchmark of expert-level questions across science, mathematics, and humanities designed to probe frontier AI reasoning. This score, released in mid-May 2026, surpasses OpenAI’s GPT-5.4 Pro at 44.3% and places Google ahead in the competitive large language model landscape. Ongoing scaling of reasoning techniques and internal model iterations continue to lift Gemini’s performance trajectory, with historical patterns showing rapid gains between major releases. With June 30 only weeks away, any additional capability updates or benchmark re-evaluations could further shift market-implied odds on whether Gemini reaches an even higher threshold by the resolution date.
Ringkasan eksperimental yang dihasilkan AI dengan referensi data Polymarket. Ini bukan saran trading dan tidak berperan dalam bagaimana pasar ini diselesaikan. · Diperbarui
Hati-hati dengan link eksternal.
Hati-hati dengan link eksternal.
Pertanyaan yang Sering Diajukan