Anthropic’s April 2026 release of Claude Opus 4.7 and the Mythos Preview has established frontier-leading performance on Humanity’s Last Exam, a 2,500-question benchmark spanning advanced math, science, and humanities, with recent leaderboards showing scores from 36% to as high as 64.7% under tool-augmented and extended-reasoning conditions. These gains stem from iterative improvements in scalable oversight, larger context windows, and agentic capabilities that outpace OpenAI’s GPT-5 series and Google’s Gemini models on the same evaluation. Traders are focused on whether a public Claude variant sustains or exceeds the 30–35% threshold by the June 30 resolution date, with potential catalysts including developer previews, benchmark updates, or competitive releases at upcoming AI conferences that could shift the aggregated market-implied odds.
สรุปจาก AI ทดลองที่อ้างอิงข้อมูลจาก Polymarket ไม่ใช่คำแนะนำในการเทรดและไม่มีผลต่อการตัดสินตลาดนี้ · อัปเดตแล้ว$283,400 ปริมาณ
45%+
18%
50%+
9%
55%+
4%
$283,400 ปริมาณ
45%+
18%
50%+
9%
55%+
4%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
ตลาดเปิดเมื่อ: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Anthropic’s April 2026 release of Claude Opus 4.7 and the Mythos Preview has established frontier-leading performance on Humanity’s Last Exam, a 2,500-question benchmark spanning advanced math, science, and humanities, with recent leaderboards showing scores from 36% to as high as 64.7% under tool-augmented and extended-reasoning conditions. These gains stem from iterative improvements in scalable oversight, larger context windows, and agentic capabilities that outpace OpenAI’s GPT-5 series and Google’s Gemini models on the same evaluation. Traders are focused on whether a public Claude variant sustains or exceeds the 30–35% threshold by the June 30 resolution date, with potential catalysts including developer previews, benchmark updates, or competitive releases at upcoming AI conferences that could shift the aggregated market-implied odds.
สรุปจาก AI ทดลองที่อ้างอิงข้อมูลจาก Polymarket ไม่ใช่คำแนะนำในการเทรดและไม่มีผลต่อการตัดสินตลาดนี้ · อัปเดตแล้ว
ระวังลิงก์ภายนอก
ระวังลิงก์ภายนอก
คำถามที่พบบ่อย