Putnam exam

Alias: Putnam

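The most difficult undergraduate mathematics competition in the United States, often regarded as a benchmark challenge for AI mathematical ability.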

Related materials

1 item related to Putnam exam has been collected, sorted by score.

Scaling Past Informal AI - Carina Hong, Axiom Math

Latent Space · 1535 words (about 7 minutes)
Score: 87

Axiom Math pursues Verified AI, using formal proofs in Lean to scale mathematical ability and compound it. It reports 12/12 on the Putnam and 99% (187/189) on Verina Codegen, far exceeding OpenAI o3’s 4.9%, and presents this as critical capability verification and knowledge propagation for AGI.

Reason for selection: Axiom scored 12/12 on the Putnam exam, outperforming top undergraduates and DeepSeek (103/120), the closest AI system at the time.

Featured · Article · #Verified AI · #Formal Verification · #Lean · #AGI · #Putnam Exam · English
