
Concept

What is CritPt?

CritPt is a benchmark for evaluating large language models on theoretical physics reasoning tasks.

📰 Latest CritPt news

1 AI news item or analysis related to "CritPt" has been indexed.

watching a team of agents tackling a hard theoretical physics problem is quite mesmerizing - self-co...

The Physics-Intern framework uses multi-agent collaboration to raise Gemini 3.1 Pro's score on the CritPt benchmark from 17.7% to 31.4%, a gain of 13.7 percentage points, setting a new state of the art (SOTA) in theoretical physics reasoning.

Why it was selected: Physics-Intern uses a multi-agent collaboration framework to solve complex theoretical physics problems.

Featured · Tweet · #AI Agent #Theoretical Physics #LLM Reasoning #Gemini #CritPt · Mixed Chinese and English


