T
traeai
Sign in

人物

Thomas Wolf

别名:@Thom_Wolf

Hugging Face CEO,知名AI领域专家。

已跟踪 5 条高相关材料

TraeAI 观察

相关材料

已收录 5 条与 Thomas Wolf 相关的内容,按评分排序。

watching a team of agents tackling a hard theoretical physics problem is quite mesmerizing - self-co...

The Physics-Intern framework boosts Gemini 3.1 Pro's performance on the CritPt benchmark from 17.7% to 31.4% via multi-agent collaboration, setting a new SOTA in theoretical physics reasoning.

入选理由:Physics-Intern 使用多智能体协作框架解决复杂理论物理问题。

FeaturedTweet#AI Agent#Theoretical Physics#LLM Reasoning#Gemini#CritPt中英混合
I'm very excited about this extension to the celebrated Terminal-Bench to science.

If you're a scie...

Thomas Wolf is excited about the extension of Terminal-Bench to scientific fields, known as Terminal-Bench Science. This benchmark evaluates AI models' ability to control tools via the command line to achieve scientific goals. It's open for contributions of real scientific workflows until August 2026, aiming to improve AI models' assistance in research work.

入选理由:Terminal-Bench Science evaluates AI models' performance in handling scientific workflows through command-line tools.

FeaturedTweet#AI#Science#Terminal-Bench#Benchmarking#Command Line英文
my 13 yo the other day:

“we didn’t want to pay for the game with my friend so we just rebuilt it wi...

my 13 yo the other day:

Thomas Wolf(@Thom_Wolf)221 字 (约 1 分钟)
50

Codex can be used to rebuild games as an alternative to paying for them.

入选理由:13 岁的用户使用 Codex 重构游戏以避免付费。

FeaturedTweet#AI#Game Development英文
👀

👀

Thomas Wolf(@Thom_Wolf)28 字 (约 1 分钟)
20

This tweet contains only an emoji and a link to an external image, lacking substantive technical content, architectural analysis, or engineering practice guidance. It has extremely low information density and offers no valuable reading reference for engineers.

入选理由:原文仅为社交媒体状态更新,缺乏可提取的技术深度或原理说明。

FeaturedTweet#Social Media#Low Information Density#No Technical Content英文

跨材料问答 · Thomas Wolf

回答基于:Thomas Wolf 相关 5 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.