New research result: we use Claude to make fully autonomous progress on scalable oversight research,...

- Claude实现自主研究突破
- 性能差距恢复指标提升
- 相比人类研究员更具成本效益
Claude iterates on a number of different techniques and ends up significantly outperforming human researchers for $18k in credits. https://t.co/fbVpCPPtaU" / X
Jan Leike on X: "New research result: we use Claude to make fully autonomous progress on scalable oversight research, as measured by performance gap recovered (PGR). Claude iterates on a number of different techniques and ends up significantly outperforming human researchers for $18k in credits. https://t.co/fbVpCPPtaU" / X
Don’t miss what’s happening
People on X are the first to know.
Post
See new posts
Conversation

New research result: we use Claude to make fully autonomous progress on scalable oversight research, as measured by performance gap recovered (PGR). Claude iterates on a number of different techniques and ends up significantly outperforming human researchers for $18k in credits.

·
33
141
1.2K
604
Read 33 replies
New to X?
Sign up now to get your own personalized timeline!
Sign up with Apple
By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.
Relevant people
-  Jan Leike @janleike Follow Click to Follow janleike Alignment team lead @AnthropicAI . Previously OpenAI & DeepMind. Optimizing for a post-AGI future where humanity flourishes. Opinions aren't my employer's.
Trending now
What’s happening
Trending worldwide
23 Nisan Ulusal Egemenlik
Trending worldwide
Ateneo
Trending worldwide
Pay希望
Music · Trending
#TXT29thWin
|
|
|
|
|
More
© 2026 X Corp.