Jan Leike(@janleike)2026年4月14日

However, most alignment research is not very crisp and requires research taste when evaluating. Thi...

5.0Score

AI 深度提炼

对齐研究通常缺乏明确性
可扩展监督是关键挑战
人类只能提供弱监督

#AI对齐#研究方法#监督学习

This is why we chose to point the AAR at this scalable oversight problem! Progress would let AARs work on fuzzier alignment problems, where humans can only provide weak supervision." / X

Jan Leike on X: "However, most alignment research is not very crisp and requires research taste when evaluating. This is why we chose to point the AAR at this scalable oversight problem! Progress would let AARs work on fuzzier alignment problems, where humans can only provide weak supervision." / X

Don’t miss what’s happening

People on X are the first to know.

Post

See new posts

Conversation

![Image 1](http://x.com/janleike)

Jan Leike

@janleike

However, most alignment research is not very crisp and requires research taste when evaluating. This is why we chose to point the AAR at this scalable oversight problem! Progress would let AARs work on fuzzier alignment problems, where humans can only provide weak supervision.

7:43 PM · Apr 14, 2026

6,771 Views

New to X?

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Relevant people

![Image 2](http://x.com/janleike) Jan Leike @janleike Follow Click to Follow janleike Alignment team lead @AnthropicAI . Previously OpenAI & DeepMind. Optimizing for a post-AGI future where humanity flourishes. Opinions aren't my employer's.

Trending now

What’s happening

Trending worldwide

23 Nisan Ulusal Egemenlik

Trending worldwide

Ateneo

Trending worldwide

Pay希望

Music · Trending

#TXT29thWin

Cookie Policy

Accessibility

Ads info