Jan Leike 还有哪些别名？

Jan Leike 也被称为：janleike。

人物

什么是 Jan Leike？

Q: Jan Leike 最近有什么新动态？

traeai 已收录 4 篇与 Jan Leike 相关的内容。最新一篇是「I'm really excited about this as a new tool in our interpretability tool kit」，由 Jan Leike(@janleike) 发布。

也叫：janleike

AI 安全研究员，曾任 DeepMind 研究员，现关注 LLM 可解释性与对齐。

为什么现在值得关注？

如果只读 3 篇

I'm really excited about this as a new tool in our interpretability tool kit

Jan Leike(@janleike) · 8.5 分

When I started to work on the alignment problem more than 10 years ago, we had no idea how AGI was g...

Jan Leike(@janleike) · 7.5 分

Some personal news: I am starting a new research project at Anthropic. Very excited about this! Man...

Jan Leike(@janleike) · 5.5 分

📰 Jan Leike 最新动态

已收录 4 篇与「Jan Leike」相关的 AI 资讯和分析。

Jan Leike on X: "I'm really excited about this as a new tool in our interpretability tool kit"

Jan Leike(@janleike)5月9日152 字 (约 1 分钟)

NLAs is an unsupervised method that converts LLM internal states into human-readable text, significantly improving model transparency and safety auditing.

入选理由：NLAs 是一种无监督技术，能将 LLM 内部激活向量转为自然语言描述。

FeaturedTweet#LLM#Interpretability#AI Safety#Anthropic英文

When I started to work on the alignment problem more than 10 years ago, we had no idea how AGI was g...

Jan Leike on X: The Evolution of AI Alignment Research Over a Decade

Jan Leike(@janleike)5月9日292 字 (约 2 分钟)

Jan Leike reflects on the transformation of AI alignment research over the past decade—from a niche field with only ~12 researchers and unclear methods to one now driven by RLHF, scalable oversight, and automated techniques like constitutional AI in models such as Claude.

入选理由：10 年前 AI 对齐领域仅有约 12 人作为副业从事研究，且方法混乱。

FeaturedTweet#AI Alignment#AGI#RLHF#Machine Learning英文

Some personal news: I am starting a new research project at Anthropic. Very excited about this!

Jan Leike(@janleike)5月9日113 字 (约 1 分钟)

Jan Leike is joining Anthropic to start a new research project, emphasizing that alignment is just one of many factors needed for AGI success.

入选理由：Jan Leike 加入 Anthropic 开始新研究项目，聚焦 AGI 发展。

FeaturedTweet#AGI#Anthropic#research英文

Jan Leike on X: Grateful for talented people in AI alignment

Jan Leike(@janleike)5月9日120 字 (约 1 分钟)

Jan Leike thanks talented collaborators in AI alignment, calling it a privilege to work with those deeply motivated to make the future better.

入选理由：Jan Leike 感谢了多年在 AI 对齐领域合作的顶尖人才

FeaturedTweet#AI Alignment#OpenAI#Ethics英文

与「Jan Leike」经常一起出现的 AI 术语。

Anthropic NLAs Claude AGI AI Alignment

💡 想追踪「Jan Leike」的长期趋势？去实体雷达 · Jan Leike 查看详细分析和跨材料问答。