

Who is Joe Carlsmith

AI safety researcher who argues that power-seeking AI poses an existential risk.

📰 Latest on Joe Carlsmith

2 AI news items and analyses related to "Joe Carlsmith" have been indexed.

What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

Rohin Shah argues that while AGI safety risks deserve attention, catastrophic misalignment is not inevitable, and prosaic alignment techniques are likely sufficient to prevent worst-case outcomes, especially since current concerns like deception are not default behaviors in real training.

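Why it was selected: Rohin Shah argues that catastrophic AGI misalignment is not the default outcome, and that no sufficiently strong argument shows it is inevitable.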

Featured | Podcast | #AGI #AI Safety #DeepMind #Alignment #Rohin Shah | English
Claude's Constitution is now an audiobook, read by two of its authors, Amanda Askell and Joe Carlsmi...

Anthropic on X: 'Claude's Constitution is now an audiobook, read by two of its authors'

Anthropic (@AnthropicAI) | 173 characters (about 1 minute)

Anthropic released an audiobook version of Claude's Constitution, read by two of its authors, including a Q&A on the writing process and how it might change as models become more capable.

Why it was selected: The audiobook of Claude's Constitution is read by two of its authors.

Featured | Tweet | #AI #Anthropic #Claude | Chinese

AI terms that frequently appear alongside "Joe Carlsmith".
