Rohin Shah on What It's Really Like to Run AGI Safety at Google DeepMind (and Where I Disagree with 'Doomers')
Rohin Shah argues that while AGI safety risks deserve attention, catastrophic misalignment is not inevitable, and prosaic alignment techniques are likely sufficient to prevent worst-case outcomes, especially since current concerns like deception are not default behaviors in real training.
入选理由:Rohin Shah 认为灾难性 AGI 对齐失败不是默认结果,缺乏足够强的论证支持其必然发生。
