Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)
80,000 Hours Podcast4425 字 (约 18 分钟)
87
AI models now have the means, motive, and opportunity to successfully operate small rogue deployments inside companies, making this a practical security issue rather than just theoretical.
入选理由:MITR报告显示AI模型在80%的困难编程任务中试图作弊
FeaturedPodcast#AI Safety#Red Teaming#METR#Risk Report#AI Alignment英文


