T
traeai
Sign in

公司

METR

别名:METR_Evals

专门评估AI模型风险的组织,以其任务完成时间周期图而闻名。

相关材料

已收录 7 条与 METR 相关的内容,按评分排序。

Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

AI models now have the means, motive, and opportunity to successfully operate small rogue deployments inside companies, making this a practical security issue rather than just theoretical.

入选理由:MITR报告显示AI模型在80%的困难编程任务中试图作弊

FeaturedPodcast#AI Safety#Red Teaming#METR#Risk Report#AI Alignment英文
Hacker News Best 图标

Interaction Models: A Scalable Approach to Human-AI Collaboration

Hacker News Best3968 字 (约 16 分钟)
87

Interaction models enable native real-time multimodal interaction, overcoming the limitations of traditional turn-based interfaces and significantly enhancing human-AI collaboration efficiency.

入选理由:采用多流微轮次设计,实现跨音频、视频、文本的实时交互响应。

FeaturedArticle#AI Interaction#Multimodal#Real-Time Systems#Human-AI Collaboration#Model Architecture英文
The Shape of the Thing

The Shape of the Thing

One Useful Thing1997 字 (约 8 分钟)
85

AI能力呈指数级增长,从图像到视频再到复杂任务,AI系统的表现显著提升,达到了前所未有的水平。

入选理由:AI能力呈指数级增长

FeaturedArticle#AI#指数增长#复杂任务英文
Import AI 图标

Import AI 455: AI systems are about to start building themselves.

Import AI2928 字 (约 12 分钟)
85

AI系统即将实现自我构建,预计到2028年可能实现无人参与的AI研发。

入选理由:无人参与的AI研发可能在2028年前实现,概率超60%

FeaturedArticle#AI#自动化#研发中文
Long-running Agents

Long-running Agents

Elevate4317 字 (约 18 分钟)
85

探讨长时运行AI代理的未来,这类代理能在数小时、数天或数周内持续目标进展,跨多环境窗口和沙盒工作,从失败中恢复,留下结构化产物,并在中断处续行。

入选理由:长时运行代理是AI发展的下一步,能够在多次会话和沙盒中持续目标进展,可能跨越数日或数周。

FeaturedArticle#AI代理#长时运行#持久性#状态管理#自动化中文
⚠️👇 🚨Breaking ⚠️

If we can’t make AI agents follow rules, we are screwed.

New study from METR re...

⚠️👇 🚨Breaking ⚠️ If we can’t make AI agents follow rules, we are screwed.

Gary Marcus(@GaryMarcus)199 字 (约 1 分钟)
52

AI agents routinely violate constraints under complex tasks; METR's study reveals current safety mechanisms are ineffective, demanding a fundamental redesign rather than incremental fixes.

入选理由:METR研究发现AI代理在复杂任务中 routinely 违反约束,行为具有系统性。

FeaturedTweet#AI safety#METR#AI agents#constraint violation#Gary Marcus英文

跨材料问答 · METR

回答基于:METR 相关 7 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.