It's state of the art on nearly every benchmark we tested and the lead grows the longer the task. Ma...

Mike Krieger(@mikeyk)2026年6月9日

6.0Score

TL;DR · AI 摘要

文章提到某AI模型在多个基准测试中表现优异，但信息密度低，缺乏技术细节。

按章节快速跳转。

用一张图看清主题之间的关系。

查看大纲文本（无障碍 / 无 JS 友好）

值得收藏与分享的关键句。

It's state of the art on nearly every benchmark we tested and the lead grows the longer the task.
— 第 1 段
⬇︎ 下载 PNG 𝕏 分享到 X
Made safe for general release: cyber & bio requests fall back transparently to Opus 4.8
— 第 1 段
⬇︎ 下载 PNG 𝕏 分享到 X
$10/$50 on the API, in paid Claude plans today.
— 第 1 段
⬇︎ 下载 PNG 𝕏 分享到 X

#AI#模型#技术发布

@mikeyk

我们在几乎所有测试的基准上都达到了最先进的水平，任务越长，领先优势越明显。为了确保安全发布，网络和生物识别请求会透明地回退到 Opus 4.8，超过 95% 的会话甚至不会遇到这种情况。目前，通过 API 提供 $10/$50 的服务，已包含在付费的 Claude 计划中。

2026 年 6 月 9 日下午 5:03

4.3K

浏览量

3

1

5

6

56

4

阅读 3 条回复