Opus 4.8 最近有什么新动态？

traeai 已收录 18 篇与 Opus 4.8 相关的内容。最新一篇是「MYTHOS MYTHOS MYTHOS」，由 Matthew Berman 发布。

模型

Opus 4.8

Q: 什么是 Opus 4.8？

一个可能用于模型回退的AI模型。

别名：Opus

一个可能用于模型回退的AI模型。

已跟踪 18 条高相关材料

TraeAI 观察

如果只读 3 篇

Claude Fable 5省钱秘诀来了：调成Low档比Opus更便宜

量子位 · 8.5 分

Claude Fable 5在低档位下表现优于Opus 4.8，且在复杂任务中更省成本。

Fable 5自带反蒸馏机制！检测到就降智，误触率高到离谱

量子位 · 8.5 分

Fable 5模型内置反蒸馏机制，检测到潜在训练行为时会自动降智，误触率远超官方宣称的5%。

Claude Fable 5 - Full 319 page Breakdown

AI Explained · 8.5 分

Claude Fable 5 显著提升了 AI 能力，但存在使用限制和内容过滤机制。

MYTHOS MYTHOS MYTHOS

Matthew BermanYesterday6582 字 (约 27 分钟)

Anthropic 发布了 Mythos 模型，其能力远超以往所有公开模型，且分为带安全限制的 Fable 和无限制的 Mythos。

入选理由：Mythos 模型能力远超 Anthropic 以往所有公开模型。

FeaturedVideo#Anthropic#AI 模型#Mythos#Fable#深度学习英文

Claude Fable 5 - Full 319 page Breakdown

AI ExplainedYesterday7804 字 (约 32 分钟)

Claude Fable 5 显著提升了 AI 能力，但存在使用限制和内容过滤机制。

入选理由：Claude Fable 5 在性能和功能上都有显著提升。

FeaturedVideo#AI#Claude#Anthropic#模型发布英文

Fable 5自带反蒸馏机制！检测到就降智，误触率高到离谱

量子位6月11日2636 字 (约 11 分钟)

Fable 5模型内置反蒸馏机制，检测到潜在训练行为时会自动降智，误触率远超官方宣称的5%。

入选理由：Fable 5的反蒸馏机制会自动降低回答质量，且不通知用户。

FeaturedArticle#Fable 5#Anthropic#AI模型#安全机制中文

Claude Fable 5省钱秘诀来了：调成Low档比Opus更便宜

量子位6月11日2414 字 (约 10 分钟)

Claude Fable 5在低档位下表现优于Opus 4.8，且在复杂任务中更省成本。

入选理由：Fable 5低档位下表现优于Opus 4.8

FeaturedArticle#Claude#AI模型#成本优化中文

Anthropic 今天同时发布了两个模型：Claude Fable 5 和 Claude Mythos 5。两个模型用的是同一个底座，区别在于 Fable 5 加了一套安全分类器，面向所有用户开...

宝玉(@dotey)6月10日1018 字 (约 5 分钟)

Anthropic 发布 Claude Fable 5 和 Mythos 5，前者面向所有用户开放并内置安全机制，后者专供网络安全合作伙伴使用。

入选理由：Fable 5 通过降级机制保障安全，95% 的对话不会触发降级。

FeaturedTweet#Anthropic#Claude#AI模型#网络安全中英混合

SWE-Bench style grading has been the standard for years now - you ask the agent to solve an issue an...

Scott Wu(@ScottWu46)6月10日239 字 (约 1 分钟)

FrontierCode 是一种新的代码评估基准，通过多维度评价模型生成代码的质量，显著减少误判并提升评估标准。

入选理由：FrontierCode 评估标准比传统单元测试更全面，涵盖代码风格、可维护性等维度。

FeaturedTweet#AI#代码评估#模型测试#开源英文

Claude Opus 4.8 is now live in Genspark AI Chat Agent. Ready to use today. Try it now → https://t.c...

Genspark(@genspark_ai)5月29日78 字 (约 1 分钟)

Claude Opus 4.8 在 Genspark AI Chat Agent 上线，具备更强判断力、更诚实的自我评估和更长独立工作能力。

入选理由：Claude Opus 4.8 提升了判断力和独立工作时长。

FeaturedTweet#AI#Claude#Genspark#Language Model英文

Fast mode for Opus 4.8 is much more affordable now. Try it out in Claude Code, I've found it chang...

Alex Albert(@alexalbert__)5月28日245 字 (约 1 分钟)

Opus 4.8 的快速模式现在更便宜且速度提升 2.5 倍，适合需要快速响应的交互任务。

入选理由：快速模式比普通模式快 2.5 倍，价格降低至原来的三分之一。

FeaturedTweet#Opus 4.8#Claude Code#AI模型#快速模式中英混合

Hermes Agent NEW Super-App and DeepSeek v4 Catches Up To Opus 4.8?

Riley BrownYesterday5752 字 (约 24 分钟)

AI代理平台更新迅速，Codex新增站点插件，Hermes推出超级应用，DeepSeek v4与Opus竞争。

入选理由：Codex新增站点插件，可生成内部工具并部署到互联网。

FeaturedVideo#AI代理#Codex#Hermes#DeepSeek#Opus英文

Testing Out the New Opus 4.8 👌

The AI Advantage6月2日167 字 (约 1 分钟)

Anthropic launches Opus 4.8, available on Claude Web, Claude Cowork, and API. A demo shows generating a visually stunning frontend website in one prompt, taking over 10 minutes, showcasing strong creative output.

入选理由：Opus 4.8 已在 Claude Web、Claude Cowork 和 API 中上线，支持多场景应用。

FeaturedVideo#Anthropic#Opus 4.8#AI-generated design#Claude#Frontend Development英文

We just shipped Opus 4.8! It's noticeably more honest, owning what it doesn't know and flagging problems in its own code instead of glossing over them.

cat(@_catwu)5月30日117 字 (约 1 分钟)

Claude releases Opus 4.8, significantly improving honesty by admitting knowledge gaps and flagging code issues, recommended for daily use in Claude Code.

入选理由：Opus 4.8 在 Claude Code 中默认推荐使用，适合日常编码任务。

FeaturedTweet#Claude#AI Model#Code Generation#Large Language Model英文

TL;DR: Fable 5 isn’t the right-sized model for every task, but when quality and depth matter (revie...

Augment Code(@augmentcode)6月10日105 字 (约 1 分钟)

Fable 5模型在特定任务中表现优异，但并非所有场景都适用。

入选理由：Fable 5在需要高质量和深度的任务中表现突出。

FeaturedTweet#Fable 5#模型#AI#GPT#Opus 4.8英文

Claude Fable 5 may refuse to handle certain requests due to security guardrails. You can configure ...

AI SDK(@aisdk)6月10日72 字 (约 1 分钟)

Claude Fable 5 由于安全限制可能拒绝处理某些请求，可配置回退到 Opus 4.8 模型。

入选理由：Claude Fable 5 可能因安全限制拒绝处理某些请求。

FeaturedTweet#AI#Claude#模型#安全英文

A new top scorer just one day after our benchmark released! Especially strong on the hardest tasks: ...

Scott Wu(@ScottWu46)6月10日115 字 (约 1 分钟)

Claude Fable 5在FrontierCode Diamond基准测试中表现优异，比Opus 4.8提升了15.9个百分点。

入选理由：Claude Fable 5在FrontierCode Diamond基准测试中得分从13.4%提升至29.3%。

FeaturedTweet#AI模型#基准测试#Claude#FrontierCode英文

Claude Can Make Hundreds of AI Agents Now!

The AI Advantage6月7日280 字 (约 2 分钟)

Claude Code's "workflow" feature, activated via a keyword in prompts, automatically spawns hundreds of sub-agents to handle the end-to-end development lifecycle from planning to QA testing.

入选理由：在 Claude Code 提示词中使用 "workflow" 关键词可触发多代理协作模式，将复杂任务分解给数百个子代理执行。

FeaturedVideo#Claude Code#AI Agents#Anthropic#Automated Testing#Software Engineering英文

Testing Out the New Opus 4.8 👌

The AI Advantage6月1日167 字 (约 1 分钟)

Anthropic's new model Opus 4.8 is available via web app, Claude Cowork/Code, and API; author generated a visually stunning frontend website design taking over 10 minutes, but no technical details or performance metrics were provided.

入选理由：Opus 4.8 已上线网页版、Claude Cowork/Code 和 API，支持多场景调用。

FeaturedVideo#Anthropic#Opus 4.8#AI-generated design英文

SWEbench is done.

Matthew Berman6月2日212 字 (约 1 分钟)

SWEbench benchmark is invalid as GPT 5.5 scores 70% on Deep Suite versus Opus 4.7's 54%, showing opposite trends in SWEbench, indicating unreliability.

入选理由：GPT 5.5 achieves 70% accuracy on Deep Suite, significantly outperforming Opus 4.7 at 54%.

FeaturedVideo#SWEbench#Deep Suite#GPT#Opus#Gemini英文

Opus 4.8 Cache Hit Rate and Effective Price Now Viewable in Real Time

AI HOT 精选6月7日97 字 (约 1 分钟)

OpenRouter now lets users view real‑time cache hit rates and historical traffic in the Pricing tab, enabling cost‑effectiveness comparisons across model providers.

入选理由：实时缓存命中率可直接在 Pricing 选项卡查看。

FeaturedArticle#OpenRouter#Cache Hit Rate#Cost Evaluation中文

跨材料问答 · Opus 4.8

回答基于：Opus 4.8 相关 18 条材料