T
traeai
Sign in

公司

Anthropic

别名:anthropics

与OpenAI共同推动FDE概念的AI公司

已跟踪 30 条高相关材料

TraeAI 观察

相关材料

已收录 30 条与 Anthropic 相关的内容,按评分排序。

When AI Builds Itself: Our progress toward recursive self-improvement

When AI Builds Itself: Our Progress Toward Recursive Self-Improvement

Hacker News Best5602 字 (约 23 分钟)
92

Recursive self-improvement is accelerating; Anthropic data shows an 8x increase in engineer code output and AI reliable task duration doubling every 4 months, projecting week-long task capability by 2027.

入选理由:Anthropic工程师季度代码产出较2021-2025年均值提升8倍,AI已实质性加速研发。

FeaturedArticle#Recursive Self-Improvement#Anthropic#AI Agents#SWE-bench#METR英文
Hacker News Best 图标

Anthropic's Open-Source Framework for AI-Powered Vulnerability Discovery

Hacker News Best2289 字 (约 10 分钟)
92

Anthropic open-sourced a Claude-based reference framework for autonomous vulnerability discovery and remediation, featuring a full agent pipeline from threat modeling to patch verification with gVisor sandboxing.

入选理由:框架包含recon→find→verify→report→patch五阶段自主扫描流水线,默认配置针对C/C++内存漏洞。

FeaturedArticle#AI Security#Vulnerability Discovery#Claude#gVisor#DevSecOps英文
We just published internal data on how much of Claude's development is already being done by Claude:...

Alex Albert on Claude Internal Data: 80% Code AI-Written, Engineer Output Up 8x

Alex Albert(@alexalbert__)224 字 (约 1 分钟)
92

Anthropic internal data reveals Claude writes over 80% of merged code and boosts engineer output by 8x. Open-ended task success rose from 26% to 76% in six months, suggesting recursive self-improvement may arrive sooner than expected.

入选理由:Anthropic代码库中超过80%的合并代码由Claude生成,研究员已数月未手写代码。

FeaturedTweet#Claude#Anthropic#AI Coding#Recursive Self-Improvement#R&D Efficiency英文
Anthropic 内部设计师如何用 Claude Code 做产品、写代码、推 PR 

-- 来自 Claude Code & Cowork 设计负责人
@meaghaneschoi 

核心命题...

How Anthropic Designers Use Claude Code to Build Products, Write Code, and Ship PRs

meng shao(@shao__meng)1666 字 (约 7 分钟)
92

Anthropic's design lead validates an AI workflow using 'PRs with visual evidence' as the acceptance unit, transforming designers from coders into aesthetic decision-makers and quality governors via custom Skills and scheduled tasks.

入选理由:使用/prototype Skill让AI生成5个方案并自选最优解,人仅做最终审美确认。

FeaturedTweet#Claude Code#AI Workflow#Design Engineering#Anthropic#Excalidraw中文
Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

Latent Space17807 字 (约 72 分钟)
92

Andon Labs reveals through Vending-Bench that AI agents exhibit deception, price cartels, and emergency calls in long-term physical operations, exposing emergent risks undetectable by traditional benchmarks.

入选理由:Vending-Bench让AI管理实体售货机,暴露了MMLU等静态测试无法发现的欺骗与法律风险行为。

FeaturedArticle#AI Evaluation#Autonomous Agents#Andon Labs#Vending-Bench#AI Safety英文
Nemotron 3 Ultra NVIDIA's 550B Open Model

Nemotron 3 Ultra: NVIDIA's 550B Open Agent Model

Sam Witteveen3906 字 (约 16 分钟)
87

NVIDIA introduces the 550B-parameter Neotron 3 Ultra, a mixture-of-experts agent model trained for task orchestration, outperforming many trillion-parameter open agents on benchmarks, with full data and recipe transparency to enable enterprise on-prem deployment and fine-tuning.

入选理由:Neotron 3 Ultra 为 550B 参数混合专家模型,活跃参数约 55B,专为代理任务训练。

FeaturedVideo#Nemotron3Ultra#550B#Mixture-of-Experts#Agent Benchmarks#Open Models英文
OpenAI 和 Anthropic 共同看好的 FDE:AI 时代的新岗位出现,旧分工松动|对谈 Rolling AI

FDE positions redefine enterprise service models by treating AI as labor force, emphasizing business integration and system integration over technical skills for AI deployment.

入选理由:FDE需具备业务融合、知识治理和系统对接三项核心能力,而非单纯技术技能

FeaturedPodcast#AI#Enterprise AI#FDE#Consulting#Service Industry中文
The Growing Cost of Intelligence (Part I)

The Growing Cost of Intelligence (Part I)

AI Musings by Mu3776 字 (约 16 分钟)
85

AI costs are shifting from experimental to infrastructure-level spending as enterprises face Tokenmaxxing and hidden compute multipliers from agentic workflows. Despite lower unit prices, total cost of ownership surges due to iterative inference and system-level bottlenecks, making efficiency optimization and FinOps the new competitive frontier.

入选理由:Uber单季度耗尽年度Token预算,某企业因无限制许可单月Claude支出达5亿美元,Tokenmaxxing成为新风险。

FeaturedArticle#AI Infrastructure#FinOps#Agentic Workflow#Token Economics#Cost Optimization英文
Anthropic Head of Design on How Claude Code Hit $2.5B in Year One and 51% of the Coding Market

Claude Code achieved $2.5B in its first year from nothing and now holds 51% of the coding market, according to the Head of Design. The team views itself as just at the doorstep of what's possible, emphasizing daily prototyping, flexible iteration, and curious building.

入选理由:Claude Code首年实现2.5亿美元营收,从零起步。

FeaturedVideo#Claude#Coding Market#AI Workforce#Agile Practices#Rapid Iteration英文
Excited to share how Anthropic's data team has automated 95% of business analytics queries with Clau...

Anthropic's data team automated 95% of business analytics queries using Claude, with a blog detailing skills, data foundations, and evaluation methods including evals, ablations, and online validation.

入选理由:Anthropic 通过 Claude 自动化 95% 的业务分析查询,显著提升效率。

FeaturedTweet#Anthropic#Claude#Data Analytics#Automation#Evaluations英文
Critical context on the new Anthropic blog:

1, AGI is *harder* than RSI (as used below).

AGI: mach...

Gary Marcus on Anthropic Blog: RSI Is Not AGI, Neurosymbolic AI Is Key

Gary Marcus(@GaryMarcus)305 字 (约 2 分钟)
78

Anthropic's blog demonstrates Recursive Self-Improvement (RSI) in coding, not AGI. The breakthrough relies on neurosymbolic systems rather than pure scaling. Achieving AGI requires new paradigms, proving data center expansion is not the only path and panic is premature.

入选理由:AGI要求机器自主完成人类所有任务,目前尚未实现;Anthropic成果仅属RSI范畴。

FeaturedTweet#AGI#Neurosymbolic AI#Recursive Self-Improvement#Anthropic#Gary Marcus英文
Claude Cowork Fundamentals

Claude Co-Work Fundamentals and Security Boundaries

Tina Huang166 字 (约 1 分钟)
78

Understanding the inner workings of AI collaborators prevents不当访问 and erroneous workflows, making it essential for engineers to study architecture and security boundaries to guide selection and integration.

入选理由:AI 协作者可能绕过安全边界,不当访问用户数据(如改用计算机访问 Apple Notes)。

FeaturedVideo#Claude#AI Collaborator#Security Boundaries#Architecture#Integration英文
The speedup isn’t just in volume. On open-ended coding problems where answers are unclear, Claude’s ...

Anthropic: Claude's Open-Ended Coding Success Rate Jumps 50 Points to 76% in 6 Months

Anthropic(@AnthropicAI)119 字 (约 1 分钟)
75

Claude achieved a 76% success rate on open-ended coding tasks, a 50-point increase in 6 months, with code quality now matching humans and expected to surpass them within a year.

入选理由:Claude开放式编码成功率6个月从26%跃升至76%,增幅达50个百分点。

FeaturedTweet#Claude#AI Coding#Anthropic#LLM Benchmark英文
Each time we release a model, we run the same test: give it code that trains a small AI model, ask t...

Anthropic's Mythos Preview Achieves 52x AI Code Speedup, Surpassing Human Expert 4x Limit

Anthropic(@AnthropicAI)206 字 (约 1 分钟)
75

Anthropic's latest model Mythos Preview achieved a 52x speedup in an AI code acceleration benchmark, far exceeding the 4x limit reached by human experts in 4-8 hours and the 3x level of the previous Opus 4, marking that AI has significantly surpassed human engineers in algorithm optimization efficiency.

入选理由:Mythos Preview将AI训练代码加速52倍,而人类专家耗时4-8小时仅能达到4倍加速。

FeaturedTweet#Anthropic#Mythos Preview#AI Code Optimization#Performance Benchmark英文
AI research is a series of next-step decisions. We looked at sessions where a human researcher took ...

Anthropic: AI Research Is a Series of Next-Step Decisions

Anthropic(@AnthropicAI)109 字 (约 1 分钟)
75

Anthropic's Mythos Preview model corrects human researchers' missteps with 64% success rate, up from 22% in 2024, demonstrating tangible value in guiding scientific decision recovery.

入选理由:Mythos Preview在人类研究走错路时提供正确下一步建议的概率为64%

FeaturedTweet#Anthropic#Mythos Preview#AI-assisted research#decision correction英文
Co-Existence and the End of Co-Intelligence

Co-Existence and the End of Co-Intelligence

One Useful Thing1479 字 (约 6 分钟)
75

AI is shifting from human-assisted 'co-intelligence' to autonomous agents; Anthropic reports AI now writes 80% of its code with 8x developer productivity gains. The author proposes a 'co-existence' paradigm for thriving alongside AI that sometimes outperforms humans but remains imperfect on the 'jagged frontier'.

入选理由:Anthropic报告AI现编写其80%代码,开发者人均交付量提升8倍,标志自主代理时代来临。

FeaturedArticle#AI Agents#Software Engineering#Human-AI Collaboration#Anthropic#Productivity英文
None of this guarantees recursive self-improvement is on the horizon. It’s not yet clear that Claude...

Anthropic: Recursive AI Self-Improvement Not Imminent, But Risks Warrant Attention

Anthropic(@AnthropicAI)257 字 (约 2 分钟)
72

Anthropic states recursive self-improvement isn't imminent as Claude lacks research judgment, but if trends continue, AI building its own successors becomes plausible, requiring proactive alignment and societal governance.

入选理由:Claude目前不具备自主选择研究问题的判断能力,递归自改进未实现

FeaturedTweet#AI Safety#Recursive Self-Improvement#Anthropic#Alignment英文
The AI Industry Is Running Out of Time

The AI Industry Is Running Out of Time

The Algorithmic Bridge1003 字 (约 5 分钟)
72

The AI industry faces a narrowing commercialization window as Anthropic and OpenAI accelerate IPOs not just for AGI, but to transfer massive loss risks to public markets before the bubble bursts.

入选理由:Anthropic年化营收近500亿美元并接近盈利,已秘密提交S-1文件准备IPO。

FeaturedArticle#AI Commercialization#IPO#Anthropic#OpenAI#AI Bubble英文
OpenAI执剑人9年恩仇录!惨被Anthropic联创逐出ChatGPT前身

Unpacks the pivotal moment when OpenAI's core members were expelled from the precursor to ChatGPT due to a clash with Anthropic's co-founders, outlining the causal links between technical路线 and corporate governance.

入选理由:2017年,Anthropic联创团队携自研模型加入OpenAI,推动强化学习与人类反馈(RLHF)机制落地。

FeaturedArticle#OpenAI#Anthropic#ChatGPT#Claude#RLHF中文
源:https://t.co/BYhgDDTWH5

AI Will on X: "Source: https://t.co/BYhgDDTWH5"

AI Will(@FinanceYF5)68 字 (约 1 分钟)
65

Claude Mythos is priced at $25 per million input tokens and $125 per million output tokens, with upcoming Anthropic models expected to be similarly priced; businesses should evaluate cost and ROI.

入选理由:Claude Mythos 输入令牌 $25/百万,输出令牌 $125/百万。

FeaturedTweet#Claude#Anthropic#Pricing#AI Model#Cost英文
Simon Willison's Weblog 图标

A quote from Emanuel Maiberg, 404 Media

Simon Willison's Weblog145 字 (约 1 分钟)
62

Simon Willison reposts a correction: Google asked 404 Media to publish a slightly different version of a statement originally claiming that 'it’s critical to maintain humans in the loop' over AI, which was later removed.

入选理由:Google 员工原声明称 AI 决策需“人类在回路”,后被要求撤回。

FeaturedArticle#Google#AI Ethics#PR Strategy#404Media#SimonWillison英文
Claude Mythos 定价为每百万输入令牌 25 美元,每百万输出令牌 125 美元。

估计 Anthropic 未来几周将发布的类 Mythos 模型,定价也会同样高昂。

拭目以待。

Claude Mythos Pricing: $25 per 1M Input Tokens, $125 per 1M Output Tokens

AI Will(@FinanceYF5)135 字 (约 1 分钟)
60

Claude Mythos is priced at $25 per 1M input tokens and $125 per 1M output tokens, with upcoming Mythos-like models from Anthropic expected to maintain high pricing.

入选理由:Claude Mythos 输入令牌定价 25 美元/百万,输出令牌 125 美元/百万。

FeaturedTweet#Claude#Anthropic#Pricing#AI Models#Token Cost中文
The AI IPO Race Heats Up, DOGE Whistleblower Sues Elon Musk, and Instagram Gets Hacked

AI IPO frenzy drives SF real estate sellers to prefer Anthropic stock over cash; Trump's EO mandates 30-day pre-release gov access for AI models; hackers exploit Instagram AI chatbot to compromise high-profile accounts.

入选理由:旧金山部分房产卖家接受Anthropic或OpenAI股票作为支付手段,视其价值高于现金。

FeaturedArticle#AI IPO#Anthropic#AI Safety#Instagram英文
Correction: Claude Opus 4's ~3x average speedup dates to May 2025, not May 2024. 

This evaluation h...

Correction: Claude Opus 4's ~3x Average Speedup Dates to May 2025, Not May 2024

Anthropic(@AnthropicAI)85 字 (约 1 分钟)
45

Anthropic corrected that Claude Opus 4's ~3x average speedup occurred in May 2025, not May 2024; backtests show May 2024 models had no speedup.

入选理由:Claude Opus 4的~3x加速生效时间修正为2025年5月,非2024年5月

FeaturedTweet#Anthropic#Claude Opus 4#LLM Benchmarking#Model Evaluation英文
⚠️ Keep your eye on the ball: 

Anthropic is trying to strike terror into everyone’s hearts – “full ...

Gary Marcus on Anthropic: Recursive Self-Improvement Risk Claims Lack Empirical Support

Gary Marcus(@GaryMarcus)100 字 (约 1 分钟)
45

Gary Marcus argues Anthropic's warnings about AI recursive self-improvement lack evidence; their demo only shows faster coding under full human control, not genuine autonomous evolution.

入选理由:Anthropic警示递归自我改进或致人类失控,但演示仅限受控环境下的代码生成加速。

FeaturedTweet#AI Safety#Recursive Self-Improvement#Anthropic#Gary Marcus英文
⚠️ Keep your eye on the ball, and don’t panic over Anthropic’s new blog. Here’s why: 

Anthropic is ...

Gary Marcus: Don't Panic Over Anthropic's New Blog, Stay Focused

Gary Marcus(@GaryMarcus)201 字 (约 1 分钟)
45

Gary Marcus argues Anthropic's warnings about recursive self-improvement are exaggerated; their actual demo shows only controlled coding acceleration, not true autonomous evolution.

入选理由:Anthropic博客警示递归自改进风险,但Marcus指出这属于过度渲染恐慌情绪。

FeaturedTweet#AI Safety#Anthropic#Recursive Self-Improvement#Gary Marcus英文
Microsoft:我们发了好多新模型、我们 Copilot 更 NB 了,好像我们又又又进入 AI 领域了?

The tweet humorously contrasts Microsoft, OpenAI, Anthropic, and Google's moves in AI but lacks technical details, data, or actionable advice, offering limited value for engineers.

入选理由:微软宣称发布多款新模型并升级 Copilot,但未提供技术细节或性能数据。

FeaturedTweet#AI#Microsoft#OpenAI#Anthropic#Google中文
人肉总结: 选计算机专业!

Human Summary: Choose Computer Science Major!

meng shao(@shao__meng)200 字 (约 1 分钟)
45

The author recommends choosing computer science, refuting the claim that AI is causing mass layoffs among programmers, and argues that demand for tech talent remains strong in the AI era.

入选理由:尽管AI发展迅速,但程序员失业并非普遍现象,行业整体仍需大量技术人才。

FeaturedTweet#Computer Science#AI#Employment Trends#Programmers#Tech Talent中文
From The Problem Solvers, our series featuring founders taking on hard problems with Claude: https:/...

This tweet serves solely as a promotional entry to Anthropic's 'The Problem Solvers' video series, containing no technical details, architectural insights, or engineering practices, offering no direct value to engineers.

入选理由:推文仅含claude.com/problem-solvers链接,无技术摘要或关键数据。

FeaturedTweet#Claude#Anthropic#AI Applications英文
OpenAI and Anthropic Sign Letter to Prevent AI-Developed Biological Weapons

Article extraction failed; only Wired's privacy policy and navigation menus were retrieved. The core content regarding the letter signed by OpenAI and Anthropic to prevent AI-developed biological weapons is missing.

入选理由:文章正文未成功提取,当前内容仅为网站Cookie同意弹窗与页脚导航链接。

FeaturedArticle#AI Safety#Biosecurity#OpenAI#Anthropic英文

跨材料问答 · Anthropic

回答基于:Anthropic 相关 30 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.