[AINews] not much happened today
Latent Space1494 字 (约 6 分钟)
63
The article summarizes recent AI industry highlights, covering Anthropic’s Mythos/Opus discussion, the formalization of RSI research, and new long‑horizon evaluation benchmarks, underscoring the reliability gaps in frontier models.
入选理由:Anthropic 的 Opus 4.7 在某些化学任务上已匹配或超越专用 NMR 软件,显示模型在专业领域的潜力。
FeaturedArticle#AI Research#Self‑Improvement#Evaluation Benchmarks#Anthropic#Sakana AI中文