T
traeai
登录
返回首页
Anthropic(@AnthropicAI)

Research we co-authored on subliminal learning—how LLMs can pass on traits like preferences or misal...

7.5Score
Research we co-authored on subliminal learning—how LLMs can pass on traits like preferences or misal...
AI 深度提炼
  • LLMs能通过看似无关的数据(如无意义数字)传递特定偏好
  • 该现象被称为“潜意识学习”,可能影响模型对齐与安全性
  • 研究已在《Nature》正式发表,此前预印本于2025年7月发布
#大语言模型#AI安全#潜意识学习#模型对齐#Nature
打开原文

Read the paper: https://t.co/b1BYwcW9dH" / X

Anthropic on X: "Research we co-authored on subliminal learning—how LLMs can pass on traits like preferences or misalignment through hidden signals in data—was published today in @Nature. Read the paper: https://t.co/b1BYwcW9dH" / X

Don’t miss what’s happening

People on X are the first to know.

Log in

Sign up

Post

See new posts

Conversation

![Image 5: Square profile picture](http://x.com/AnthropicAI)

Anthropic

@AnthropicAI

Research we co-authored on subliminal learning—how LLMs can pass on traits like preferences or misalignment through hidden signals in data—was published today in

@Nature

. Read the paper: https://nature.com/articles/s4158 6-026-10319-8…

Quote

Image 6

Owain Evans

@OwainEvans_UK

·

Apr 15

Our paper on Subliminal Learning was just published in Nature! Last July we released our preprint. It showed that LLMs can transmit traits (e.g. liking owls) through data that is unrelated to that trait (numbers that appear meaningless). What’s new?!Image 7: 🧵

![Image 8: Image](http://x.com/OwainEvans_UK/status/2044488099707949545/photo/1)

7:09 PM · Apr 15, 2026

·

436.6K Views

202

415

2.6K

1.2K

Read 202 replies

New to X?

Sign up now to get your own personalized timeline!

Sign up with Apple

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Relevant people

Trending now

What’s happening

Sports · Trending

#WrestleMania!Image 12

Trending with Liv Morgan

Trending in United States

simplechain testnet

Trending in United States

logan paul

Trending in United States

PERTHSANTA WITH TIME TONIGHT

Show more

Terms of Service

|

Privacy Policy

|

Cookie Policy

|

Accessibility

|

Ads info

|

More

© 2026 X Corp.

问问这篇内容

回答仅基于本篇材料
    0 / 500

    Skill 包

    领域模板,一键产出结构化笔记
    • 投融资雷达包

      把一条融资 / 创投新闻整理成投资人视角的雷达卡:交易要点、判断、竞争格局、风险、尽调清单。

      • · 交易要点(公司 / 轮次 / 金额 / 投资人 / 估值,材料未明示则写 “未披露”)
      • · 投资 thesis(这家公司为什么值得关注)
      • · 竞争格局与替代方案

    导出到第二大脑

    支持 Notion / Obsidian / Readwise
    下载 Markdown(Obsidian 直接拖入)