lisan say more mean things about us you're being too nice

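- Sam Altman encourages the user lisan to voice criticism more directly.
- GPT-5.5 performs on par with Claude Mythos on specific tasks and solves expert-level tasks quickly and at low cost.
- Lisan al Gaib shares AI model performance data through the LisanBench website.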

Sam Altman 
lisan say more mean things about us you're being too nice
Quote

Lisan al Gaib (@scaling01) · 9h
GPT-5.5 is on par with Claude Mythos
- GPT-5.5 average pass rate of 71.4% (±8.0%)
- Mythos Preview 68.6% (±8.7%)
- GPT-5.5 solved a task that takes a human expert ~12 hours in under 11 minutes at a cost of $1.73
x.com/AISecurityInst…

Relevant people
- Sam Altman (@sama): AI is cool i guess
- Lisan al Gaib (@scaling01): lead them to paradise. LisanBench: https://lisanbench.com Impressum & Datenschutz: https://lisanbench.com/legal