Our evaluation of OpenAI's GPT-5.5 cyber capabilities

Simon Willison's Weblog

30th April 2026


Score: 7.5

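TL;DR · AI summary

The UK's AI Security Institute evaluated how well OpenAI's GPT-5.5 finds security vulnerabilities and judged it comparable to Claude Mythos, with the difference that GPT-5.5 is already generally available.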

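Key points

GPT-5.5 was assessed as having cyber capabilities comparable to those of Claude Mythos.
Unlike Claude Mythos, GPT-5.5 is generally available right now.
The evaluation was carried out by the UK's AI Security Institute, which adds credibility to the result.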

Outline


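§ Introduction
Brief background on the evaluation of OpenAI GPT-5.5's cyber capabilities.
· GPT-5.5 capability evaluation
Overview of GPT-5.5's performance at finding security vulnerabilities and how it compares with Claude Mythos.
· Availability comparison
Highlights that GPT-5.5, unlike Claude Mythos, is available immediately.
· Source and evaluating body
Introduces the body that ran the evaluation: the UK's AI Security Institute.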

Mind map


GPT-5.5 evaluation
- Capabilities
  - Comparison with Claude Mythos
- Release status
  - Immediate availability
- Evaluating body
  - UK AI Security Institute

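Highlights

Key sentences worth saving and sharing.

GPT-5.5 was assessed as comparable to Claude Mythos at finding security vulnerabilities, and it is already generally available.
- from the opening paragraph of the post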
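The UK's AI Security Institute previously evaluated the cyber capabilities of the Claude Mythos preview.
- from the opening paragraph of the post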
Sponsor to receive a curated monthly email digest of the most important LLM developments.
- from the sponsorship section

#OpenAI #GPT-5.5 #cybersecurity #AI-Security-Institute


[Simon Willison’s Weblog](http://simonwillison.net/)


30th April 2026 - Link Blog

[Our evaluation of OpenAI's GPT-5.5 cyber capabilities](https://www.aisi.gov.uk/blog/our-evaluation-of-openais-gpt-5-5-cyber-capabilities). The UK's AI Security Institute previously evaluated Claude Mythos. Now they've evaluated GPT-5.5 for finding security vulnerabilities and found it to be comparable to Mythos, but unlike Mythos it's generally available right now.

Posted 30th April 2026 at 11:03 pm


This is a link post by Simon Willison, posted on 30th April 2026.

Tags: ai (1995), openai (416), generative-ai (1768), llms (1734), anthropic (278), claude (272), ai-security-research (16), gpt (124)
