Tip 3: There is a big jump on coding tasks like - SWE-Bench Pro 64.3% - SWE-Bench Verified 87.6% -...

- Opus 4.7在SWE-Bench Pro和Verified等基准测试中得分大幅提升。
- 性能跃升被视作模型编码能力实质性进步的信号。
- 缺乏技术细节、实验设置或与其他模型的横向对比。
- SWE-Bench Pro 64.3%
- SWE-Bench Verified 87.6%
- TerminalBench 69.4%
Altogether, it’s a strong signal that coding performance has improved meaningfully in Opus 4.7 compared to Opus 4.6. https://t.co/UdkPdLtROj" / X
Poe on X: "Tip 3: There is a big jump on coding tasks like - SWE-Bench Pro 64.3% - SWE-Bench Verified 87.6% - TerminalBench 69.4% Altogether, it’s a strong signal that coding performance has improved meaningfully in Opus 4.7 compared to Opus 4.6. https://t.co/UdkPdLtROj" / X
Don’t miss what’s happening
People on X are the first to know.
Post
See new posts
Conversation

Tip 3: There is a big jump on coding tasks like - SWE-Bench Pro 64.3% - SWE-Bench Verified 87.6% - TerminalBench 69.4% Altogether, it’s a strong signal that coding performance has improved meaningfully in Opus 4.7 compared to Opus 4.6.

·
4
2
New to X?
Sign up now to get your own personalized timeline!
Sign up with Apple
By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.
Relevant people
-  Poe @poe_platform Follow Click to Follow poe_platform The best AI, all in one place. GPT-5.4, Grok 4.2, Claude 4.6, Veo 3 and more. At http://poe.com, or for iOS, Android, Mac, or Windows at http://poe.com/download
Trending now
What’s happening
Trending in United States
Chibi
Sports · Trending
#VegasBorn
Trending in United States
$HIGHER
Trending in United States
Nicky
|
|
|
|
|
More
© 2026 X Corp.