and here https://t.co/4zz2mrWf3Y

- Cursor IDE显著提升大模型在复杂任务中的表现
- 测试涵盖Gemini、GPT-5.4和Opus等多个前沿模型
- 评估标准是模型对100项产品需求的实现能力
https://t.co/4zz2mrWf3Y" / X
eric zakariasson on X: "and here https://t.co/4zz2mrWf3Y" / X
Don’t miss what’s happening
People on X are the first to know.
Post
See new posts
Conversation

eric zakariasson 
and here
Quote

edwin

@edwinarbus
·
Mar 16
Matt Maher tested frontier models in Cursor v. other harnesses. Cursor boosted model performance by 11% on average: Gemini: 52% → 57% GPT-5.4: 82% → 88% Opus: 77% → 93% His benchmark measures how well models implement a 100-feature PRD. @cursor_ai consistently outperformed.

·
1
20
2
New to X?
Sign up now to get your own personalized timeline!
Sign up with Apple
By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.
Relevant people
-  eric zakariasson  @ericzakariasson Follow Click to Follow ericzakariasson @cursor_ai & tinkering. http://colf.dev
-  edwin  @edwinarbus Follow Click to Follow edwinarbus @cursor_ai , previously @openai and @stripe
Trending now
What’s happening
Music · Trending
#BANGCHELLA
Trending with #BIGBANG
Trending in United States
John Denver
Trending in United States
GO KNIGHTS GO
Trending in United States
Piccolo
|
|
|
|
|
More
© 2026 X Corp.