
Paper

MCP Atlas

Alias: Model Context Performance Atlas

A composite metric that measures model performance in long-context scenarios.

Related materials

2 items related to MCP Atlas have been indexed, sorted by score.

Read more from @MiniMax_AI:

MiniMax introduces M3, the first open-weight model combining coding, agentic, and long-context capabilities, achieving 59%+ on benchmarks like SWE-Bench Pro with 1M context support, advancing open-source LLMs toward multi-capability frontiers.

Why it was selected: MiniMax M3 scores 59.0% on the SWE-Bench Pro benchmark, ahead of most open-source models.

Featured · Tweet · #Open-source model #Large language model #Coding capability #Long context #MiniMax · English
1/ Today at #GoogleIO, we’re releasing Gemini 3.5, our latest family of models combining frontier in...

Jeff Dean Announces Gemini 3.5

Jeff Dean (@JeffDean) · 268 words (about 2 minutes)
Score: 85

Google releases the Gemini 3.5 family, starting with 3.5 Flash for complex agentic workflows. It outperforms 3.1 Pro on coding and agent benchmarks and runs 4x faster, reaching 12x in Antigravity.

Why it was selected: Gemini 3.5 Flash is designed to run complex, long-horizon agentic workflows.

Featured · Tweet · #Google #Gemini #AI Agents #LLM #Google I/O · English
