What's new with General 365?

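traeai has indexed 1 item related to General 365. The most recent is "Meituan LongCat Open-Sources General 365: Setting a New Yardstick for Reasoning Evaluation," published by the Meituan Tech Team.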

Product

General 365

Meituan's open-source benchmark platform for evaluating general reasoning.

Alias: General365

1 highly relevant item tracked

Meituan LongCat Open-Sources General 365: Setting a New Yardstick for Reasoning Evaluation

Meituan Tech Team · Score: 8.5

Meituan has open-sourced the General 365 benchmark, which reveals the real capability boundaries of large models in general reasoning.

Meituan Tech Team · May 18 · 2,029 characters (about 9 minutes)

Why it was selected: Gemini 3 Pro reaches only 62.8% accuracy on General 365, and most models fall below the passing mark.

Featured · Article · #Large Model · #Reasoning Evaluation · #General Reasoning · Chinese

Answer based on: 1 item related to General 365