T
traeai
Sign in

模型

Mellum2

别名:Mellum 2

JetBrains开源的12B参数AI模型,采用MoE架构,专为软件工程环境设计,推理速度比同类模型快50%。

已跟踪 2 条高相关材料

TraeAI 观察

相关材料

已收录 2 条与 Mellum2 相关的内容,按评分排序。

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Hugging Face Blog564 字 (约 3 分钟)
90

JetBrains releases Mellum2, a 12B-parameter MoE model activating only 2.5B params per token, offering 2x+ faster inference than peers, optimized for text/code tasks and private/RAG deployments.

入选理由:Mellum2 是 12B 参数 MoE 模型,每 token 仅激活 2.5B 参数,推理效率提升 2x+,适合高吞吐生产环境。

FeaturedArticle#MoE#JetBrains#Large Model#Code Generation#RAG英文
Mellum2 Goes Open Source: A Fast Model for AI Workflows

Mellum2 Goes Open Source: A Fast Model for AI Workflows

The JetBrains Blog606 字 (约 3 分钟)
85

Mellum2 is an open-source 12B parameter AI model from JetBrains, using MoE architecture to activate only 2.5B parameters per token, reducing inference time by over 50% compared to similar-sized models, specifically designed for software engineering environments with applications in routing, RAG pipelines, and private AI deployment.

入选理由:Mellum2采用MoE架构,12B参数模型每token仅激活2.5B参数,推理速度比同类模型快50%,显著降低生产环境延迟和成本

FeaturedArticle#AI#Model#Mellum2#MoE#Software Engineering中文

跨材料问答 · Mellum2

回答基于:Mellum2 相关 2 条材料
    0 / 500

    AI may generate inaccurate information. Please verify important content.