论文

BTX

Q: BTX 最近有什么新动态？

traeai 已收录 1 篇与 BTX 相关的内容。最新一篇是「EMO: Pretraining mixture of experts for emergent modularity」，由 Hugging Face Blog 发布。

别名：BTX model

早期尝试基于预定义领域进行专家路由的研究工作。

已跟踪 1 条高相关材料

TraeAI 观察

如果只读 3 篇

EMO: Pretraining mixture of experts for emergent modularity

Hugging Face Blog · 9 分

EMO 是一种通过端到端预训练实现模块化涌现的混合专家模型，仅需12.5%的专家即可保持接近全模型性能，同时支持按需组合专家，显著提升大模型部署效率与灵活性。

EMO: Pretraining Mixture of Experts for Emergent Modularity

Hugging Face Blog5月9日1748 字 (约 7 分钟)

EMO is a mixture-of-experts model that achieves modular structure emergence through end-to-end pretraining, retaining near-full-model performance with only 12.5% of experts activated.

入选理由：EMO 使用14B总参数、1B活跃参数，仅激活1/8专家即达近全模型性能。

FeaturedArticle#Mixture of Experts#Modularity#Large Language Model#AI Research#Pretraining中文

跨材料问答 · BTX

回答基于：BTX 相关 1 条材料