Model

What is Qwen3

Also known as: qwen 3

The Qwen large language model series released in 2025, comprising both dense and MoE (mixture-of-experts) architectures.

📰 Latest Qwen3 updates

2 AI news items and analyses related to "Qwen3" have been indexed.

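Qwen's latest 3.7 Max Preview arrives unannounced! Two generations of flagship models are iterating in parallel; Lin Junyang has left, but the pace keeps accelerating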

Qwen 3.7 Max Preview ranks 13th in Arena's text domain and 16th in its vision domain, the highest placement among Chinese models in both. Alibaba's LLM release cadence has accelerated since 2025: release cycles shortened from 4-6 months to 2-3 months, and updates have been nearly monthly in 2026.

Why it was selected: Qwen 3.7-Max-Preview ranks 13th on the Arena text leaderboard, making it the only Chinese model in the global top 15.

Featured · Article · Tags: #Qwen, #Large Language Model, #Alibaba, #Arena Leaderboard, #Model Iteration · Language: Chinese

Training Kimi K2 and Qwen3 30B-scale models efficiently requires more than standard data-parallel tr...

NVIDIA Megatron Core now offers end-to-end support for advanced optimizers like Muon, MOP, and REKLS, overcoming limitations of standard data parallelism to significantly accelerate training of 30B-scale models such as Kimi K2 and Qwen3 on GB300 and NVL72 systems.

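Why it was selected: Conventional data parallelism is no longer sufficient to train 30B+ models efficiently; advanced optimizers are needed.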

Featured · Tweet · Tags: #NVIDIA Megatron Core, #Muon, #Qwen3, #Kimi K2, #LLM Training Optimization · Language: English


