Introducing NVIDIA Nemotron 3 Ultra: An Open 550B Model for Long-Running Agents
NVIDIA today launches Nemotron 3 Ultra, a 550B-parameter open model built on the same architecture as Nemotron 3 Super, optimized for long-running AI agents. It employs LatentMoE to quadruple the number of experts at the same inference cost, introduces multi-token prediction to boost single-user inference speed, and is released under the Linux Foundation’s Open MDW license to enable enterprise deployment.
入选理由:Nemotron 3 Ultra 为 550B 参数模型,基于与 Nemotron 3 Super 相同架构,面向长时运行的智能代理场景。

