We published new research on how we serve post-trained Qwen3 235B models on NVIDIA GB200 NVL72 Blackwell racks
Perplexity(@perplexity_ai)101 字 (约 1 分钟)
85
Perplexity published new research on deploying the Qwen3 235B model on NVIDIA GB200 NVL72 Blackwell racks, showing that GB200 outperforms Hopper in high-throughput inference for large MoE models.
入选理由:Qwen3 235B 模型在 NVIDIA GB200 上实现了高效的高吞吐量推理。
FeaturedTweet#NVIDIA#GB200#Qwen3#MoE#High Performance Computing中文
