SGLang 最近有什么新动态？

traeai 已收录 7 篇与 SGLang 相关的内容。最新一篇是「[AINews] Reve 2 and Ideogram 4: Layouts in Imagegen」，由 Latent Space 发布。

产品

什么是 SGLang？

用于 UniRL 的可插拔 rollout 引擎。

为什么现在值得关注？

如果只读 3 篇

[AINews] Reve 2 and Ideogram 4: Layouts in Imagegen

Latent Space · 8.7 分

1、Most RL stacks are built for one modality. UniRL applies a single post-training loop — generate → ...

Hunyuan(@TXhunyuan) · 8.5 分

Benchmarking inference at scale: coding agents

Together AI Blog · 8.5 分

📰 SGLang 最新动态

已收录 7 篇与「SGLang」相关的 AI 资讯和分析。

Reve 2 and Ideogram 4: Layouts in Imagegen

Latent Space6月4日1547 字 (约 7 分钟)

Advances in image composition are simultaneously broken by Reve 2 and Ideogram 4, with Ideogram 4 now the top-ranked open image model on Arena. Microsoft released MAI-Thinking-1 achieving 97% on AIME 2025 without synthetic data or distillation, publishing detailed training stacks and MoE scaling. Frontier Tuning enables enterprise workflow models to reach GPT-5.4 quality with up to 10× efficiency gains, while Gemma 4 12B and others strengthen local-first deployment momentum.

入选理由：Ideogram 4.0 登顶 Arena 开放图像模型榜单，图像布局能力显著提升。

FeaturedArticle#ImageGen#Layouts#MAI-Thinking-1#Frontier Tuning#Gemma 4 12B英文

1、Most RL stacks are built for one modality. UniRL applies a single post-training loop — generate → ...

Hunyuan(@TXhunyuan)6月10日257 字 (约 2 分钟)

UniRL 是一个统一的强化学习框架，支持多种模态和模型，通过单一训练循环实现跨模型家族的训练。

入选理由：UniRL 使用单一训练循环（generate → score → advantage → update → sync）支持多种模态和模型。

FeaturedTweet#强化学习#UniRL#Tencent#AI框架英文

Benchmarking inference at scale: coding agents

Together AI Blog5月21日1358 字 (约 6 分钟)

Together Inference Engine delivers 31% more TPS than next fastest OSS engine on same hardware, maintains 2× better TTFT at saturation. Performance gains come from full-stack optimization.

入选理由：ThunderMLA、自定义内核重写和端到端优化使Together引擎比其他OSS引擎多31%的TPS

FeaturedArticle#Together AI#Inference Engine#Coding Agent#Performance Optimization#TTFT英文

Domestic GPU Forms an Open Source Alliance, Attracting Core Developers of SGLang and More!

量子位5月15日3400 字 (约 14 分钟)

Chinese GPU manufacturer Moore Threads attracts core developers of multiple large model inference frameworks through open source activities.

入选理由：SGLang在12个H100节点上实现52.3k输入token/s/node性能

FeaturedArticle#GPU#Open Source Ecosystem#Large Model Inference中文

163: 详解DeepSeekV4：Infra巨鲸、百万上下文走进现实、极致效率优化

晚点聊 LateTalk5月1日2079 字 (约 9 分钟)

DeepSeekV4发布，通过组合创新和工程优化，在R1的“测试时扩展”范式下，实现百万上下文从理论到实用的飞跃，对Agent和多步复杂任务具有重要意义。

入选理由：DeepSeek V4沿用现有范式，未带来范式变化，但通过一系列技术创新显著提升了长上下文处理能力。

FeaturedPodcast#DeepSeek#大模型#注意力机制#优化器#稀疏注意力中文

SGLang is hitting 180 tok/s/GPU on DeepSeek-V4 decode with ~1M context on Blackwell. Good to see f...

NVIDIA AI(@NVIDIAAI)5月1日212 字 (约 1 分钟)

NVIDIA AI 报告称，SGLang 在 Blackwell 硬件上使用 DeepSeek-V4 模型解码达到 180 tok/s/GPU 的速度，约 1M 上下文，得益于 LMSYS 组织针对 Blackwell 的特定优化，提高了混合稀疏注意力的利用效率。

入选理由：SGLang 在 DeepSeek-V4 解码任务上实现高性能，达 180 tok/s/GPU。

FeaturedTweet#NVIDIA#DeepSeek-V4#SGLang#Blackwell#LMSYS中文

> Ecosystem: Compatible with llama.cpp, MLX, @LMStudio, vLLM, @ollama, @UnslothAI, and SGLang.
&g...

Google AI Developers: Gemma 4 Ecosystem Compatibility and Downloads

Google AI Developers(@googleaidevs)6月4日78 字 (约 1 分钟)

Google announces its model weights are compatible with major open-source ecosystems and can be directly downloaded from Hugging Face and Kaggle, lowering deployment barriers.

入选理由：Gemma 4 权重与 llama.cpp、vLLM、Ollama 等生态兼容，便于本地部署与推理。

FeaturedTweet#Gemma#Open-source Ecosystem#Model Deployment#Hugging Face#Kaggle英文

与「SGLang」经常一起出现的 AI 术语。

OpenAI Gemma 4 12B Ideogram MAI-Thinking-1 dspy.GEPA GPT-Image 2 Reve DRPO FlowDPPO Tencent UniRL Hunyuan-Image 3

💡 想追踪「SGLang」的长期趋势？去实体雷达 · SGLang 查看详细分析和跨材料问答。

什么是 SGLang？

为什么现在值得关注？

如果只读 3 篇

📰 SGLang 最新动态

Reve 2 and Ideogram 4: Layouts in Imagegen

1、Most RL stacks are built for one modality. UniRL applies a single post-training loop — generate → ...

Benchmarking inference at scale: coding agents

Domestic GPU Forms an Open Source Alliance, Attracting Core Developers of SGLang and More!

163: 详解DeepSeekV4：Infra巨鲸、百万上下文走进现实、极致效率优化

SGLang is hitting 180 tok/s/GPU on DeepSeek-V4 decode with ~1M context on Blackwell. Good to see f...

Google AI Developers: Gemma 4 Ecosystem Compatibility and Downloads

🔗 相关术语