GRPO 最近有什么新动态？

traeai 已收录 5 篇与 GRPO 相关的内容。最新一篇是「从高拟真到真可用，LongCat-Video-Avatar 1.5 正式开源」，由美团技术团队发布。

概念

什么是 GRPO？

Q: 什么是 GRPO？

论文中提到的技能生成方法。

为什么现在值得关注？

如果只读 3 篇

从高拟真到真可用，LongCat-Video-Avatar 1.5 正式开源

美团技术团队 · 9.2 分

https://t.co/nw0GoHamCI

宝玉(@dotey) · 9.2 分

英伟达推出 AI 框架 Polar，让 Codex 跑分暴涨 594.74%

AI HOT 精选 · 8.5 分

📰 GRPO 最新动态

已收录 5 篇与「GRPO」相关的 AI 资讯和分析。

From High Fidelity to Real Usability: LongCat-Video-Avatar 1.5 is Now Open-Sourced

美团技术团队5月25日2226 字 (约 9 分钟)

The open-sourced version of LongCat-Video-Avatar 1.5 significantly improves lip-sync accuracy, physical realism, and long-video stability for commercial-grade digital human video generation.

入选理由：采用 Whisper-large 编码器，提升唇形同步与动作稳定性。

FeaturedArticle#Digital Human#Video Generation#AI Model#Open Source Project#Meituan中文

DeepSeek's $10 Trillion Grand Strategy [Translation]

宝玉(@dotey)5月25日5655 字 (约 23 分钟)

DeepSeek builds a low-cost, high-efficiency model system through multiple foundational innovations to drive China's $10 trillion AI hardware ecosystem and achieve its own $1 trillion valuation.

入选理由：DeepSeek V4 Pro在100万上下文中仅需5.48GB HBM显存，远低于竞品的60-89GB。

FeaturedTweet#DeepSeek#AI Model#MoE#KV Cache Optimization#Hardware Ecosystem中文

英伟达推出 AI 框架 Polar，让 Codex 跑分暴涨 594.74%

AI HOT 精选5月28日907 字 (约 4 分钟)

英伟达推出开源框架 Polar，显著提升 Codex 等智能体的性能和效率。

入选理由：Polar 框架让 Codex 在 SWE-Bench Verified 测试中的 pass@1 分数提升了 594.74%。

FeaturedArticle#英伟达#Polar#AI 框架#Codex#强化学习中文

Don't rush to RL after SFT! Your multimodal large model may have been training with injuries

量子位5月17日2434 字 (约 10 分钟)

SFT may introduce distribution bias during the training of multimodal large models, leading to performance degradation in the RL phase. PRISM addresses this issue through a three-stage pipeline.

入选理由：SFT可能导致模型性能下降，如Qwen3-VL-8B SFT后准确率下降5.2%

FeaturedArticle#Multimodal#Large Model#PRISM中文

// Automating SKILL.md Generation // Increasingly, mining sessions is one of the best ways to impro...

elvis(@omarsar0)6月19日209 字 (约 1 分钟)

文章介绍了一种通过挖掘会话数据自动生成SKILL.md的方法，但实际效果有限。

入选理由：OpenAI 的 Codex 能够从交互中打包技能，但效果有限。

FeaturedTweet#AI#OpenAI#Codex#技能生成中英混合

与「GRPO」经常一起出现的 AI 术语。

美团 DMD LongCat-Video-Avatar 1.5 Whisper-large MoE MLA 梁文锋长江存储 DeepSeek per_request prefix_merging SWE-bench Verified

💡 想追踪「GRPO」的长期趋势？去实体雷达 · GRPO 查看详细分析和跨材料问答。

什么是 GRPO？

为什么现在值得关注？

如果只读 3 篇

📰 GRPO 最新动态

From High Fidelity to Real Usability: LongCat-Video-Avatar 1.5 is Now Open-Sourced

DeepSeek's $10 Trillion Grand Strategy [Translation]

英伟达推出 AI 框架 Polar，让 Codex 跑分暴涨 594.74%

Don't rush to RL after SFT! Your multimodal large model may have been training with injuries

// Automating SKILL.md Generation // Increasingly, mining sessions is one of the best ways to impro...

🔗 相关术语