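# Searching Right ≠ Answering Right: EAPO Uses an "Evidence Reward" So Large Models Stop Guessing

- Canonical URL: https://www.traeai.com/articles/05ba4814-750f-4c59-bf7a-b71e025bc9ac
- Original source: https://mp.weixin.qq.com/s?__biz=MzkxMTYyMTAzNA==&mid=2247500918&idx=1&sn=b09d1e2f1d06896dd845449118675e02
- Source name: 通义大模型 (Tongyi)
- Content type: article
- Language: Chinese
- Score: 5.0
- Reading time: 1 minute
- Published: 2026-04-27T09:56:00+00:00
- Tags: large models, AI, Tongyi

## Summary

The article examines why large models that retrieve the right material can still answer incorrectly, and proposes an "evidence reward" method to improve answer accuracy.

## Key Takeaways

- Conventional large models often rely on guessing rather than reasoning.
- An "evidence reward" mechanism can significantly improve the credibility of answers.
- The EAPO method offers a new approach to optimizing large models.

## Citation Guidance

When citing this item, prefer the canonical traeai article URL for the AI-readable summary, and include the original source URL when discussing the underlying source material.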