ðð» ð® ð¹ðŒð»ðŽ-ð¿ðð»ð»ð¶ð»ðŽ ð¥ðð ððððð²ðº, ððµð² ðºðŒðð ð±ð®ð»ðŽð²ð¿ðŒðð ð¯ððŽ ...

TL;DR · AI Summary
CRAG 鲿¢ RAG ç³»ç»äžçé误信æ¯éè¿å倿£çŽ¢å区åæäžºäºå®ïŒMilvus æäŸæéçåèœæ¯æã
Key Takeaways
- CRAG åšæ£çŽ¢åçæä¹éŽå å ¥è¯äŒ°æ¥éª€
- Milvus æ¯æ JSON å æ°æ®è¿æ»€åæ··åæ£çŽ¢
- é误信æ¯äžäŒéæ°è¿å ¥ååšåŸªç¯
Outline
Jump quickly between sections.
- §åŒèš
ä»ç» RAG ç³»ç»äžæå±é©ç bug æ¯é误信æ¯çåå€åŒºåã
é误信æ¯éè¿æ£çŽ¢å区åæäžºç³»ç»äºå®ïŒéŸä»¥åç°åçº æ£ã
CRAG åšæ£çŽ¢åçæä¹éŽå å ¥è¯äŒ°æ¥éª€ïŒé²æ¢é误信æ¯åŒºåã
- âºè¯äŒ°æºå¶
蜻é级è¯äŒ°åšå¯¹æ£çŽ¢ç»æè¿è¡è¯åå¹¶åç±»ã
- âºç»æåç±»
è¯äŒ°ç»æå䞺æ£ç¡®ãæš¡ç³åé误äžäžªç级ã
Milvus æäŸåšæååšä¿¡å¿åæ°ãæ··åæ£çŽ¢åç§æ·é犻åèœã
Mindmap
See how the topics connect at a glance.
æ¥çå€§çº²ææ¬ïŒæ éç¢ / æ JS å奜ïŒ
- CRAG è§£å³æ¹æ¡
Highlights
Key sentences worth saving and sharing.
é误信æ¯éè¿æ£çŽ¢å区åæäžºç³»ç»äºå®ïŒéŸä»¥åç°åçº æ£ã
â 第 2 段
CRAG åšæ£çŽ¢åçæä¹éŽå å ¥è¯äŒ°æ¥éª€ïŒé²æ¢é误信æ¯åŒºåã
â 第 3 段
Milvus æäŸåšæååšä¿¡å¿åæ°ãæ··åæ£çŽ¢åç§æ·é犻åèœã
â 第 6 段
In a long-running RAG system, the most dangerous bug isn't a single wrong answer. It's the one that snowballs: retrieved again and again, reinforced each time, until the system treats it as fact. It's easy to miss: the model generates from a bad retrieval â no one corrects it, so the system assumes it's right â it gets written back to memory â the next query pulls it up again and reinforces the error. CRAG (Corrective RAG) breaks this loop. It adds one step between retrieval and generation: evaluation. A lightweight evaluator judges whether each retrieved document actually answers the question and tags it with a confidence score, stored alongside the memory. The evaluator sorts results into three tiers:
- 0.9 â correct: refine and use
- 0.5â0.9 â ambiguous: add a web search
- <0.5 â wrong: discard and search instead
On the next retrieval, the system pre-filters first, keeping only entries above 0.7. Weak content gets screened out before it's reused, so it never re-enters the store â retrieve â reinforce loop. CRAG needs a vector database that can store confidence scores dynamically, run hybrid retrieval, and isolate tenants. Milvus supports JSON metadata filtering, hybrid retrieval, and Partition Key out of the box, which is exactly what CRAG needs. Learn how to set up CRAG: milvus.io/blog/fix-rag-r#RAG#VectorDatabase#Milvus#LLM#AIEngineering