Milvus(@milvusio)


Most people's vector database defaults to keeping all data in memory. That might be costing you 3-10x more than it needs to (once you scale to production with 100M+ vectors).

Two ways to stop paying for idle data (both config changes in Milvus, not redesigns):
• MMap (v2.3+): data on local disk, loaded on demand. Stable latency. Disk must hold the full dataset.
• Tiered storage (v2.6+): hot data cached locally, cold data in S3. Cache misses add 50-200ms.

Both need NVMe SSDs (10K+ IOPS).

On 100M vectors (768-dim, float32):

Pick MMap if:
• P99 < 20ms: data is local, no network fetch, no surprise spikes. ~77-230 GB memory (vs 768 GB default).
• Uniform access: tiered storage's cache doesn't help if everything gets hit equally.

Pick tiered storage if:
• Cost is the priority: <77 GB memory (vs 768 GB default). Saves on both memory and disk (70-90% less).
• Clear 80/20 access pattern: hot data cached, cold data stays cheap in S3.
• 500M+ vectors: one node's disk can't hold it all.

Full config walkthrough 👇 milvus.io/blog/how-to-cu
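
The rules of thumb above can be sketched as a small helper. This is an illustration only, not Milvus code: `Workload`, `pick_storage_mode`, and `raw_vector_gb` are hypothetical names, the order in which the rules are checked is an assumption, and reading "clear 80/20" as `hot_fraction <= 0.2` is one possible interpretation. `raw_vector_gb` counts only the float32 payload (about 307 GB for 100M x 768-dim); the post does not break down its 768 GB figure, and the sketch does not try to reproduce it.

```python
from dataclasses import dataclass

BYTES_PER_FLOAT32 = 4


def raw_vector_gb(num_vectors: int, dim: int) -> float:
    """Raw float32 payload in decimal GB, excluding indexes and overhead."""
    return num_vectors * dim * BYTES_PER_FLOAT32 / 1e9


@dataclass
class Workload:
    """Hypothetical description of a deployment, used only for this sketch."""
    num_vectors: int
    p99_budget_ms: float   # latency target at the 99th percentile
    hot_fraction: float    # share of data serving most queries (0.2 ~ "80/20")
    cost_first: bool       # True when cost outranks latency


def pick_storage_mode(w: Workload) -> str:
    """Return "mmap" or "tiered" following the post's rules of thumb.

    The precedence of the checks is an assumption of this sketch.
    """
    # Post: at 500M+ vectors one node's disk can't hold it all,
    # and MMap needs the full dataset on local disk.
    if w.num_vectors >= 500_000_000:
        return "tiered"
    # Post: tiered cache misses add 50-200 ms, so a P99 target
    # under 20 ms points to MMap.
    if w.p99_budget_ms < 20:
        return "mmap"
    # Post: tiered storage pays off when cost is the priority and
    # access is clearly skewed toward a small hot set.
    if w.cost_first and w.hot_fraction <= 0.2:
        return "tiered"
    # Uniform access or no strong cost pressure: the cache helps little.
    return "mmap"


if __name__ == "__main__":
    print(f"{raw_vector_gb(100_000_000, 768):.1f} GB raw float32 payload")
    print(pick_storage_mode(Workload(100_000_000, 15, 0.5, False)))
    print(pick_storage_mode(Workload(600_000_000, 100, 0.2, True)))
```

Checking dataset size first reflects that it is a hard constraint (the data either fits on one node's disk or it doesn't), while latency and cost are preferences; a real decision would weigh these against measured access patterns.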
