🚨 Bad news, folks: Google just uncovered a fundamental bug in RAG! 🔍 TL;DR: Our go-to embedding search might not be all it’s cracked up to be. Turns out, with fixed vector dimensions, it’s impossible to retrieve all relevant documents from the database. Google proved this both theoretically and experimentally. So, what’s the scoop? Modern search and RAG often rely on single-vector embeddings: one vector per query and document, measuring similarity with dot products or cosine similarity, then snagging the top-k closest matches. But here’s the kicker: is it even possible to always return the correct top-k docs for any query with fixed-vector dimensions? Spoiler alert: Nope! And this flops even with simple examples. Why? As your knowledge base grows, so do the diverse combos of queries and relevant docs we need to keep track of. But guess what? The search space is limited by embedding dimensions. So once you hit a certain number of docs, placing those points in space correctly fo
🚨 Bad news, folks: Google just uncovered a fundamental bug in RAG
16 сентября 202516 сен 2025
2 мин