Timeline
Google Research publishes TurboQuant paper claiming 80% AI cost reduction and 6x memory reduction
New RAG paradigm with iterative retrieval at multiple reasoning steps achieves 15-20% accuracy gain on HotpotQA
Positioned as go-to technique for dynamic, fact-heavy applications with frequently changing information
Research exposed a critical vulnerability where just 5 poisoned documents can corrupt RAG systems.
Clarification article published explaining distinction between RAG and fine-tuning for LLM applications
Publication of a framework moving RAG systems from proof-of-concept to production, outlining anti-patterns and a five-pillar architecture.
Ethan Mollick declared the end of the 'RAG era' as dominant paradigm for AI agents
Novel compression algorithm unveiled that reduces LLM memory footprint by 6x
Ecosystem
Retrieval-Augmented Generation
TurboQuant
No mapped relationships