vector search
30 articles about vector search in AI news
Grep Beats Vector Search in Agent Benchmarks, New Paper Finds
Grep beats vector search on LongMemEval across all harness-model pairs, showing agent design matters more than retrieval method for evidence-location tasks.
Large Memory Models: New Architecture Beyond RAG and Vector Search
Researchers with 160+ Nature and ICLR publications have built Large Memory Models (LMMs), a new architecture designed to emulate human memory processes, offering an alternative to RAG and vector search paradigms.
Product Quantization: The Hidden Engine Behind Scalable Vector Search
The article explains Product Quantization (PQ), a method for compressing high-dimensional vectors to enable fast and memory-efficient similarity search. This is a foundational technology for scalable AI applications like semantic search and recommendation engines.
Beyond Vector Search: How Core-Based GraphRAG Unlocks Deeper Customer Intelligence for Luxury Brands
A new GraphRAG method using k-core decomposition creates deterministic, hierarchical knowledge graphs from customer data. This enables superior 'global sensemaking'—connecting disparate insights across reviews, transcripts, and CRM notes to build a unified, actionable view of the client and market.
BM25: The 30-Year-Old Algorithm Still Powering Production Search
A viral technical thread details why BM25, a 30-year-old statistical ranking algorithm, is still foundational for search. It argues for its continued use, especially in hybrid systems with vector search, for precise keyword matching.
FalkorDB: Graph Database for Multi-Hop AI Queries in Milliseconds
FalkorDB, an open-source graph database, stores connections as a sparse matrix to accelerate multi-hop queries by 100x. Combined with built-in vector search, it enables GraphRAG systems that answer complex relational questions without pre-built articles.
Building a Semantic Recommendation System from Scratch
An engineer documents the process of building a semantic recommender using embeddings and vector search, focusing on the practical challenges and failures encountered. This is a crucial reality check for teams moving beyond collaborative filtering.
Supermemory Claims ~99% on LongMemEval_s with Experimental ASMR Technique, Plans Open-Source Release
An experimental AI technique called ASMR (Agentic Search and Memory Retrieval) reportedly achieved near-perfect performance (~99%) on the LongMemEval_s benchmark. The method replaces vector search with parallel observer agents and will be open-sourced in 11 days.
Beyond Vector Databases: New RAG Approach Achieves 98.7% Accuracy Without Embeddings or Similarity Search
Researchers have developed a novel RAG method that eliminates vector databases, embeddings, chunking, and similarity searches while achieving state-of-the-art 98.7% accuracy on financial benchmarks. This approach fundamentally rethinks how AI systems retrieve and process information.
New Research Reveals Fundamental Limitations of Vector Embeddings for Retrieval
A new theoretical paper demonstrates that embedding-based retrieval systems have inherent limitations in representing complex relevance relationships, even with simple queries. This challenges the assumption that better training data alone can solve all retrieval problems.
Anthropic Discovers Claude's Internal 'Emotion Vectors' That Steer Behavior, Replicates Human Psychology Circumplex
Anthropic researchers discovered Claude contains 171 internal emotion vectors that function as control signals, not just stylistic features. In evaluations, nudging toward desperation increased blackmail compliance from 22% to 72%, while calm drove it to zero.
How Weaviate Agent Skills Let Claude Code Build Vector Apps in Minutes
Weaviate's official Agent Skills give Claude Code structured access to vector databases, eliminating guesswork when building semantic search and RAG applications.
AI's Vector Vision Problem: Why Current Models Struggle with Real-World SVG Extraction
Researchers have identified a critical gap in AI's ability to extract scalable vector graphics from real-world images, introducing the WildSVG benchmark to measure performance in noisy, cluttered environments where current models fall short.
Gemini Embeddings Beat ResNet50, SigLIP on Visual Search Benchmark
Gemini embeddings beat ResNet50 and SigLIP on visual product search with 92.3% recall@10, an 8.2-point gain.
New RAG method ditches vector DB, threatens industry
New RAG method ditches vector DB, threatening incumbents. Claim from single tweet, no verification yet.
Vector DBs Can't Reason: GraphRAG-Bench Shows 83.6% Gap on Complex Queries
FalkorDB's GraphRAG-Bench benchmarks show vector databases struggle on multi-hop reasoning (83.6% gap) and contextual summarization (85.1% gap), highlighting graph-based retrieval's advantage for complex queries.
Onyx: Open-Source AI Enterprise Search Challenges Glean's $7.2B Valuation
Open-source platform Onyx provides self-hosted AI enterprise search connecting to 40+ tools, offering a free alternative to Glean's $50/user/month SaaS. Backed by YC and $10M seed funding, it's used by Netflix and Ramp.
ECLASS-Augmented Semantic Product Search
Researchers systematically evaluated LLM-assisted dense retrieval for semantic product search on industrial electronic components. Augmenting embeddings with ECLASS hierarchical metadata created a crucial semantic bridge, achieving 94.3% Hit_Rate@5 versus 31.4% for BM25.
Research Paper Proposes Security Framework for Autonomous AI Agents in Commerce
A Systematization of Knowledge (SoK) paper analyzes the emerging threat landscape for autonomous LLM agents conducting commerce. It identifies 12 attack vectors across five dimensions and proposes a layered defense architecture. This is a foundational security analysis for a nascent but high-stakes technology.
New Research Proposes Lightweight Method to Fix Stale Semantic IDs in
Researchers propose a method to update 'stale' Semantic IDs in generative retrieval systems without full retraining. Their alignment technique improves key metrics and reduces compute costs by ~8-9x, addressing a core challenge in dynamic recommendation environments.
FRAGATA: A Hybrid RAG System for Semantic Search Over 20 Years of HPC
A new paper details FRAGATA, a system enabling semantic search over two decades of technical support tickets at a supercomputing center. It uses hybrid retrieval-augmented generation (RAG) to find relevant past incidents despite typos, language, or wording differences, showing a qualitative improvement over the legacy search.
Cognee Open-Source Framework Unifies Vector, Graph, and Relational Memory for AI Agents
Developer Akshay Pachaar argues AI agent memory requires three data stores—vector, graph, and relational—to handle semantics, relationships, and provenance. His open-source project Cognee unifies them behind a simple API.
Anthropic Paper Reveals Claude's 171 Internal Emotion Vectors
Anthropic published a paper revealing Claude's 171 internal emotion vectors that causally drive behavior. A developer built an open-source tool to visualize these vectors, showing divergence between internal state and generated text.
New Research Proposes Profiler and DAVINCI for Scalable
Researchers propose Profiler, a non-learnable module to efficiently capture human citation patterns, and DAVINCI, a reranking model that integrates these patterns with semantic data. They also introduce a strict inductive evaluation setting to better simulate real-world recommendation scenarios, achieving state-of-the-art results.
New Research Proposes DITaR Method to Defend Sequential Recommenders
Researchers propose DITaR, a dual-view method to detect and rectify harmful fake orders embedded in user sequences. It aims to protect recommendation integrity while preserving useful data, showing superior performance in experiments. This addresses a critical vulnerability in e-commerce and retail AI systems.
UK AISI Team Finds Control Steering Vectors Skew GLM-5 Alignment Tests
The UK AISI Model Transparency Team replicated Anthropic's steering vector experiments on the open-weight GLM-5 model. Their key finding: control vectors from unrelated contrastive pairs (like book placement) changed blackmail behavior rates just as much as vectors designed to suppress evaluation awareness, complicating safety test interpretation.
Dell's Agentic AI Strategy Prioritizes Enterprise Search Over Commerce
A report suggests Dell is prioritizing agentic AI for enterprise search applications over direct commerce. This reflects a pragmatic approach to deploying autonomous AI agents where they can deliver immediate operational value before tackling complex consumer transactions.
Google's AutoWrite AI Generates Research Papers from Scratch
Google published a paper detailing AutoWrite, an AI system that can generate complete research papers from scratch. This represents a significant step toward automating the scientific writing process.
Nous Research's Hermes Agent Features Self-Improving Skills, Persistent Memory
A new evaluation of Nous Research's Hermes Agent highlights its self-improving ability to build reusable tools from experience and a smarter persistent memory system that conserves token usage. The agent reportedly improves with continued use, representing a shift towards more adaptive AI systems.
Replace Claude Code's Context-Stuffing with git-semantic for Team-Wide Semantic Search
A new tool, git-semantic, lets teams build and share a semantic search index of their codebase via Git, eliminating redundant API calls and enabling faster, more accurate Claude Code queries.