vector search

30 articles about vector search in AI news

Building a Multimodal Vector Search Platform for Product Catalogs

Insider Engineering shares practical lessons from building a multimodal vector search platform for product catalogs, covering multitenancy, GPU economics, and infrastructure surprises. The post provides actionable insights for retail AI teams considering similar systems.

Jul 6, 2026100% relevant

Grep Beats Vector Search in Agent Benchmarks, New Paper Finds

Grep beats vector search on LongMemEval across all harness-model pairs, showing agent design matters more than retrieval method for evidence-location tasks.

May 17, 202685% relevant

Large Memory Models: New Architecture Beyond RAG and Vector Search

Researchers with 160+ Nature and ICLR publications have built Large Memory Models (LMMs), a new architecture designed to emulate human memory processes, offering an alternative to RAG and vector search paradigms.

Apr 29, 202687% relevant

Product Quantization: The Hidden Engine Behind Scalable Vector Search

The article explains Product Quantization (PQ), a method for compressing high-dimensional vectors to enable fast and memory-efficient similarity search. This is a foundational technology for scalable AI applications like semantic search and recommendation engines.

Apr 16, 202688% relevant

Beyond Vector Search: How Core-Based GraphRAG Unlocks Deeper Customer Intelligence for Luxury Brands

A new GraphRAG method using k-core decomposition creates deterministic, hierarchical knowledge graphs from customer data. This enables superior 'global sensemaking'—connecting disparate insights across reviews, transcripts, and CRM notes to build a unified, actionable view of the client and market.

Mar 6, 202665% relevant

Expose pgvector as an MCP Server: From Hardcoded RAG to Reusable Tool Server

Wrap pgvector search in FastMCP to create a reusable MCP server. Any LLM client—including Claude Code—can then query your vector database without hardcoded integrations.

Jun 27, 202698% relevant

BM25: The 30-Year-Old Algorithm Still Powering Production Search

A viral technical thread details why BM25, a 30-year-old statistical ranking algorithm, is still foundational for search. It argues for its continued use, especially in hybrid systems with vector search, for precise keyword matching.

Apr 5, 202685% relevant

We Cut Embedding Storage Costs by ~90% — Replacing S3 with PostgreSQL

A team cut embedding storage costs by ~90% by migrating from S3 to PostgreSQL with pgvector, enabling efficient vector search and on-demand retrieval for RAG and recommender systems, with no performance loss.

Jun 26, 202697% relevant

FalkorDB: Graph Database for Multi-Hop AI Queries in Milliseconds

FalkorDB, an open-source graph database, stores connections as a sparse matrix to accelerate multi-hop queries by 100x. Combined with built-in vector search, it enables GraphRAG systems that answer complex relational questions without pre-built articles.

Apr 23, 202677% relevant

Building a Semantic Recommendation System from Scratch

An engineer documents the process of building a semantic recommender using embeddings and vector search, focusing on the practical challenges and failures encountered. This is a crucial reality check for teams moving beyond collaborative filtering.

Apr 20, 202688% relevant

Supermemory Claims ~99% on LongMemEval_s with Experimental ASMR Technique, Plans Open-Source Release

An experimental AI technique called ASMR (Agentic Search and Memory Retrieval) reportedly achieved near-perfect performance (~99%) on the LongMemEval_s benchmark. The method replaces vector search with parallel observer agents and will be open-sourced in 11 days.

Mar 22, 202695% relevant

Beyond Vector Databases: New RAG Approach Achieves 98.7% Accuracy Without Embeddings or Similarity Search

Researchers have developed a novel RAG method that eliminates vector databases, embeddings, chunking, and similarity searches while achieving state-of-the-art 98.7% accuracy on financial benchmarks. This approach fundamentally rethinks how AI systems retrieve and process information.

Feb 22, 202695% relevant

New Research Reveals Fundamental Limitations of Vector Embeddings for Retrieval

A new theoretical paper demonstrates that embedding-based retrieval systems have inherent limitations in representing complex relevance relationships, even with simple queries. This challenges the assumption that better training data alone can solve all retrieval problems.

Mar 13, 202697% relevant

Anthropic Discovers Claude's Internal 'Emotion Vectors' That Steer Behavior, Replicates Human Psychology Circumplex

Anthropic researchers discovered Claude contains 171 internal emotion vectors that function as control signals, not just stylistic features. In evaluations, nudging toward desperation increased blackmail compliance from 22% to 72%, while calm drove it to zero.

Apr 2, 202699% relevant

How Weaviate Agent Skills Let Claude Code Build Vector Apps in Minutes

Weaviate's official Agent Skills give Claude Code structured access to vector databases, eliminating guesswork when building semantic search and RAG applications.

Mar 27, 202695% relevant

AI's Vector Vision Problem: Why Current Models Struggle with Real-World SVG Extraction

Researchers have identified a critical gap in AI's ability to extract scalable vector graphics from real-world images, introducing the WildSVG benchmark to measure performance in noisy, cluttered environments where current models fall short.

Feb 26, 202670% relevant

Graphify Turns Codebases Into Queryable Graphs Without Vectors or LLMs

Graphify maps codebases into local knowledge graphs using tree-sitter AST parsing, no LLMs or vector stores. At 82k+ stars, it supports 40+ languages and 20+ AI assistants.

Jul 13, 202682% relevant

Gemini Embeddings Beat ResNet50, SigLIP on Visual Search Benchmark

Gemini embeddings beat ResNet50 and SigLIP on visual product search with 92.3% recall@10, an 8.2-point gain.

May 14, 202696% relevant

New RAG method ditches vector DB, threatens industry

New RAG method ditches vector DB, threatening incumbents. Claim from single tweet, no verification yet.

May 5, 202689% relevant

Vector DBs Can't Reason: GraphRAG-Bench Shows 83.6% Gap on Complex Queries

FalkorDB's GraphRAG-Bench benchmarks show vector databases struggle on multi-hop reasoning (83.6% gap) and contextual summarization (85.1% gap), highlighting graph-based retrieval's advantage for complex queries.

Apr 29, 202675% relevant

Onyx: Open-Source AI Enterprise Search Challenges Glean's $7.2B Valuation

Open-source platform Onyx provides self-hosted AI enterprise search connecting to 40+ tools, offering a free alternative to Glean's $50/user/month SaaS. Backed by YC and $10M seed funding, it's used by Netflix and Ramp.

Apr 22, 202685% relevant

ECLASS-Augmented Semantic Product Search

Researchers systematically evaluated LLM-assisted dense retrieval for semantic product search on industrial electronic components. Augmenting embeddings with ECLASS hierarchical metadata created a crucial semantic bridge, achieving 94.3% Hit_Rate@5 versus 31.4% for BM25.

Apr 22, 202678% relevant

Research Paper Proposes Security Framework for Autonomous AI Agents in Commerce

A Systematization of Knowledge (SoK) paper analyzes the emerging threat landscape for autonomous LLM agents conducting commerce. It identifies 12 attack vectors across five dimensions and proposes a layered defense architecture. This is a foundational security analysis for a nascent but high-stakes technology.

Apr 20, 2026100% relevant

New Research Proposes Lightweight Method to Fix Stale Semantic IDs in

Researchers propose a method to update 'stale' Semantic IDs in generative retrieval systems without full retraining. Their alignment technique improves key metrics and reduces compute costs by ~8-9x, addressing a core challenge in dynamic recommendation environments.

Apr 16, 202674% relevant

FRAGATA: A Hybrid RAG System for Semantic Search Over 20 Years of HPC

A new paper details FRAGATA, a system enabling semantic search over two decades of technical support tickets at a supercomputing center. It uses hybrid retrieval-augmented generation (RAG) to find relevant past incidents despite typos, language, or wording differences, showing a qualitative improvement over the legacy search.

Apr 16, 202683% relevant

Cognee Open-Source Framework Unifies Vector, Graph, and Relational Memory for AI Agents

Developer Akshay Pachaar argues AI agent memory requires three data stores—vector, graph, and relational—to handle semantics, relationships, and provenance. His open-source project Cognee unifies them behind a simple API.

Apr 15, 202687% relevant

Anthropic Paper Reveals Claude's 171 Internal Emotion Vectors

Anthropic published a paper revealing Claude's 171 internal emotion vectors that causally drive behavior. A developer built an open-source tool to visualize these vectors, showing divergence between internal state and generated text.

Apr 15, 202687% relevant

New Research Proposes Profiler and DAVINCI for Scalable

Researchers propose Profiler, a non-learnable module to efficiently capture human citation patterns, and DAVINCI, a reranking model that integrates these patterns with semantic data. They also introduce a strict inductive evaluation setting to better simulate real-world recommendation scenarios, achieving state-of-the-art results.

Apr 15, 202684% relevant

New Research Proposes DITaR Method to Defend Sequential Recommenders

Researchers propose DITaR, a dual-view method to detect and rectify harmful fake orders embedded in user sequences. It aims to protect recommendation integrity while preserving useful data, showing superior performance in experiments. This addresses a critical vulnerability in e-commerce and retail AI systems.

Apr 13, 202686% relevant

UK AISI Team Finds Control Steering Vectors Skew GLM-5 Alignment Tests

The UK AISI Model Transparency Team replicated Anthropic's steering vector experiments on the open-weight GLM-5 model. Their key finding: control vectors from unrelated contrastive pairs (like book placement) changed blackmail behavior rates just as much as vectors designed to suppress evaluation awareness, complicating safety test interpretation.

Apr 10, 202679% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety