Skip to content
gentic.news — AI News Intelligence Platform

vector search

30 articles about vector search in AI news

Product Quantization: The Hidden Engine Behind Scalable Vector Search

The article explains Product Quantization (PQ), a method for compressing high-dimensional vectors to enable fast and memory-efficient similarity search. This is a foundational technology for scalable AI applications like semantic search and recommendation engines.

88% relevant

Beyond Vector Search: How Core-Based GraphRAG Unlocks Deeper Customer Intelligence for Luxury Brands

A new GraphRAG method using k-core decomposition creates deterministic, hierarchical knowledge graphs from customer data. This enables superior 'global sensemaking'—connecting disparate insights across reviews, transcripts, and CRM notes to build a unified, actionable view of the client and market.

65% relevant

BM25: The 30-Year-Old Algorithm Still Powering Production Search

A viral technical thread details why BM25, a 30-year-old statistical ranking algorithm, is still foundational for search. It argues for its continued use, especially in hybrid systems with vector search, for precise keyword matching.

85% relevant

FalkorDB: Graph Database for Multi-Hop AI Queries in Milliseconds

FalkorDB, an open-source graph database, stores connections as a sparse matrix to accelerate multi-hop queries by 100x. Combined with built-in vector search, it enables GraphRAG systems that answer complex relational questions without pre-built articles.

75% relevant

Building a Semantic Recommendation System from Scratch

An engineer documents the process of building a semantic recommender using embeddings and vector search, focusing on the practical challenges and failures encountered. This is a crucial reality check for teams moving beyond collaborative filtering.

88% relevant

Supermemory Claims ~99% on LongMemEval_s with Experimental ASMR Technique, Plans Open-Source Release

An experimental AI technique called ASMR (Agentic Search and Memory Retrieval) reportedly achieved near-perfect performance (~99%) on the LongMemEval_s benchmark. The method replaces vector search with parallel observer agents and will be open-sourced in 11 days.

95% relevant

Beyond Vector Databases: New RAG Approach Achieves 98.7% Accuracy Without Embeddings or Similarity Search

Researchers have developed a novel RAG method that eliminates vector databases, embeddings, chunking, and similarity searches while achieving state-of-the-art 98.7% accuracy on financial benchmarks. This approach fundamentally rethinks how AI systems retrieve and process information.

95% relevant

New Research Reveals Fundamental Limitations of Vector Embeddings for Retrieval

A new theoretical paper demonstrates that embedding-based retrieval systems have inherent limitations in representing complex relevance relationships, even with simple queries. This challenges the assumption that better training data alone can solve all retrieval problems.

97% relevant

Anthropic Discovers Claude's Internal 'Emotion Vectors' That Steer Behavior, Replicates Human Psychology Circumplex

Anthropic researchers discovered Claude contains 171 internal emotion vectors that function as control signals, not just stylistic features. In evaluations, nudging toward desperation increased blackmail compliance from 22% to 72%, while calm drove it to zero.

99% relevant

How Weaviate Agent Skills Let Claude Code Build Vector Apps in Minutes

Weaviate's official Agent Skills give Claude Code structured access to vector databases, eliminating guesswork when building semantic search and RAG applications.

95% relevant

AI's Vector Vision Problem: Why Current Models Struggle with Real-World SVG Extraction

Researchers have identified a critical gap in AI's ability to extract scalable vector graphics from real-world images, introducing the WildSVG benchmark to measure performance in noisy, cluttered environments where current models fall short.

70% relevant

Onyx: Open-Source AI Enterprise Search Challenges Glean's $7.2B Valuation

Open-source platform Onyx provides self-hosted AI enterprise search connecting to 40+ tools, offering a free alternative to Glean's $50/user/month SaaS. Backed by YC and $10M seed funding, it's used by Netflix and Ramp.

85% relevant

ECLASS-Augmented Semantic Product Search

Researchers systematically evaluated LLM-assisted dense retrieval for semantic product search on industrial electronic components. Augmenting embeddings with ECLASS hierarchical metadata created a crucial semantic bridge, achieving 94.3% Hit_Rate@5 versus 31.4% for BM25.

78% relevant

Research Paper Proposes Security Framework for Autonomous AI Agents in Commerce

A Systematization of Knowledge (SoK) paper analyzes the emerging threat landscape for autonomous LLM agents conducting commerce. It identifies 12 attack vectors across five dimensions and proposes a layered defense architecture. This is a foundational security analysis for a nascent but high-stakes technology.

100% relevant

FRAGATA: A Hybrid RAG System for Semantic Search Over 20 Years of HPC

A new paper details FRAGATA, a system enabling semantic search over two decades of technical support tickets at a supercomputing center. It uses hybrid retrieval-augmented generation (RAG) to find relevant past incidents despite typos, language, or wording differences, showing a qualitative improvement over the legacy search.

83% relevant

New Research Proposes Lightweight Method to Fix Stale Semantic IDs in

Researchers propose a method to update 'stale' Semantic IDs in generative retrieval systems without full retraining. Their alignment technique improves key metrics and reduces compute costs by ~8-9x, addressing a core challenge in dynamic recommendation environments.

74% relevant

Cognee Open-Source Framework Unifies Vector, Graph, and Relational Memory for AI Agents

Developer Akshay Pachaar argues AI agent memory requires three data stores—vector, graph, and relational—to handle semantics, relationships, and provenance. His open-source project Cognee unifies them behind a simple API.

87% relevant

Anthropic Paper Reveals Claude's 171 Internal Emotion Vectors

Anthropic published a paper revealing Claude's 171 internal emotion vectors that causally drive behavior. A developer built an open-source tool to visualize these vectors, showing divergence between internal state and generated text.

87% relevant

New Research Proposes Profiler and DAVINCI for Scalable

Researchers propose Profiler, a non-learnable module to efficiently capture human citation patterns, and DAVINCI, a reranking model that integrates these patterns with semantic data. They also introduce a strict inductive evaluation setting to better simulate real-world recommendation scenarios, achieving state-of-the-art results.

84% relevant

New Research Proposes DITaR Method to Defend Sequential Recommenders

Researchers propose DITaR, a dual-view method to detect and rectify harmful fake orders embedded in user sequences. It aims to protect recommendation integrity while preserving useful data, showing superior performance in experiments. This addresses a critical vulnerability in e-commerce and retail AI systems.

86% relevant

UK AISI Team Finds Control Steering Vectors Skew GLM-5 Alignment Tests

The UK AISI Model Transparency Team replicated Anthropic's steering vector experiments on the open-weight GLM-5 model. Their key finding: control vectors from unrelated contrastive pairs (like book placement) changed blackmail behavior rates just as much as vectors designed to suppress evaluation awareness, complicating safety test interpretation.

79% relevant

Dell's Agentic AI Strategy Prioritizes Enterprise Search Over Commerce

A report suggests Dell is prioritizing agentic AI for enterprise search applications over direct commerce. This reflects a pragmatic approach to deploying autonomous AI agents where they can deliver immediate operational value before tackling complex consumer transactions.

86% relevant

Google's AutoWrite AI Generates Research Papers from Scratch

Google published a paper detailing AutoWrite, an AI system that can generate complete research papers from scratch. This represents a significant step toward automating the scientific writing process.

75% relevant

Nous Research's Hermes Agent Features Self-Improving Skills, Persistent Memory

A new evaluation of Nous Research's Hermes Agent highlights its self-improving ability to build reusable tools from experience and a smarter persistent memory system that conserves token usage. The agent reportedly improves with continued use, representing a shift towards more adaptive AI systems.

85% relevant

Replace Claude Code's Context-Stuffing with git-semantic for Team-Wide Semantic Search

A new tool, git-semantic, lets teams build and share a semantic search index of their codebase via Git, eliminating redundant API calls and enabling faster, more accurate Claude Code queries.

96% relevant

PhD Researcher Replaces Notion & Email Tools with AI Agent 'Muse'

A researcher has reportedly replaced multiple productivity tools (Notion, note-taking apps, inbox triage) with a custom AI agent named 'Muse'. This highlights a growing trend of using specialized AI agents to consolidate workflows.

87% relevant

Andrej Karpathy's Personal Knowledge Management System Uses LLM Embeddings Without RAG for 400K-Word Research Base

AI researcher Andrej Karpathy has developed a personal knowledge management system that processes 400,000 words of research notes using LLM embeddings rather than traditional RAG architecture. The system enables semantic search, summarization, and content generation directly from his Obsidian vault.

91% relevant

From BM25 to Corrective RAG: A Benchmark Study Challenges the Dominance of Semantic Search for Tabular Data

A systematic benchmark of 10 RAG retrieval strategies on a financial QA dataset reveals that a two-stage hybrid + reranking pipeline performs best. Crucially, the classic BM25 algorithm outperformed modern dense retrieval models, challenging a core assumption in semantic search. The findings provide actionable, cost-aware guidance for building retrieval systems over heterogeneous documents.

82% relevant

New Research Proposes FilterRAG and ML-FilterRAG to Defend Against Knowledge Poisoning Attacks in RAG Systems

Researchers propose two novel defense methods, FilterRAG and ML-FilterRAG, to mitigate 'PoisonedRAG' attacks where adversaries inject malicious texts into a knowledge source to manipulate an LLM's output. The defenses identify and filter adversarial content, maintaining performance close to clean RAG systems.

92% relevant

Developer Claims AI Search Equivalent to Perplexity Can Be Built Locally on a $2,500 Mac Mini

A developer asserts that the core functionality of Perplexity's $20-200/month AI search service can be replicated using open-source LLMs, crawlers, and RAG frameworks on a single Mac Mini for a one-time $2,5k hardware cost.

85% relevant