nlp

30 articles about NLP in AI news

VMLOps Publishes NLP Engineer System Design Interview Guide

VMLOps has published 'The NLP Engineer's System Design Interview Guide,' a detailed resource covering architecture, scaling, and trade-offs for real-world NLP systems. It provides a structured framework for both interviewers and candidates.

Apr 20, 2026 · 75% relevant

Talkie: Vintage LLM Trained on 260B Pre-1931 English Tokens

Talkie is a new 'vintage language model' trained on 260 billion tokens of historical English text from before 1931, developed by a team including Alec Radford, co-author of the original GPT paper. It offers a unique linguistic artifact for NLP research.

Apr 28, 2026 · 85% relevant

LIDS Framework Revolutionizes LLM Summary Evaluation with Statistical Rigor

Researchers introduce LIDS, a method that combines BERT embeddings, singular value decomposition (SVD), and statistical inference to evaluate LLM-generated summaries with a focus on accuracy and interpretability. The framework provides layered theme analysis with controlled false discovery rates, addressing a critical gap in NLP assessment.

Mar 3, 2026 · 75% relevant
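
The blurb does not say which procedure LIDS uses to control the false discovery rate. As a rough illustration of what "controlled false discovery rate" means in practice, here is a minimal sketch of the standard Benjamini-Hochberg step-up procedure. The function name and the example p-values are assumptions for illustration and are not taken from the paper.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans: True where the hypothesis is rejected.

    Standard Benjamini-Hochberg step-up procedure. Shown only as an
    illustration of FDR control; the LIDS paper may use a different method.
    """
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k such that p_(k) <= alpha * k / m.
    max_k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= alpha * rank / m:
            max_k = rank
    # Reject every hypothesis ranked at or below that k.
    rejected = [False] * m
    for idx in order[:max_k]:
        rejected[idx] = True
    return rejected


if __name__ == "__main__":
    # Hypothetical p-values, one per candidate theme.
    print(benjamini_hochberg([0.001, 0.008, 0.039, 0.041, 0.6]))
```

Because the procedure is step-up, a p-value that misses its own threshold can still be rejected when a larger p-value passes a later threshold.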

Kering Deploys AI-Powered Sustainable Sourcing Assistant on Google Cloud

Kering launched a Sustainable Sourcing Assistant on Google Cloud's Vertex AI. The tool helps luxury brands like Gucci and Saint Laurent evaluate materials for environmental and social impact, advancing sustainability in procurement.

Jul 3, 2026 · 72% relevant

Canada Deploys Grace Blackwell via $220M Bell-Cohere Deal

Canada's $220M Bell-Cohere deal puts Grace Blackwell on domestic soil for sovereign AI, reducing reliance on US cloud providers.

Jun 20, 2026 · 95% relevant

AI could unlock €320 billion for European retail, new analysis finds

A new fashionunited.uk analysis estimates AI could unlock up to €320 billion for European retail. The figure underscores AI's potential in automation, personalization, and supply chain optimization across the sector.

Jun 17, 2026 · 98% relevant

Google Open-Sources DiffusionGemma, 26B Model Hits 1K Tokens/Sec on H100

Google open-sourced DiffusionGemma, a 26B-parameter diffusion text model that reaches 1,000 tokens/sec on an H100. That is 4x faster than autoregressive models, at the cost of lower quality.

Jun 10, 2026 · 100% relevant

Instacart's Semantic IDs: Product Understanding at Scale

Instacart's engineering team details a semantic ID system for product understanding at scale, using embeddings to create meaningful identifiers that enhance search and recommendations. This approach captures nuanced product relationships, improving relevance for grocery e-commerce.

Jun 2, 2026 · 100% relevant

Alibaba + Nanjing Univ Claim 9.36X Faster Million-Token Prefill vs FlashAttention-2

Alibaba and Nanjing University claim 9.36x faster million-token prefill than FlashAttention-2, targeting the key bottleneck in long-context LLM inference.

May 25, 2026 · 85% relevant

ModelBest Drops BitCPM-CANN: First 1.58-bit LLM on Ascend 910B

ModelBest released BitCPM-CANN, the first 1.58-bit ternary LLM on Ascend 910B NPUs, using 6× less VRAM than BF16 with minimal capability loss.

May 24, 2026 · 87% relevant
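
For context on the "1.58-bit" figure: a ternary weight takes one of three values, so it carries log2(3) ≈ 1.58 bits of information. The sketch below works through the weights-only arithmetic against BF16 (16 bits per weight). The 8-billion parameter count is a hypothetical placeholder, not the size of BitCPM-CANN, and the helper name is an assumption.

```python
import math

# Information content of one ternary weight with values in {-1, 0, +1}.
BITS_PER_TERNARY = math.log2(3)  # about 1.585
BITS_PER_BF16 = 16


def weight_gib(num_params, bits_per_weight):
    """Memory for the weights alone, in GiB (ignores activations and KV cache)."""
    return num_params * bits_per_weight / 8 / 2**30


if __name__ == "__main__":
    params = 8_000_000_000  # hypothetical model size, for illustration only
    bf16 = weight_gib(params, BITS_PER_BF16)
    ternary = weight_gib(params, BITS_PER_TERNARY)
    print(f"BF16 weights:    {bf16:.2f} GiB")
    print(f"Ternary weights: {ternary:.2f} GiB")
    print(f"Weights-only ratio: {bf16 / ternary:.2f}x")
```

The ideal weights-only ratio is about 10x, so a reported 6x saving in total VRAM is plausible once memory that is not ternary-packed is counted. Which components account for the difference here is not stated in the blurb.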

MLLM Raters Show Central Tendency Bias in Clinical Scoring

A study finds that GPT-5 and other MLLMs show central tendency bias in clinical scoring, compressing predictions toward the scale midpoint despite prompt modifications.

May 19, 2026 · 70% relevant
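
One simple way to quantify this kind of compression is to regress the model's scores on the reference scores: a slope well below 1 means high cases are scored too low and low cases too high. The sketch below is an illustrative diagnostic under that assumption, not the metric used in the study, and the function name and example scores are hypothetical.

```python
def compression_slope(true_scores, predicted_scores):
    """Least-squares slope of predictions regressed on true scores.

    A slope near 1 means predictions span the scale like the references do;
    a slope well below 1 suggests compression toward the middle.
    """
    n = len(true_scores)
    if n != len(predicted_scores) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_t = sum(true_scores) / n
    mean_p = sum(predicted_scores) / n
    cov = sum((t - mean_t) * (p - mean_p)
              for t, p in zip(true_scores, predicted_scores))
    var_t = sum((t - mean_t) ** 2 for t in true_scores)
    if var_t == 0:
        raise ValueError("true scores must not all be identical")
    return cov / var_t


if __name__ == "__main__":
    # Hypothetical 0-4 clinical scale: the rater avoids both extremes.
    reference = [0, 1, 2, 3, 4]
    rater = [1.0, 1.5, 2.0, 2.5, 3.0]
    print(compression_slope(reference, rater))
```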

Simple Graph Heuristic Beats Generative Recommenders on 10 of 14 Benchmarks

A no-training graph heuristic beats generative recommenders on 10 of 14 benchmarks, exposing shortcut-solvable datasets. Relative NDCG@10 gains hit 44% on Amazon CDs.

May 11, 2026 · 100% relevant
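
For readers unfamiliar with the metric, NDCG@10 scores the top 10 positions of a ranked list, discounting hits logarithmically by rank and normalizing by the best possible ordering. The sketch below uses the common linear-gain definition. The benchmark's exact evaluation protocol is not given in the blurb, and the numbers in the usage example are hypothetical.

```python
import math


def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the first k ranked items."""
    return sum(rel / math.log2(rank + 2)
               for rank, rel in enumerate(relevances[:k]))


def ndcg_at_k(relevances, k=10):
    """NDCG@k for one ranked list of relevance labels (0 if nothing is relevant)."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


def relative_gain(new, baseline):
    """Relative improvement of `new` over `baseline`, as a fraction."""
    return (new - baseline) / baseline


if __name__ == "__main__":
    # Single held-out item ranked second in the list.
    print(ndcg_at_k([0, 1, 0, 0]))
    # Hypothetical scores: 0.050 -> 0.072 is a 44% relative gain.
    print(relative_gain(0.072, 0.050))
```

Note that a 44% relative gain can correspond to a small absolute difference when the baseline NDCG is low.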

New CASIA Benchmark Exposes Fragmented Face Swapping Evaluation

CASIA researchers released a face swapping survey and benchmark on April 27, 2026, aiming to standardize evaluation across fragmented GAN and diffusion model methods.

May 5, 2026 · 74% relevant

Meta Tuna-2: Encoder-Free Multimodal Model Beats VAE-Based Rivals

Meta released Tuna-2, an encoder-free multimodal model that understands and generates images from raw pixels. It beats encoder-based models on fine-grained perception benchmarks, challenging the dominant VAE/vision encoder paradigm.

Apr 28, 2026 · 90% relevant

R³AG: A New Routing Framework That Matches Queries to Retriever

R³AG is a novel routing framework that dynamically selects the optimal retriever for each query in RAG systems, considering not just relevance but also how well the retrieved document helps the generator produce correct answers. It uses contrastive learning to model query-specific preferences, consistently outperforming existing methods on knowledge-intensive tasks.

Apr 28, 2026 · 78% relevant

ERA Framework Improves RAG Honesty by Modeling Knowledge Conflicts as

ERA replaces scalar confidence scores with explicit evidence distributions to distinguish between uncertainty and ambiguity in RAG systems, improving abstention behavior and calibration.

Apr 24, 2026 · 88% relevant

New MoE Framework Tames User Interest Shifts in Long-Sequence Recommendations

Researchers propose MoS, a model-agnostic MoE approach that handles long user sequences by detecting session hopping – where user interests shift across sessions. The theme-aware routing mechanism filters irrelevant sessions, while multi-scale fusion captures global and local patterns. Results show SOTA on benchmarks with fewer FLOPs than alternatives.

Apr 24, 2026 · 94% relevant

ESGLens: A New RAG Framework for Automated ESG Report Analysis and Score

ESGLens combines RAG with prompt engineering to extract structured ESG data, answer questions, and predict scores. Evaluated on ~300 reports, it achieved a Pearson correlation of 0.48 against LSEG scores. The paper highlights promise but also significant limitations.

Apr 23, 2026 · 82% relevant
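
To put the 0.48 figure in context, the Pearson coefficient measures linear agreement between predicted and reference scores, and its square gives the share of variance a linear fit explains (0.48² ≈ 0.23). A minimal sketch of the computation follows; the function name and the sample scores are hypothetical and are not data from the paper.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if norm_x == 0 or norm_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (norm_x * norm_y)


if __name__ == "__main__":
    # Hypothetical predicted vs. reference ESG scores.
    predicted = [55, 62, 48, 70, 66]
    reference = [60, 58, 50, 81, 59]
    r = pearson(predicted, reference)
    print(f"r = {r:.2f}, r^2 = {r * r:.2f}")
```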

AI Turned Thrift Into a Profitable Fashion Machine

The article details how AI technologies are being deployed in the thrift and resale fashion industry to automate critical operations like pricing, authentication, and inventory management, turning a traditionally labor-intensive sector into a scalable, data-driven profit engine.

Apr 21, 2026 · 100% relevant

RoTE: A New Plug-and-Play Module to Sharpen Time-Aware Sequential

A new research paper introduces RoTE, a multi-level temporal embedding module for sequential recommenders. It explicitly models the time spans between user interactions, a factor often overlooked, leading to significant performance gains on standard benchmarks.

Apr 16, 2026 · 82% relevant

ETH Zurich & Anthropic AI Links Anonymous Accounts via Writing Style

Researchers built an AI that identifies authors from anonymous accounts by analyzing writing style. It achieved over 80% accuracy, raising significant privacy concerns for online anonymity.

Apr 16, 2026 · 89% relevant

Binghamton University Tests Robotic Guide Dog with Natural Language Interface

Researchers at Binghamton University have developed a robotic guide dog prototype that communicates with users using natural language. The system, built on a Unitree Go2 platform, was demonstrated navigating a user through a test environment.

Apr 15, 2026 · 85% relevant

AI Reshapes Luxury Travel—But Human Expertise Remains Essential

A new report highlights how AI is being integrated into luxury travel for personalized itineraries, predictive service, and backend operations. However, the consensus is that AI should augment, not replace, the human expertise and emotional intelligence that define true luxury service.

Apr 14, 2026 · 80% relevant

AI-Powered Password Leak Detection: A Critical Security Shift

Security experts are leveraging AI to detect when user passwords appear in data breaches, enabling immediate alerts. This shifts the security paradigm from periodic manual checks to continuous, automated monitoring.

Apr 13, 2026 · 85% relevant

AI-Based Recommendation System Market Projected to Reach $34.4 Billion by 2033

A market analysis projects the AI-based recommendation system sector will grow significantly, reaching a valuation of USD 34.4 billion by 2033. This underscores the technology's transition from a nice-to-have feature to a core, high-value component of digital business strategy.

Apr 13, 2026 · 96% relevant

Japan's Labor Crisis Drives AI Adoption to Offset 15M Worker Shortfall

Facing a 14-year population decline and a projected shortfall of 15 million workers, Japan's AI strategy is fundamentally different: automation is a necessity for survival, not a tool for efficiency.

Apr 12, 2026 · 85% relevant

Google Releases TIPSv2 Vision Encoder for Multi-Task Dense Prediction

Google has released the TIPSv2-B/14 vision encoder model on Hugging Face. It performs three dense prediction tasks—depth estimation, surface normal prediction, and semantic segmentation—from a single backbone.

Apr 11, 2026 · 85% relevant

Google's MCP Toolbox Connects AI Agents to 20+ Databases in <10 Lines

Google released MCP Toolbox, an open-source server that connects AI agents to enterprise databases like Postgres and BigQuery using plain English. It requires less than 10 lines of code and works with LangChain, LlamaIndex, and any MCP-compatible client.

Apr 10, 2026 · 95% relevant

Is the Future of Shopping Hiding Inside Luxury Hotels?

The Business of Fashion examines the emerging trend where luxury hotels are transforming into sophisticated retail environments. This represents a strategic shift in how luxury brands reach affluent consumers in curated, experiential settings.

Apr 10, 2026 · 72% relevant

Kuaishou's Dual-Rerank: A New Industrial Framework for High-Stakes

Researchers from Kuaishou introduce Dual-Rerank, a framework designed for industrial-scale generative reranking. It addresses the dual dilemma of structural trade-offs (AR vs. NAR models) and optimization gaps (SL vs. RL) through Sequential Knowledge Distillation and List-wise Decoupled Reranking Optimization. A/B tests on production traffic show significant improvements in user satisfaction and watch time with reduced latency.

Apr 10, 2026 · 82% relevant
