survey
30 articles about survey in AI news
AI Memory Survey: Three Systems Needed for Human-Like Recall
A new survey paper proposes that modern AI requires three distinct memory systems—parametric, retrieval, and agent memory—to achieve human-like cognition, highlighting control as the key bottleneck.
40-Author Survey Unveils 'Levels × Laws' Framework for Agent World Models
A 40-author survey introduces a 'levels × laws' framework for world models in AI agents, spanning 3 capability levels and 4 law regimes, synthesizing 400+ works. It provides a shared vocabulary for designing and evaluating world models across traditionally siloed research communities.
Anthropic Survey: 81,000 People Rank AI Economic Hopes & Fears
Anthropic published new research analyzing the economic hopes and worries expressed by 81,000 people in a prior survey on AI. The findings aim to guide AI development toward public priorities.
Fortune Survey: 29% of Workers Admit to Sabotaging Company AI Plans
A Fortune survey finds 29% of workers admit to sabotaging company AI initiatives, a figure that rises to 44% among Gen Z. This exposes a critical human-factor challenge in enterprise AI adoption beyond technical hurdles.
Omar Saadoun's PaperWiki AI Agents Now Generate Personalized Research Surveys
Omar Saadoun announced that his PaperWiki platform now uses AI agents to generate personalized survey papers from a user's LLM-generated knowledge base. These surveys are self-improving and update automatically as new papers are published.
Survey Paper 'The Latent Space' Maps Evolution from Token Generation to Latent Computation in Language Models
Researchers have published a comprehensive survey charting the evolution of language model architectures from token-level autoregression to methods that perform computation in continuous latent spaces. This work provides a unified framework for understanding recent advances in reasoning, planning, and long-context modeling.
AI Adoption Saves Average US Worker 2.5 Hours Weekly, New Survey Shows
A new survey finds the average American worker using AI reports saving 2.5 hours per week, a 6% time reduction. Early data suggests these time savings may be translating into broader productivity growth.
IBM Research Survey Proposes Framework for Optimizing LLM Agent Workflows
IBM researchers published a comprehensive survey categorizing approaches to LLM agent workflow optimization along three dimensions: when structure is determined, which components get optimized, and what signals guide optimization.
Pseudo Label NCF: A Novel Approach to Cold-Start Recommendation Using Survey Data and Dual Embeddings
New research introduces Pseudo Label NCF, a method that enhances Neural Collaborative Filtering for extreme data sparsity. It uses survey-derived 'pseudo labels' to create dual embedding spaces, improving ranking accuracy while revealing a trade-off between embedding separability and performance.
Duke CFO Survey: AI Impact Targets Clerical & Admin Work First, Not Broader Workforce
A Duke University survey of 400 U.S. CFOs finds AI is beginning to reduce clerical and administrative roles, while broader workforce impacts remain limited. The data suggests a targeted, phased adoption pattern rather than immediate mass displacement.
Anthropic Captures 73% of Enterprise AI Spend, OpenAI Drops to 26% According to Industry Survey
A survey of enterprise AI spending shows a dramatic shift, with Anthropic now commanding 73% of budget allocation compared to OpenAI's 26%. This represents a near-total reversal from OpenAI's previous market dominance.
Anthropic's Claude User Survey Draws 81,000 Responses in One Week
Anthropic conducted a qualitative survey of Claude users, receiving nearly 81,000 responses in one week. The company describes it as the largest study of its kind on AI use, dreams, and fears.
Survey Benchmarks Four Approaches to Synthetic Brain Signal Generation for BCI Data Scarcity
A comprehensive survey categorizes and benchmarks four methodological approaches to generating synthetic brain signals for BCIs, addressing data scarcity and privacy constraints. The authors provide an open-source codebase for comparing knowledge-based, feature-based, model-based, and translation-based generative algorithms.
Survey: 40% of Non-Managers Say AI Saves Them No Time at Work
A Guardian report highlights a growing divide: 92% of executives say AI makes them more productive, while 40% of non-managers report it saves them no time, creating a 'workslop' tax.
arXiv Survey Maps KV Cache Optimization Landscape: 5 Strategies for Million-Token LLM Inference
A comprehensive arXiv review categorizes five principal KV cache optimization techniques—eviction, compression, hybrid memory, novel attention, and combinations—to address the linear memory scaling bottleneck in long-context LLM inference. The analysis finds no single dominant solution, with optimal strategy depending on context length, hardware, and workload.
Anthropic Survey of 80,508 Users Reveals AI's Dual Perception: Hope for Work & Growth, Fear of Unreliability & Job Loss
Anthropic's global study of 80,508 users finds people simultaneously hold hope and fear about AI. Top hopes center on work improvement and personal growth, while top concerns are unreliability, job loss, and reduced autonomy.
New CASIA Benchmark Exposes Fragmented Face Swapping Evaluation
CASIA researchers released a face swapping survey and benchmark on April 27, 2026, aiming to standardize evaluation across fragmented GAN and diffusion model methods.
Gallup: 50% of US Workers Now Use AI on the Job, Doubling Since 2023
A Gallup survey of nearly 24,000 US workers in Q1 2026 shows 50% now use AI at work, up from just 21% in 2023. This marks a critical mass for enterprise AI tools and signals a shift from experimentation to operational integration.
The Next Frontier for Self-Driving Cars: Teaching AI to Think Like a Human
A new survey argues that autonomous driving's biggest hurdle is no longer perception but a lack of robust reasoning. The integration of large language models offers a path forward but creates a critical tension between slow deliberation and split-second safety.
Beyond Sequence Generation: The Emergence of Agentic Reinforcement Learning for LLMs
A new survey paper argues that LLM reinforcement learning must evolve beyond narrow sequence generation to embrace true agentic capabilities. The research introduces a comprehensive taxonomy for agentic RL, mapping environments, benchmarks, and frameworks shaping this emerging field.
Claude Code /goal Uses Haiku Evaluator, Runs Unattended Until Condition Met
Claude Code /goal runs unattended until a condition is met, using Haiku evaluator. Agent View manages multiple background sessions. Requires v2.1.139.
CLAUDE.md Wastes 7K+ Tokens Per Turn; Skills Cut to 50
A 1,000-line CLAUDE.md burns 7,000-10,000 tokens per turn on instructions the model already knows. Skills using progressive disclosure cut that to ~50 tokens.
Codex Hits ChatGPT Mobile App, Unlocks AI Coding on iOS/Android
Codex lands in ChatGPT mobile app. The code-generation tool was desktop-only since early 2025. First reported by @kimmonismus.
Almanac: Open-Source Wiki Auto-Updates From Claude Code Chats
Almanac auto-generates a markdown wiki from Claude Code chats and repo history, solving the agent context gap. Free open-source tool, MacOS-only.
Prithvi-EO Fails Cross-Country Crop Yield Generalization, Paper Shows
Prithvi-EO and ViT-Base embeddings yield universally negative R² under cross-country maize yield prediction, failing to beat traditional spectral features due to yield distribution shift.
TikTok Brain Has an EEG Signature: Frontal Theta Drops 0.395
Zhejiang University EEG study finds 0.395 correlation between short-video addiction and suppressed frontal-lobe theta waves during attention tasks, indicating algorithmic engagement optimization dampens executive control.
Four years of AI coding: speed gains, cognitive atrophy, rate-limit panic
A developer's four-year account shows speed gains from Copilot to Cursor to Claude Code, but reveals cognitive atrophy and rate-limit dependency as hidden costs.
Simple Graph Heuristic Beats Generative Recommenders on 10 of 14 Benchmarks
A no-training graph heuristic beats generative recommenders on 10 of 14 benchmarks, exposing shortcut-solvable datasets. Relative NDCG@10 gains hit 44% on Amazon CDs.
Two-Tower vs Vector DB + LLM: Which Wins for RecSys at Scale?
Two-tower models offer sub-10ms latency for cold-start; vector DB + LLM provides richer semantics. Hybrid architectures reduce churn by 15-20%.
NJ Voters Back AI Data Center Bans as NIMBY Push Grows
54% of New Jersey voters support banning AI data centers, per a Jan 2026 poll. Nationally, 49% oppose local construction, threatening hyperscaler buildout plans.