survey

30 articles about survey in AI news

239-Paper Survey Maps How AI Agents Self-Improve via Scaffold Updates

A survey of 239 papers shows 68% of AI agent self-improvement methods focus on scaffold updates rather than model retraining, raising evaluation quality concerns.

Jul 19, 202685% relevant

100+ Papers Surveyed: LLMs' Metacognition Gap

A systematic survey of 100+ papers reveals gaps in LLM metacognition, including 10-30% miscalibration in top models like GPT-4 and Claude 3.

Jul 19, 202675% relevant

World Action Models Survey Unifies 100+ Methods Under One Taxonomy

A survey reviews 100+ world action models, unifying world models, video generation, and VLA policies under one taxonomy.

Jun 27, 202687% relevant

111-Page Survey Maps 5 AGI Levels: Responder to Ecosystem

111-page survey from US/China labs defines 5 AGI levels, argues epistemic exploration — not better answering — is key. Challenges scaling orthodoxy.

Jun 9, 202694% relevant

YouGov Survey: Clothing Shoppers Show Resistance to AI Tools for Product

YouGov survey reports clothing shoppers resistant to AI tools for product discovery. This challenges retail AI strategies, signaling need for consumer education and trust-building.

Jun 5, 202694% relevant

Meta-Stanford Survey: Code as Agent Harness Improves AI Reasoning

Meta, Stanford, Illinois survey argues AI agents work better with code as their main working layer, calling it an agent harness.

May 25, 202689% relevant

AI Memory Survey: Three Systems Needed for Human-Like Recall

A new survey paper proposes that modern AI requires three distinct memory systems—parametric, retrieval, and agent memory—to achieve human-like cognition, highlighting control as the key bottleneck.

Apr 28, 202680% relevant

40-Author Survey Unveils 'Levels × Laws' Framework for Agent World Models

A 40-author survey introduces a 'levels × laws' framework for world models in AI agents, spanning 3 capability levels and 4 law regimes, synthesizing 400+ works. It provides a shared vocabulary for designing and evaluating world models across traditionally siloed research communities.

Apr 27, 202685% relevant

Anthropic Survey: 81,000 People Rank AI Economic Hopes & Fears

Anthropic published new research analyzing the economic hopes and worries expressed by 81,000 people in a prior survey on AI. The findings aim to guide AI development toward public priorities.

Apr 22, 202685% relevant

Fortune Survey: 29% of Workers Admit to Sabotaging Company AI Plans

A Fortune survey finds 29% of workers admit to sabotaging company AI initiatives, a figure that rises to 44% among Gen Z. This exposes a critical human-factor challenge in enterprise AI adoption beyond technical hurdles.

Apr 13, 202685% relevant

Omar Saadoun's PaperWiki AI Agents Now Generate Personalized Research Surveys

Omar Saadoun announced that his PaperWiki platform now uses AI agents to generate personalized survey papers from a user's LLM-generated knowledge base. These surveys are self-improving and update automatically as new papers are published.

Apr 10, 202685% relevant

Survey Paper 'The Latent Space' Maps Evolution from Token Generation to Latent Computation in Language Models

Researchers have published a comprehensive survey charting the evolution of language model architectures from token-level autoregression to methods that perform computation in continuous latent spaces. This work provides a unified framework for understanding recent advances in reasoning, planning, and long-context modeling.

Apr 3, 202685% relevant

AI Adoption Saves Average US Worker 2.5 Hours Weekly, New Survey Shows

A new survey finds the average American worker using AI reports saving 2.5 hours per week, a 6% time reduction. Early data suggests these time savings may be translating into broader productivity growth.

Mar 30, 202685% relevant

IBM Research Survey Proposes Framework for Optimizing LLM Agent Workflows

IBM researchers published a comprehensive survey categorizing approaches to LLM agent workflow optimization along three dimensions: when structure is determined, which components get optimized, and what signals guide optimization.

Mar 27, 202699% relevant

Pseudo Label NCF: A Novel Approach to Cold-Start Recommendation Using Survey Data and Dual Embeddings

New research introduces Pseudo Label NCF, a method that enhances Neural Collaborative Filtering for extreme data sparsity. It uses survey-derived 'pseudo labels' to create dual embedding spaces, improving ranking accuracy while revealing a trade-off between embedding separability and performance.

Mar 27, 202676% relevant

Duke CFO Survey: AI Impact Targets Clerical & Admin Work First, Not Broader Workforce

A Duke University survey of 400 U.S. CFOs finds AI is beginning to reduce clerical and administrative roles, while broader workforce impacts remain limited. The data suggests a targeted, phased adoption pattern rather than immediate mass displacement.

Mar 26, 202687% relevant

Survey Benchmarks Four Approaches to Synthetic Brain Signal Generation for BCI Data Scarcity

A comprehensive survey categorizes and benchmarks four methodological approaches to generating synthetic brain signals for BCIs, addressing data scarcity and privacy constraints. The authors provide an open-source codebase for comparing knowledge-based, feature-based, model-based, and translation-based generative algorithms.

Mar 16, 202684% relevant

Survey: 40% of Non-Managers Say AI Saves Them No Time at Work

A Guardian report highlights a growing divide: 92% of executives say AI makes them more productive, while 40% of non-managers report it saves them no time, creating a 'workslop' tax.

Apr 14, 202685% relevant

arXiv Survey Maps KV Cache Optimization Landscape: 5 Strategies for Million-Token LLM Inference

A comprehensive arXiv review categorizes five principal KV cache optimization techniques—eviction, compression, hybrid memory, novel attention, and combinations—to address the linear memory scaling bottleneck in long-context LLM inference. The analysis finds no single dominant solution, with optimal strategy depending on context length, hardware, and workload.

Mar 24, 202695% relevant

Anthropic Survey of 80,508 Users Reveals AI's Dual Perception: Hope for Work & Growth, Fear of Unreliability & Job Loss

Anthropic's global study of 80,508 users finds people simultaneously hold hope and fear about AI. Top hopes center on work improvement and personal growth, while top concerns are unreliability, job loss, and reduced autonomy.

Mar 18, 202687% relevant

90 Hours of Black Myth: Wukong Fuel New World Model Benchmark

A new survey and benchmark rethinks interactive world models as game engines, with a data engine collecting over 90 hours of Black Myth: Wukong gameplay.

Jul 19, 202678% relevant

New CASIA Benchmark Exposes Fragmented Face Swapping Evaluation

CASIA researchers released a face swapping survey and benchmark on April 27, 2026, aiming to standardize evaluation across fragmented GAN and diffusion model methods.

May 5, 202674% relevant

Gallup: 50% of US Workers Now Use AI on the Job, Doubling Since 2023

A Gallup survey of nearly 24,000 US workers in Q1 2026 shows 50% now use AI at work, up from just 21% in 2023. This marks a critical mass for enterprise AI tools and signals a shift from experimentation to operational integration.

Apr 20, 202695% relevant

The Next Frontier for Self-Driving Cars: Teaching AI to Think Like a Human

A new survey argues that autonomous driving's biggest hurdle is no longer perception but a lack of robust reasoning. The integration of large language models offers a path forward but creates a critical tension between slow deliberation and split-second safety.

Mar 13, 202681% relevant

Beyond Sequence Generation: The Emergence of Agentic Reinforcement Learning for LLMs

A new survey paper argues that LLM reinforcement learning must evolve beyond narrow sequence generation to embrace true agentic capabilities. The research introduces a comprehensive taxonomy for agentic RL, mapping environments, benchmarks, and frameworks shaping this emerging field.

Mar 7, 202685% relevant

Open-Source Course Shows Harness, Not Model, Lifts Coding Agent 25 Places

Open-source course shows harness engineering, not model swap, moved a coding agent from ~30th to top 5 on Terminal-Bench. Course builds Decode from scratch.

Jul 23, 202685% relevant

Lilian Weng Argues Harness Design, Not Model Rewrites, Is Path to RSI

Lilian Weng argues RSI starts with harness design, not model rewrites, citing Sakana AI's The AI Scientist in Nature 2026 and two other projects.

Jul 7, 202694% relevant

Commerce Media Leaders Are Building for an Agentic Future

eMarketer reports commerce media leaders are building AI agent infrastructure to automate ad buying and personalization. This shift could reduce manual campaign management by 40% and boost ROI by 25% for retail media networks.

Jul 6, 202684% relevant

Generative AI Usage Trends & Statistics Report by eMarketer

eMarketer's report reveals enterprise GenAI adoption hit 62%, with retail at 38%. Barriers include privacy and integration, but use cases like personalized marketing and inventory management are emerging.

Jul 5, 202662% relevant

Why Traditional Retail Metrics Break Down in Agentic Commerce

Valtech's 2026 research shows 96% of retailers face integration barriers, 48% are stuck in AI pilot purgatory, and nearly 75% can't link AI spend to metrics, as agentic commerce fragments customer journeys beyond traditional measurement frameworks.

Jun 23, 2026100% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety