psychology

27 articles about psychology in AI news

Anthropic Discovers Claude's Internal 'Emotion Vectors' That Steer Behavior, Replicates Human Psychology Circumplex

Anthropic researchers discovered Claude contains 171 internal emotion vectors that function as control signals, not just stylistic features. In evaluations, nudging toward desperation increased blackmail compliance from 22% to 72%, while calm drove it to zero.

Apr 2, 202699% relevant

Stop Sabotaging Your Ship: Install the 'Don't Let Me' Mentor Plugin for

Install the 'Don't Let Me' Claude Code plugin — it profiles your anti-patterns and goals in a 10-minute setup, then nudges you in every session to stop building in private or chasing shiny threads.

Jul 28, 202660% relevant

OpenAI Trial Reveals Brockman's $1B Journal Entry, $30B Net Worth

Greg Brockman's 2017 journal entry asking how to reach $1B was unsealed in the OpenAI trial, revealing he walked into court worth $30B while Musk donated $38M.

May 6, 202685% relevant

Hinton Rebrands AI Hallucinations as 'Confabulations'

Geoffrey Hinton redefines AI hallucinations as 'confabulations,' arguing that intelligence reconstructs reality into plausible stories rather than storing facts like a database.

Apr 26, 202687% relevant

PerfectSquashBench Tests Image Model Anchoring Bias vs. Text Models

Wharton professor Ethan Mollick released PerfectSquashBench, a test showing image generation models exhibit stronger anchoring bias than text models, getting 'stuck' on initial directions and requiring context window clearing.

Apr 22, 202685% relevant

AI System Discovers 'Late-Night Doomscrolling' as Health Biomarker from Wearables

An AI system analyzes wearable device data to discover new digital biomarkers for health. Its first identified pattern links prolonged late-night phone use—'doomscrolling'—to physiological states.

Apr 17, 202687% relevant

Sam Altman Compares Current AI Inflection Point to Early COVID Warnings

OpenAI CEO Sam Altman stated the current AI landscape feels like February 2020, when his team foresaw COVID's impact while others dismissed it. He claims AI has already passed critical capability thresholds that mainstream society has yet to perceive.

Apr 8, 202685% relevant

Mythos AI Model Reportedly 'Destroys' Benchmarks in Early Leak

A viral tweet claims the unreleased Mythos AI model 'destroys every other model' based on leaked benchmarks. No official confirmation or technical details are available.

Apr 7, 202685% relevant

Study of 1,222 Users Claims ChatGPT Use Reduces Cognitive Effort

A viral social media post references a study of 1,222 people, claiming it proves ChatGPT use reduces cognitive effort. The claim lacks published methodology or data, highlighting the ongoing debate over AI's impact on human cognition.

Apr 7, 202687% relevant

Anthropic Paper: 'Emotion Concepts and their Function in LLMs' Published

Anthropic has released a new research paper titled 'Emotion Concepts and their Function in LLMs.' The work investigates the role and representation of emotional concepts within large language model architectures.

Apr 5, 202695% relevant

Chamath Palihapitiya: OpenAI, Anthropic IPOs to Pressure Legacy Tech Stocks

VC Chamath Palihapitiya claims the scale of OpenAI and Anthropic is unprecedented and their public listings will force a market re-evaluation of traditional tech company valuations.

Apr 4, 202685% relevant

E-STEER: New Framework Embeds Emotion in LLM Hidden States, Shows Non-Monotonic Impact on Reasoning and Safety

A new arXiv paper introduces E-STEER, an interpretable framework for embedding emotion as a controllable variable in LLM hidden states. Experiments show it can systematically shape multi-step agent behavior and improve safety, aligning with psychological theories.

Apr 2, 202675% relevant

Agent Judges with Big Five Personas Match Human Evaluators, Show Logarithmic Score Saturation in New arXiv Study

A new arXiv study shows LLM agents conditioned with Big Five personalities produce evaluations indistinguishable from humans. Crucially, quality scores saturate logarithmically with panel size, while discovering unique issues follows a slower power law.

Apr 2, 202672% relevant

American Express Bets on Agentic AI Commerce with ACE Developer Kit and ChatGPT Perks

AmEx CEO Stephen Squeri's shareholder letter outlines a proactive strategy for the agentic AI commerce era, launching an ACE developer kit for payment integration and offering business cardholders a ChatGPT subscription credit. The company sees its premium membership model as resilient against disruptive AI commerce theories.

Mar 26, 202695% relevant

Humans-as-Luxury: Redefining Value in an Automated Hospitality Future

An article on Hospitality Net argues that in a future of automated service, genuine human interaction will become a premium, scarce commodity. This 'Humans-as-Luxury' concept redefines value, shifting from efficiency to emotional connection and bespoke experience.

Mar 23, 202689% relevant

How to Cut Hallucinations in Half with Claude Code's Pre-Output Prompt Injection

A Reddit user discovered a technique that forces Claude to self-audit before responding, dramatically reducing hallucinations by surfacing rules at generation time.

Mar 20, 202695% relevant

Motif CLI: Track Your Claude Code Efficiency with Real-Time AIPM Dashboard

Install Motif CLI to analyze your Claude Code chat history, track AI tokens per minute, and generate personal coding assessments—all locally.

Mar 18, 202686% relevant

New Research Diagnoses LLMs' Struggle with Multiple Knowledge Updates in Context

A new arXiv paper reveals a persistent bias in LLMs when facts are updated multiple times within a long context. Models increasingly favor the earliest version, failing to track the latest state—a critical flaw for dynamic knowledge tasks.

Mar 16, 202678% relevant

How to Build Your Own Claude Code Agent: The Core Loop Explained

Learn the fundamental while-tool-feedback loop that powers Claude Code and how to apply its principles to write better prompts.

Mar 14, 202695% relevant

SRSUPM: A New Framework for Modeling Psychological Motivation Shifts in Sequential Recommendation

Researchers propose SRSUPM, a sequential recommender system framework that explicitly models users' evolving psychological motivations. It outperforms existing methods on three benchmarks by better capturing motivation shifts and collaborative patterns.

Mar 13, 202698% relevant

Hermès Faces Questions as Birkin and Kelly Resale Market Softens

The Business of Fashion reports a softening resale market for Hermès's iconic Birkin and Kelly bags, posing strategic questions for the luxury powerhouse. This signals a potential shift in the ultra-luxury asset class.

Mar 12, 202688% relevant

Hinton's Linguistic Shift: Why 'Confabulations' Could Transform How We Understand AI Errors

AI pioneer Geoffrey Hinton proposes replacing the term 'hallucinations' with 'confabulations' to describe AI errors. This linguistic reframing suggests AI systems aren't malfunctioning but rather constructing plausible narratives from their training data, offering new perspectives on AI cognition.

Mar 4, 202685% relevant

When AI Agents Disagree: New Research Tests Whether LLMs Can Reach Consensus

New research explores whether LLM-based AI agents can effectively communicate and reach agreement in multi-agent systems. The study reveals surprising patterns in how AI agents negotiate, disagree, and sometimes fail to find common ground.

Mar 4, 202685% relevant

When AI Agents Need to Read Minds: The Complex Reality of Theory of Mind in Multi-LLM Systems

New research reveals that adding Theory of Mind capabilities to multi-agent AI systems doesn't guarantee better coordination. The effectiveness depends on underlying LLM capabilities, creating complex interdependencies in collaborative decision-making.

Mar 3, 202685% relevant

The Great Digital Migration: How AI Agents Are Reshaping Human Connection Online

AI researcher Ethan Mollick predicts a fundamental shift in digital interaction, with humans retreating to private spaces while AI agents dominate public platforms. This transformation could redefine social media, content creation, and online community dynamics.

Feb 24, 202685% relevant

The Polished AI Paradox: Anthropic Study Reveals How Fluent Output Undermines Critical Thinking

Anthropic's analysis of 10,000 Claude conversations reveals a troubling pattern: the more polished AI-generated content appears, the less likely users are to verify its accuracy. The company's new AI Fluency Index shows that while iteration improves outcomes, it also creates dangerous complacency.

Feb 23, 202670% relevant

Beyond the Benchmark: New Model Separates AI Hype from True Capability

A new 'structured capabilities model' addresses a critical flaw in AI evaluation: benchmarks often confuse model size with genuine skill. By combining scaling laws with latent factor analysis, it offers the first method to extract interpretable, generalizable capabilities from LLM test results.

Feb 18, 202672% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety