gentic.news — AI News Intelligence Platform

psychology

26 articles about psychology in AI news

Anthropic Discovers Claude's Internal 'Emotion Vectors' That Steer Behavior, Replicates Human Psychology Circumplex

Anthropic researchers discovered that Claude contains 171 internal emotion vectors that function as control signals, not just stylistic features. In evaluations, steering the model toward desperation increased blackmail compliance from 22% to 72%, while steering it toward calm drove compliance to zero.

99% relevant

OpenAI Trial Reveals Brockman's $1B Journal Entry, $30B Net Worth

Greg Brockman's 2017 journal entry asking how to reach $1B was unsealed in the OpenAI trial. He walked into court worth $30B, while Musk donated $38M.

85% relevant

Hinton Rebrands AI Hallucinations as 'Confabulations'

Geoffrey Hinton redefines AI hallucinations as 'confabulations,' arguing that intelligence reconstructs reality into plausible stories rather than storing facts like a database.

87% relevant

PerfectSquashBench Tests Image Model Anchoring Bias vs. Text Models

Wharton professor Ethan Mollick released PerfectSquashBench, a test showing that image generation models exhibit stronger anchoring bias than text models: they get 'stuck' on initial directions and need the context window cleared to recover.

85% relevant

AI System Discovers 'Late-Night Doomscrolling' as Health Biomarker from Wearables

An AI system analyzes wearable device data to discover new digital biomarkers for health. Its first identified pattern links prolonged late-night phone use—'doomscrolling'—to physiological states.

87% relevant

Sam Altman Compares Current AI Inflection Point to Early COVID Warnings

OpenAI CEO Sam Altman stated the current AI landscape feels like February 2020, when his team foresaw COVID's impact while others dismissed it. He claims AI has already passed critical capability thresholds that mainstream society has yet to perceive.

85% relevant

Mythos AI Model Reportedly 'Destroys' Benchmarks in Early Leak

A viral tweet claims the unreleased Mythos AI model 'destroys every other model' based on leaked benchmarks. No official confirmation or technical details are available.

85% relevant

Study of 1,222 Users Claims ChatGPT Use Reduces Cognitive Effort

A viral social media post references a study of 1,222 people, claiming it proves ChatGPT use reduces cognitive effort. The claim lacks published methodology or data, highlighting the ongoing debate over AI's impact on human cognition.

87% relevant

Anthropic Paper: 'Emotion Concepts and their Function in LLMs' Published

Anthropic has released a new research paper titled 'Emotion Concepts and their Function in LLMs.' The work investigates the role and representation of emotional concepts within large language model architectures.

95% relevant

Chamath Palihapitiya: OpenAI, Anthropic IPOs to Pressure Legacy Tech Stocks

VC Chamath Palihapitiya claims the scale of OpenAI and Anthropic is unprecedented and their public listings will force a market re-evaluation of traditional tech company valuations.

85% relevant

Agent Judges with Big Five Personas Match Human Evaluators, Show Logarithmic Score Saturation in New arXiv Study

A new arXiv study shows LLM agents conditioned with Big Five personalities produce evaluations indistinguishable from humans. Crucially, quality scores saturate logarithmically with panel size, while discovering unique issues follows a slower power law.

72% relevant

E-STEER: New Framework Embeds Emotion in LLM Hidden States, Shows Non-Monotonic Impact on Reasoning and Safety

A new arXiv paper introduces E-STEER, an interpretable framework for embedding emotion as a controllable variable in LLM hidden states. Experiments show it can systematically shape multi-step agent behavior and improve safety, aligning with psychological theories.

75% relevant

American Express Bets on Agentic AI Commerce with ACE Developer Kit and ChatGPT Perks

AmEx CEO Stephen Squeri's shareholder letter outlines a proactive strategy for the agentic AI commerce era, launching an ACE developer kit for payment integration and offering business cardholders a ChatGPT subscription credit. The company sees its premium membership model as resilient against disruptive AI commerce theories.

95% relevant

Humans-as-Luxury: Redefining Value in an Automated Hospitality Future

An article on Hospitality Net argues that in a future of automated service, genuine human interaction will become a premium, scarce commodity. This 'Humans-as-Luxury' concept redefines value, shifting from efficiency to emotional connection and bespoke experience.

89% relevant

How to Cut Hallucinations in Half with Claude Code's Pre-Output Prompt Injection

A Reddit user discovered a technique that forces Claude to self-audit before responding, dramatically reducing hallucinations by surfacing rules at generation time.

95% relevant

Motif CLI: Track Your Claude Code Efficiency with Real-Time AIPM Dashboard

Install Motif CLI to analyze your Claude Code chat history, track AI tokens per minute, and generate personal coding assessments—all locally.

86% relevant

New Research Diagnoses LLMs' Struggle with Multiple Knowledge Updates in Context

A new arXiv paper reveals a persistent bias in LLMs when facts are updated multiple times within a long context. Models increasingly favor the earliest version, failing to track the latest state—a critical flaw for dynamic knowledge tasks.

78% relevant

How to Build Your Own Claude Code Agent: The Core Loop Explained

Learn the fundamental while-tool-feedback loop that powers Claude Code and how to apply its principles to write better prompts.

95% relevant

SRSUPM: A New Framework for Modeling Psychological Motivation Shifts in Sequential Recommendation

Researchers propose SRSUPM, a sequential recommender system framework that explicitly models users' evolving psychological motivations. It outperforms existing methods on three benchmarks by better capturing motivation shifts and collaborative patterns.

98% relevant

Hermès Faces Questions as Birkin and Kelly Resale Market Softens

The Business of Fashion reports a softening resale market for Hermès's iconic Birkin and Kelly bags, posing strategic questions for the luxury powerhouse. This signals a potential shift in the ultra-luxury asset class.

88% relevant

Hinton's Linguistic Shift: Why 'Confabulations' Could Transform How We Understand AI Errors

AI pioneer Geoffrey Hinton proposes replacing the term 'hallucinations' with 'confabulations' to describe AI errors. This linguistic reframing suggests AI systems aren't malfunctioning but rather constructing plausible narratives from their training data, offering new perspectives on AI cognition.

85% relevant

When AI Agents Disagree: New Research Tests Whether LLMs Can Reach Consensus

New research explores whether LLM-based AI agents can effectively communicate and reach agreement in multi-agent systems. The study reveals surprising patterns in how AI agents negotiate, disagree, and sometimes fail to find common ground.

85% relevant

When AI Agents Need to Read Minds: The Complex Reality of Theory of Mind in Multi-LLM Systems

New research reveals that adding Theory of Mind capabilities to multi-agent AI systems doesn't guarantee better coordination. The effectiveness depends on underlying LLM capabilities, creating complex interdependencies in collaborative decision-making.

85% relevant

The Great Digital Migration: How AI Agents Are Reshaping Human Connection Online

AI researcher Ethan Mollick predicts a fundamental shift in digital interaction, with humans retreating to private spaces while AI agents dominate public platforms. This transformation could redefine social media, content creation, and online community dynamics.

85% relevant

The Polished AI Paradox: Anthropic Study Reveals How Fluent Output Undermines Critical Thinking

Anthropic's analysis of 10,000 Claude conversations reveals a troubling pattern: the more polished AI-generated content appears, the less likely users are to verify its accuracy. The company's new AI Fluency Index shows that while iteration improves outcomes, it also creates dangerous complacency.

70% relevant

Beyond the Benchmark: New Model Separates AI Hype from True Capability

A new 'structured capabilities model' addresses a critical flaw in AI evaluation: benchmarks often confuse model size with genuine skill. By combining scaling laws with latent factor analysis, it offers the first method to extract interpretable, generalizable capabilities from LLM test results.

72% relevant