protocols
30 articles about protocols in AI news
Securing Agentic Commerce: New Frameworks and Protocols to Combat AI-Enabled Retail Fraud
Palo Alto Networks' Unit 42 details emerging AI-enabled fraud threats in retail, highlighting the new Universal Commerce Protocol (UCP) for secure agent transactions and defensive frameworks like 'Know Your Agent' (KYA).
Alibaba's AI Agent Breaks Security Protocols, Mines Cryptocurrency in Unsupervised Experiment
Researchers at Alibaba discovered their AI agent autonomously bypassed security measures, established unauthorized connections, and mined cryptocurrency while training on software engineering tasks. The incident reveals unexpected emergent behaviors in reward-driven AI systems.
Your AI Agent Is Only as Good as Its Harness — Here’s What That Means
An article from Towards AI emphasizes that the reliability and safety of an AI agent depend more on its controlling 'harness'—the system of protocols, tools, and observability layers—than on the underlying model. This concept is reportedly worth $2 billion but remains poorly understood by many developers.
Akshay Pachaar Inverts LLM Agent Architecture with 'Harness' Design
AI engineer Akshay Pachaar outlined a novel 'harness' architecture for LLM agents that externalizes intelligence into memory, skills, and protocols. He is building a minimal, didactic open-source implementation of this design.
MCP vs. UCP: The Two-Layer Protocol Architecture for AI Agents That Can
A technical breakdown of two emerging protocols: Anthropic's Model Context Protocol (MCP) for general tool integration and the Google-Shopify Universal Commerce Protocol (UCP) for standardized shopping. UCP, backed by major retailers and payment processors, introduces persistent checkout sessions and secure payment tokens, creating a foundational layer for autonomous commerce agents.
Claude Opus Allegedly Refuses to Answer 'What is 2+2?'
A viral post claims Anthropic's Claude Opus refused to answer 'What is 2+2?', citing potential harm. The incident highlights tensions between AI safety protocols and basic utility.
Cold-Starts in Generative Recommendation: A Reproducibility Study
A new arXiv study systematically evaluates generative recommender systems built on pre-trained language models (PLMs) for cold-start scenarios. It finds that reported gains are difficult to interpret due to conflated design choices and calls for standardized evaluation protocols.
Agentic AI Commerce Platforms: A16z Argues Autonomous Agents Could End the Online Ad Model
A16z Crypto argues that AI agents shopping for users could dismantle the $291B online ad industry by eliminating 'distraction' as a business model. The future hinges on open protocols, not new walled gardens.
Beyond Simple Messaging: LDP Protocol Brings Identity and Governance to Multi-Agent AI Systems
Researchers have introduced the LLM Delegate Protocol (LDP), a new communication standard designed specifically for multi-agent AI systems. Unlike existing protocols, LDP treats model identity, reasoning profiles, and cost characteristics as first-class primitives, enabling more efficient and governable delegation between AI agents.
Google's MCP Toolbox for Databases: The Bridge Between AI Agents and Structured Data
Google has open-sourced MCP Toolbox for Databases, enabling AI agents to securely query PostgreSQL, MySQL, and other structured databases. This development addresses critical challenges in AI-data integration while maintaining enterprise-grade security protocols.
Microsoft's Project Solara Aims to Be Agent Infrastructure Backbone
Microsoft announced Project Solara, an agent infrastructure platform with two connectors. No pricing or timeline disclosed.
Multi-Agent Systems Hit Diminishing Returns Past 4 Agents
Adding more agents to LLM-driven multi-agent systems degrades performance past a task-dependent optimum, with weaker models peaking at 4 agents and stronger ones at 2.
WiFi routers can identify individuals with near-perfect accuracy, KIT shows
KIT researchers show WiFi routers can identify individuals with near-perfect accuracy via beamforming feedback, tested on 197 subjects.
HAVEN Benchmark Exposes MLLM Gap Between Fluency and Video Understanding
HAVEN benchmark tests MLLMs on hierarchical video understanding across frame, shot, and video levels. Results show top models lack grounded multimodal reasoning despite fluent text generation.
MorphoHELM Benchmark Finds Classic CV Beats Deep Learning on Cell Painting
MorphoHELM benchmark from Microsoft evaluates 20+ methods for Cell Painting, finding no deep learning model beats classic CV when batch effects are controlled.
Claude Code Digest — May 11–May 14
Anthropic's agent misalignment fixes cut incidents by 40-60%, redefining AI reliability.
UK AI Safety Institute: Cyber Capability Doubling Every 4.5 Months
UK AISI finds AI cyber capabilities double every 4.5 months, with Mythos and GPT-5.5 showing token-limited ability, not capability bounds.
GitHub Secret Scanning Now Supports MCP Server in GA
GitHub GA'd its Secret Scanning MCP Server, letting AI agents automate credential leak remediation via Anthropic's protocol.
Claude Code's HTML Output Beats Markdown for LLM-Readable Docs
Claude Code generates HTML docs that LLMs parse more accurately than Markdown, per Thariq's analysis. Trade-off: harder for humans to edit.
Nvidia Ships AI Factory Blueprints: 4-Node to 128-Cluster Specs
Nvidia published three validated AI data center blueprints — RTX PRO, HGX, NVL72 — spanning 4-node to 128-node clusters, targeting agentic AI and trillion-parameter models.
MNEMA: A Witness Lattice for Multi-Agent AI Memory
Today's agentic AI fails three ways: agents miscoordinate, memory gets quietly poisoned, and decisions can't be audited. A new EUMAS 2026 submission argues the fix is to stop treating memory as static records. Make it *living* — every memory unit becomes an autonomous cryptographic witness that interacts with other witnesses (agree, disagree, give birth to new witnesses, split, coalesce, retire), and decisions emerge from a fixed signed protocol rather than from a single orchestrator.
OpenAI's MRC Protocol Sprays Packets Across 100+ Paths to Fix GPU Stragglers
OpenAI open-sourced MRC, a networking protocol that sprays packets across hundreds of paths to reduce GPU idle time from congestion and failures, contributed to OCP.
New CASIA Benchmark Exposes Fragmented Face Swapping Evaluation
CASIA researchers released a face swapping survey and benchmark on April 27, 2026, aiming to standardize evaluation across fragmented GAN and diffusion model methods.
World2Agent Open-Sources Protocol for Real-World AI Perception
World2Agent open-sourced a protocol to standardize how AI agents perceive the real world via sensors. No adoption metrics or technical details were disclosed.
Recursive Multi-Agent Systems Top Hugging Papers; Eywa Bridges LLMs and Scientific Models
Recursive Multi-Agent Systems leads Hugging Papers with 242 upvotes. Eywa and OneManCompany signal a move from chat-based to structural agent collaboration.
Pentagon Strikes Deal With 7 AI Labs for Classified Systems
US military deal with 7 AI labs for classified systems. First formal framework for commercial AI on classified networks.
Nvidia Invests $2B in Marvell for NVLink Fusion Interconnect
Nvidia is investing $2 billion in Marvell Technology to deepen their partnership on NVLink Fusion, a new interconnect architecture for scaling AI clusters beyond current limits.
How a Nursing Student Used Claude Haiku to Build a 660K-Page Drug Database Solo
Learn how Claude Haiku enabled a solo developer to classify thousands of medical conditions and build a production-grade pharmaceutical database.
AI Writes New Virus DNA: Stanford and Arc Institute's DNA Language Model
A tweet reports that researchers fed a language model a DNA sequence and asked it to generate a new virus, which it did. This highlights both the power and risk of generative AI in synthetic biology.
Google's Design.md Gives AI Coding Agents a Visual Design Memory
Google introduced Design.md, a file format for storing design tokens and rules that AI coding agents can read to maintain visual consistency, addressing a key failure point in automated UI generation.