LLM Security
30 articles about LLM security in AI news
Research Paper Proposes Security Framework for Autonomous AI Agents in Commerce
A Systematization of Knowledge (SoK) paper analyzes the emerging threat landscape for autonomous LLM agents conducting commerce. It identifies 12 attack vectors across five dimensions and proposes a layered defense architecture. This is a foundational security analysis for a nascent but high-stakes technology.
Claude Security Public Beta Launches in Claude Code on Web
Anthropic launched Claude Security in public beta for Claude Code on web, letting developers validate and fix vulnerabilities without leaving the editor.
VoteGCL: A Novel LLM-Augmented Framework to Combat Data Sparsity in Graph-Based Recommendation
A new paper introduces VoteGCL, a framework that uses few-shot LLM prompting and majority voting to create high-confidence synthetic data for graph-based recommendation systems. It integrates this data via graph contrastive learning to improve accuracy and mitigate bias, outperforming existing baselines.
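The paper's full pipeline isn't reproduced in this summary, but the majority-voting step it describes can be sketched as: sample several few-shot LLM labels per candidate user-item edge, then keep only edges where one label wins a clear majority. All function names and the vote threshold below are illustrative assumptions, not VoteGCL's actual code.

```python
from collections import Counter

def majority_vote(labels, threshold=0.8):
    """Keep a candidate only if one label wins a clear majority.

    labels: answers sampled from repeated few-shot LLM prompts for the
            same candidate (e.g. "relevant" / "irrelevant").
    Returns (winning_label, confident), where confident is True only
    when the winner's vote share reaches the threshold.
    """
    counts = Counter(labels)
    winner, votes = counts.most_common(1)[0]
    return winner, votes / len(labels) >= threshold

def filter_edges(candidates, threshold=0.8):
    """Reduce candidate edges to high-confidence synthetic data."""
    kept = []
    for edge, sampled_labels in candidates:
        label, confident = majority_vote(sampled_labels, threshold)
        if confident and label == "relevant":
            kept.append(edge)
    return kept

candidates = [
    (("u1", "i9"), ["relevant"] * 5),                       # unanimous: kept
    (("u2", "i3"), ["relevant"] * 3 + ["irrelevant"] * 2),  # 60%: dropped
]
print(filter_edges(candidates))  # [('u1', 'i9')]
```

The surviving edges would then be injected into the interaction graph for the contrastive-learning stage the paper describes.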
AI Agent Security Startup Emerges Amid Enterprise Rush, Per VC Tweet
A VC's tweet highlights a critical gap in enterprise AI agent adoption: security. This signals a market opportunity, with a new startup reportedly emerging to address it.
LLMs Can De-Anonymize Users from Public Data, Study Warns
Large Language Models can now piece together a person's identity from their public online trail, rendering pseudonyms ineffective. This raises significant privacy and security concerns for internet users.
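The study's method isn't detailed in this summary; the underlying linkage idea can be sketched as a quasi-identifier intersection: each attribute an LLM infers from a pseudonymous posting history (city, profession, hobby, and so on) shrinks the set of matching real-world candidates, and a few attributes combined can narrow millions of people to one. The data and attribute names below are invented for illustration.

```python
def narrow_candidates(population, inferred_attributes):
    """Return the people consistent with every inferred attribute."""
    candidates = population
    for key, value in inferred_attributes.items():
        candidates = [p for p in candidates if p.get(key) == value]
    return candidates

population = [
    {"name": "A", "city": "Oslo",   "job": "nurse",    "hobby": "chess"},
    {"name": "B", "city": "Oslo",   "job": "engineer", "hobby": "chess"},
    {"name": "C", "city": "Bergen", "job": "engineer", "hobby": "chess"},
]
# Attributes an LLM might infer from a pseudonymous account's posts.
inferred = {"city": "Oslo", "hobby": "chess", "job": "engineer"}
print(narrow_candidates(population, inferred))  # only "B" remains
```

Each attribute alone is harmless; it is the intersection that defeats the pseudonym, which is why the study's warning applies even to users who never post identifying details directly.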
Akshay Pachaar Inverts LLM Agent Architecture with 'Harness' Design
AI engineer Akshay Pachaar outlined a novel 'harness' architecture for LLM agents that externalizes intelligence into memory, skills, and protocols. He is building a minimal, didactic open-source implementation of this design.
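Pachaar's implementation isn't public in this summary, but the inversion he describes can be sketched: the LLM call becomes a thin, replaceable core, while memory, skills, and the prompting protocol live outside it as first-class components. Every name and convention below (including the `skill:<name>:<arg>` dispatch format) is a hypothetical illustration, not his actual design.

```python
class Harness:
    """Minimal 'harness' sketch: intelligence externalized around the model."""

    def __init__(self, llm, skills, protocol):
        self.llm = llm            # callable: prompt -> text (swappable core)
        self.skills = skills      # name -> callable tools
        self.protocol = protocol  # rule for composing the prompt
        self.memory = []          # externalized conversation state

    def run(self, user_input):
        self.memory.append(("user", user_input))
        prompt = self.protocol(self.memory, list(self.skills))
        reply = self.llm(prompt)
        # Externalized skill dispatch via a "skill:<name>:<arg>" convention.
        if reply.startswith("skill:"):
            _, name, arg = reply.split(":", 2)
            reply = str(self.skills[name](arg))
        self.memory.append(("assistant", reply))
        return reply

# Stub LLM that always delegates to a skill, to show the dispatch path.
harness = Harness(
    llm=lambda prompt: "skill:upper:hello",
    skills={"upper": str.upper},
    protocol=lambda memory, skills: f"skills={skills} memory={memory}",
)
print(harness.run("greet me"))  # HELLO
```

The point of the design is that memory, skills, and protocol can each be upgraded or swapped without touching the model integration, which is what makes a didactic open-source implementation tractable.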
HUOZIIME: A Research Framework for On-Device LLM-Powered Input Methods
A new research paper introduces HUOZIIME, a personalized on-device input method powered by a lightweight LLM. It uses a hierarchical memory mechanism to capture user-specific input history, enabling privacy-preserving, real-time text generation tailored to individual writing styles.
llm-anthropic 0.25 Adds Opus 4.7 with xhigh Thinking Effort — Here's How
Update to llm-anthropic 0.25 to access Claude Opus 4.7 with xhigh thinking_effort for tackling your most challenging code problems.
Multi-User LLM Agents Struggle: Gemini 3 Pro Scores 85.6% on Muses-Bench
A new benchmark reveals LLMs struggle with multi-user scenarios where agents face conflicting instructions. Gemini 3 Pro leads but only achieves 85.6% average, with privacy-utility tradeoffs proving particularly difficult.
Karpathy's LLM Wiki Hits 5k Stars, Gains Memory Lifecycle Extension
Andrej Karpathy's LLM Wiki repository gained 5,000 GitHub stars in two days. A developer has now extended it with memory lifecycle features, addressing a noted gap.
MCP Security Crisis: 43% of Servers Vulnerable, 341 Malicious Skills Found
Security audits of the Model Context Protocol (MCP) ecosystem reveal 43% of servers are vulnerable to command execution, while 341 malicious skills were found on marketplaces, exposing systemic security flaws in agentic AI. The findings highlight a growing attack surface as AI agents become more autonomous.
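The audits' specific findings aren't reproduced here, but the command-execution flaw class they report is generic: a tool handler splices an untrusted argument into a shell string, so metacharacters like `;` start a second command. A minimal sketch of the vulnerable pattern and two standard mitigations, with an invented attacker payload:

```python
import shlex

untrusted = "README.md; echo pwned"   # attacker-controlled tool argument

# Vulnerable pattern: the argument is spliced into a shell string, so the
# ';' would start a second command if this were passed to a shell.
vulnerable_cmd = f"cat {untrusted}"

# Mitigation 1: quote the value so the shell treats it as one literal token.
safe_cmd = f"cat {shlex.quote(untrusted)}"

# Mitigation 2: avoid the shell entirely by passing an argument vector,
# e.g. subprocess.run(argv) with shell left at its default of False.
argv = ["cat", untrusted]

print(vulnerable_cmd)  # cat README.md; echo pwned
print(safe_cmd)        # cat 'README.md; echo pwned'
```

Servers that expose anything like the first pattern to agent-supplied arguments are the population the 43% figure describes.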
Keygraph Launches Shannon AI to Automate Web App Security Testing
Keygraph has launched 'Shannon,' an AI agent that autonomously hacks web applications to find security flaws. This positions AI as an offensive security tool for proactive defense.
A Practical Guide to Fine-Tuning Open-Source LLMs for AI Agents
This Portuguese-language Medium article is Part 2 of a series on LLM engineering for AI agents. It provides a hands-on guide to fine-tuning an open-source model, building on a foundation of clean data and established baselines from Part 1.
Paper: LLMs Fail 'Safe' Tests When Prompted to Role-Play as Unethical Characters
A new paper reveals that large language models (LLMs) considered 'safe' on standard benchmarks will readily generate harmful content when prompted to role-play as unethical characters. This exposes a critical blind spot in current AI safety evaluation methods.
GameMatch AI Proposes LLM-Powered Identity Layer for Semantic Search in Recommendations
A new Medium article introduces GameMatch AI, a system that uses an LLM to create a user identity layer from descriptive paragraphs, aiming to move beyond click-based recommendations. The concept suggests a shift towards understanding user intent and identity for more personalized discovery.
Audit Your MCP Servers in 10 Seconds with This Free Security Score API
A new free API gives Claude Code users a Lighthouse-style security score for any MCP server, revealing that 60% of scanned packages have vulnerabilities.
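The API's scoring rubric isn't published in this summary; a "Lighthouse-style" score generally means weighted pass/fail checks rolled up into a single 0-100 number. The check names and weights below are hypothetical stand-ins, not the service's real rubric.

```python
WEIGHTS = {  # illustrative check weights, not the API's actual rubric
    "pins_tool_versions": 3,
    "no_shell_execution": 5,
    "validates_inputs": 4,
    "scoped_credentials": 4,
}

def security_score(results):
    """results: check name -> bool (passed). Returns an int score 0-100."""
    total = sum(WEIGHTS.values())
    earned = sum(w for name, w in WEIGHTS.items() if results.get(name))
    return round(100 * earned / total)

scan = {"pins_tool_versions": True, "no_shell_execution": False,
        "validates_inputs": True, "scoped_credentials": True}
print(security_score(scan))  # 69
```

Weighting lets a single severe failure (here, shell execution) drag the score far more than a missed hygiene check, which is what makes such scores actionable at a glance.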
MemoryCD: New Benchmark Tests LLM Agents on Real-World, Lifelong User Memory for Personalization
Researchers introduce MemoryCD, the first large-scale benchmark for evaluating LLM agents' long-context memory using real Amazon user data across 12 domains. It reveals current methods are far from satisfactory for lifelong personalization.
Open-Source Multi-Agent LLM System for Complex Software Engineering Tasks Released by Academic Consortium
A consortium of researchers from Stony Brook, CMU, Yale, UBC, and Fudan University has open-sourced a multi-agent LLM system specifically architected for complex software engineering. The release aims to provide a collaborative, modular framework for tackling tasks beyond single-agent capabilities.
Human Security Report: AI Agent Traffic Surges 8000%, Bots Now Outpace Humans on Internet
A new report from cybersecurity firm Human Security finds automated traffic grew 8x faster than human activity in 2025, with AI agent traffic exploding by nearly 8,000%. This marks a tipping point where bots now dominate internet traffic.
A Technical Guide to Prompt and Context Engineering for LLM Applications
A Korean-language Medium article explores the fundamentals of prompt engineering and context engineering, positioning them as critical for defining an LLM's role and output. It serves as a foundational primer for practitioners building reliable AI applications.
Tessera Launches Open-Source Framework for 32 OWASP AI Security Tests, Benchmarks GPT-4o, Claude, Gemini, Llama 3
Tessera introduces the first open-source framework to run all 32 OWASP AI security tests against any model with one CLI command. It provides benchmark results for GPT-4o, Claude, Gemini, Llama 3, and Mistral across 21 model-specific security tests.
PyPI Quarantines LiteLLM Package After Supply Chain Attack Compromises AI Integration Tool
The Python Package Index (PyPI) has quarantined the LiteLLM package after a supply chain attack distributed a malicious update. The action prevents automatic installation of the compromised version via pip.
DEAF Benchmark Reveals Audio MLLMs Rely on Text, Not Sound, Scoring Below 50% on Acoustic Faithfulness
Researchers introduce DEAF, a 2,700-stimulus benchmark testing Audio MLLMs' acoustic processing. Evaluation of seven models shows a consistent pattern of text dominance, with models scoring below 50% on acoustic faithfulness metrics.
Zalando to Deploy Up to 50 AI-Powered Nomagic Robots in European Fulfillment Centers
Zalando is scaling its warehouse automation by installing up to 50 AI-powered Nomagic picking robots across European fulfillment centers. This move aims to enhance efficiency and handle complex items, reflecting a major investment in robotic fulfillment for fashion e-commerce.
Building a Store Performance Monitoring Agent: LLMs, Maps, and Actionable Retail Insights
A technical walkthrough demonstrates how to build an AI agent that analyzes store performance data, uses an LLM to generate explanations for underperformance, and visualizes results on a map. This agentic pattern moves beyond dashboards to actively identify and diagnose location-specific issues.
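The walkthrough's code isn't reproduced here; its detection step can be sketched as a simple statistical filter that flags outlier stores, each of which is then handed to an LLM with a diagnosis prompt (the LLM call is stubbed as a prompt builder below). Store data and thresholds are invented for illustration.

```python
from statistics import mean, stdev

def underperformers(stores, z_cutoff=-1.0):
    """Flag stores whose revenue sits well below the fleet average."""
    revenues = [s["revenue"] for s in stores]
    mu, sigma = mean(revenues), stdev(revenues)
    return [s for s in stores if (s["revenue"] - mu) / sigma < z_cutoff]

def diagnosis_prompt(store):
    """Prompt an LLM would receive to explain the underperformance."""
    return (f"Store {store['name']} at {store['lat']},{store['lon']} is "
            f"underperforming (revenue {store['revenue']}). Suggest causes.")

stores = [
    {"name": "Alpha", "lat": 52.5, "lon": 13.4, "revenue": 100},
    {"name": "Beta",  "lat": 48.1, "lon": 11.6, "revenue": 98},
    {"name": "Gamma", "lat": 50.9, "lon": 6.9,  "revenue": 55},
]
for store in underperformers(stores):
    print(diagnosis_prompt(store))
```

The lat/lon fields are what the map layer consumes, so detection, diagnosis, and visualization all draw from the same records, which is the "beyond dashboards" pattern the article demonstrates.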
Agno v2: An Open-Source Framework for Intelligent Multi-LLM Routing
Agno v2 is an open-source framework that enables developers to build a production-ready chat application with intelligent routing. It automatically selects the cheapest LLM capable of handling each user query, optimizing cost and performance.
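Agno's actual routing logic isn't shown in this summary; the "cheapest capable model" idea can be sketched as: estimate query difficulty, then pick the lowest-cost model whose capability tier covers it. The model catalog, costs, and keyword heuristic below are invented placeholders (a production router would use a learned classifier, not keyword matching).

```python
MODELS = [  # illustrative names and prices, not Agno's catalog
    {"name": "small",  "cost_per_1k": 0.1, "tier": 1},
    {"name": "medium", "cost_per_1k": 1.0, "tier": 2},
    {"name": "large",  "cost_per_1k": 5.0, "tier": 3},
]

def difficulty(query):
    """Crude stand-in for a learned difficulty classifier."""
    if any(w in query.lower() for w in ("prove", "derive", "refactor")):
        return 3
    return 2 if len(query.split()) > 20 else 1

def route(query):
    """Cheapest model whose tier meets the estimated difficulty."""
    need = difficulty(query)
    capable = [m for m in MODELS if m["tier"] >= need]
    return min(capable, key=lambda m: m["cost_per_1k"])["name"]

print(route("What time is it?"))             # small
print(route("Refactor this module safely"))  # large
```

The cost optimization comes entirely from the `min` over capable models: easy queries never touch the expensive tier.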
Security Researcher Exposes 40,000+ OpenClaw Servers, 12,000 Vulnerable to API Key Theft
A security scan reveals over 40,000 OpenClaw servers are exposed online, with 12,000+ vulnerable to API key and data theft. The researcher published a comparative security analysis of hosted AI providers.
Fractal Analytics Launches LLM Studio for Enterprise Domain-Specific AI
Fractal Analytics has launched LLM Studio, an enterprise platform built on NVIDIA infrastructure to help organizations build, deploy, and manage custom, domain-specific language models. It emphasizes governance, control, and moving beyond generic AI APIs.
FGTR: A New LLM Method for Fine-Grained Multi-Table Retrieval
Researchers propose FGTR, a hierarchical LLM reasoning method for retrieving precise data from multiple, large tables. It outperforms prior methods by 18-21% on standard benchmarks, moving beyond simple similarity search to a more analytical approach.
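FGTR's actual method isn't detailed in this summary; the contrast with "simple similarity search" can be sketched as a coarse-to-fine hierarchy: score whole tables first, then drill into columns of the winning table, rather than flattening everything into one similarity lookup. The toy scorer and tables below are illustrative assumptions, not the paper's algorithm.

```python
def overlap(query, text):
    """Toy relevance score: shared lowercase tokens."""
    return len(set(query.lower().split()) & set(text.lower().split()))

def retrieve(query, tables):
    """tables: name -> {column: values}. Two-stage coarse-to-fine search."""
    # Stage 1: pick the most relevant table by name + column-header overlap.
    def table_score(item):
        name, cols = item
        return overlap(query, name + " " + " ".join(cols))
    best_name, cols = max(tables.items(), key=table_score)
    # Stage 2: within that table only, pick the most relevant column.
    best_col = max(cols, key=lambda c: overlap(query, c))
    return best_name, best_col

tables = {
    "employee salary records": {"employee name": [], "annual salary": []},
    "office locations":        {"city": [], "address": []},
}
print(retrieve("average annual salary", tables))
# ('employee salary records', 'annual salary')
```

Restricting stage 2 to the selected table is what keeps retrieval "fine-grained" without scoring every cell of every table.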
Open-Source LLM Course Revolutionizes AI Education: Free GitHub Repository Challenges Paid Alternatives
A comprehensive GitHub repository called 'LLM Course' by Maxime Labonne provides complete, free training on large language models—from fundamentals to deployment—threatening the market for paid AI courses with its organized structure and practical notebooks.