agent skills
30 articles about agent skills in AI news
Addy Osmani Unveils 'Agent Skills' for AI-Powered Development
Google VP Addy Osmani teased a new framework called 'Agent Skills' for constructing AI agents, likely a significant move to standardize and simplify agent-based development workflows.
Base44 Launches Superagent Skills: No-Code Library for Adding Domain-Specific Functions to AI Agents
Base44 has launched Superagent Skills, a library of pre-built, domain-specific functions that can be added to AI agents with a single click. The no-code system allows for combining skills and creating custom ones via natural language description.
How Weaviate Agent Skills Let Claude Code Build Vector Apps in Minutes
Weaviate's official Agent Skills give Claude Code structured access to vector databases, eliminating guesswork when building semantic search and RAG applications.
SkillNet: The First Systematic Framework for Creating and Evolving AI Agent Skills
Researchers have developed SkillNet, an open infrastructure that structures over 200,000 AI agent skills within a unified ontology. The system improves agent performance by 40% while reducing execution steps by 30%, treating skills as evolving assets rather than temporary solutions.
GitHub Repository Unleashes 1,715+ Production-Ready AI Agent Skills
A new GitHub repository has surfaced containing over 1,715 production-ready AI agent skills that developers can install and deploy in seconds. This collection represents a significant leap in accessible AI tooling, potentially accelerating agent-based application development across industries.
SkillsBench Reveals AI Agent Skills: Powerful But Unpredictable
A new benchmark reveals AI agent skills boost performance by 16% on average, but benefits vary wildly across domains. Surprisingly, models can't create the skills they benefit from using.
OpenClaw Skills: The GitHub Repository That's Supercharging AI Agents with 1,700+ Ready-to-Use Capabilities
A new GitHub repository called 'awesome-openclaw-skills' has emerged, offering over 1,715 production-ready AI agent skills that can be installed with a single CLI command. This collection promises to dramatically accelerate AI agent development by providing pre-built capabilities ranging from browser automation to complex data processing.
Google's gws CLI: The AI-Agent-Ready Tool That Dynamically Masters Workspace APIs
Google has open-sourced gws, a CLI tool that dynamically interfaces with all Google Workspace APIs and ships with built-in AI agent skills. It removes the need for custom tooling and automatically adapts to new API endpoints.
SSL: Structured Skill Language Boosts Skill Discovery MRR to 0.707
Researchers propose SSL, a three-layer typed JSON representation for AI agent skills, replacing unstructured SKILL.md prose. Using an LLM normalizer, SSL improves Skill Discovery MRR from 0.573 to 0.707 and Risk Assessment macro F1 from 0.744 to 0.787 on a newly released 6,184-skill corpus.
Google's Official CLI for Workspace: A Strategic Move Toward AI-Powered Productivity
Google has released an official CLI tool for Google Workspace, integrating AI agent skills to automate tasks across Gmail, Drive, Calendar, and Docs. This move signals a shift toward AI-enhanced developer and enterprise workflows.
From Prototype to Production: Streamlining LLM Evaluation for Luxury Clienteling & Chatbots
NVIDIA's new NeMo Evaluator Agent Skills dramatically simplify testing and monitoring of conversational AI agents. For luxury retail, this means faster, more reliable deployment of high-quality clienteling assistants and customer service chatbots.
Build Reusable Data Science Workflows with Claude Skills and Subagents
Claude Skills and Subagents let you package prompts into reusable modules, freeing data scientists from repetitive AI adjustments for EDA, modeling, and deployment.
MiniMax Open-Sources Three Agent Music Skills for MMX-CLI
MiniMax has open-sourced three 'Music Skills' for its MMX-CLI agent platform. The skills allow AI agents to generate music, sing in a persona, and curate playlists from a user's local library.
How to Build a Custom AI Agent with Claude Code's Skills, SubAgents, and Hooks
A developer's deep dive into customizing Claude Code with 7 skills, 5 subagents, and quality-check hooks—showing how to move beyond basic prompting to create a truly autonomous coding assistant.
Trace2Skill Framework Distills Execution Traces into Declarative Skills via Parallel Sub-Agents
Researchers introduced Trace2Skill, a framework that uses parallel sub-agents to analyze execution trajectories and distill them into transferable declarative skills. This enables performance improvements in larger models without parameter updates.
Awesome Finance Skills: Open-Source Plugin Adds Real-Time Market Analysis to AI Agents
A developer has open-sourced Awesome Finance Skills, a plug-and-play toolkit that gives AI agents real-time financial data access, sentiment analysis, and automated research report generation. The MIT-licensed package works with Claude Code, OpenClaw, and other popular agent frameworks.
Memento-Skills Agent System Achieves 116.2% Relative Improvement on Humanity's Last Exam Without LLM Updates
Memento-Skills is a generalist agent system that autonomously constructs and adapts task-specific agents through experience. It enables continual learning without updating LLM parameters, achieving relative improvements of 26.2% on GAIA and 116.2% on Humanity's Last Exam.
How Adding 'Skills' to MCP Tools Cuts Agent Token Usage by 87%
Adding structured 'skills' descriptions to MCP tools dramatically reduces token consumption in custom agents—here's how to implement it in your Claude Code workflows.
Anthropic Cybersecurity Skills: Open-Source GitHub Repo Provides 611+ Structured Security Skills for AI Agents
A developer has released an open-source GitHub repository containing 611+ structured cybersecurity skills designed for AI agents. Each skill includes procedures, scripts, and templates, built on the agentskills.io standard.
Beyond Basic Connections: How MCP and Skills Create Truly Capable AI Agents
While MCP standardizes tool connectivity for AI agents, Skills provide the procedural knowledge needed for effective execution. Understanding this distinction is crucial for building production-ready AI systems that can perform complex tasks autonomously.
Nous Research's Hermes Agent Features Self-Improving Skills, Persistent Memory
A new evaluation of Nous Research's Hermes Agent highlights its self-improving ability to build reusable tools from experience and a smarter persistent memory system that conserves token usage. The agent reportedly improves with continued use, representing a shift towards more adaptive AI systems.
XSkill Framework Enables AI Agents to Learn Continuously from Experience and Skills
Researchers have developed XSkill, a dual-stream continual learning framework that allows AI agents to improve over time by distilling reusable knowledge from past successes and failures. The approach combines experience-based tool selection with skill-based planning, significantly reducing errors and boosting performance across multiple benchmarks.
EvoSkill: How AI Agents Are Learning to Teach Themselves New Skills
Researchers have developed EvoSkill, a self-evolving framework where AI agents automatically discover and refine their own capabilities through failure analysis. The system improves performance by up to 12% on complex tasks and demonstrates skill transfer between different domains.
Add 197 Bioinformatics Skills to Claude Code with SciAgent-Skills
A ready-to-use plugin that transforms Claude Code into a bioinformatics expert without fine-tuning or RAG setup.
Akshay Pachaar Inverts LLM Agent Architecture with 'Harness' Design
AI engineer Akshay Pachaar outlined a novel 'harness' architecture for LLM agents that externalizes intelligence into memory, skills, and protocols. He is building a minimal, didactic open-source implementation of this design.
MCP Security Crisis: 43% of Servers Vulnerable, 341 Malicious Skills Found
Security audits of the Model Context Protocol (MCP) ecosystem reveal 43% of servers are vulnerable to command execution, while 341 malicious skills were found on marketplaces, exposing systemic security flaws in agentic AI. The findings highlight a growing attack surface as AI agents become more autonomous.
MiniMax M2.7 AI Agent Rewrites Its Own Harness, Achieving 9 Gold Medals on MLE Bench Lite Without Retraining
MiniMax's M2.7 agent autonomously rewrites its own operational harness—skills, memory, and workflow rules—through a self-optimization loop. After 100+ internal rounds, it earned 9 gold medals on OpenAI's MLE Bench Lite without weight updates.
How Cult of Claude's Directory Lets You Install 3,000+ Skills in Seconds
Cult of Claude aggregates 3,296 skills and 785 agents you can immediately use with Claude Code via simple CLI commands.
AI Agents Get a Memory Upgrade: New Framework Treats Multi-Agent Memory as Computer Architecture
A new paper proposes treating multi-agent memory systems as a computer architecture problem, introducing a three-layer hierarchy and identifying critical protocol gaps. This approach could significantly improve reasoning, skills, and tool usage in collaborative AI systems.
10 Claude Code Skills That Actually Work: A Solo Developer's Vetted List
A curated list of the most effective Claude Code skills for developers, based on hands-on testing, focusing on practical MCP servers and workflow enhancements.