red team
30 articles about red team in AI news
Decepticon Open-Sources Autonomous AI Red Team for Full Kill Chain
Decepticon, a new open-source multi-agent AI system, autonomously executes the entire cyber kill chain for red teaming, from reconnaissance to exfiltration, enabling continuous security testing.
Mythos AI Red Team Reports: A 6-9 Month Warning Window for CISOs
AI researcher Ethan Mollick highlights a critical gap: few large organizations treat AI red team reports from groups like Mythos as urgent threats, despite a historical 6-9 month diffusion window to malicious actors.
Swarm Plugin Enforces Consistent 9/10 Outputs from Claude Code Teams
The Swarm plugin for Claude Code creates a structured team of agents that review and score work before it reaches you, solving the problem of inconsistent output quality.
Garry Tan's gstack: Install This 56k-Star 'Virtual Team' for Claude Code
YC CEO Garry Tan open-sourced gstack, a pack of slash commands that turns Claude Code into a structured team of specialists, claiming it helps ship 10k-20k lines of code daily.
Google Quantum AI Team Reduces Bitcoin-Cracking Qubit Estimate to ~500k, Enabling 9-Minute Key Derivation
Google researchers have compiled Shor's algorithm to solve Bitcoin's 256-bit elliptic curve problem with ~1.2k logical qubits, translating to <500k physical qubits—a 20x reduction from 2023 estimates. This makes 'on-spend' attacks against unconfirmed transactions theoretically plausible with fast-clock quantum hardware.
OpenAI Launches ChatGPT Workspace Agents for Team Automation
OpenAI has introduced workspace agents within ChatGPT, powered by Codex, designed to automate complex, multi-step workflows for teams across shared environments like Slack. These agents can gather context, execute tasks, request approvals, and run continuously in the cloud.
Microsoft Fires Candy Crush AI Team After Years of Level-Design Tool Development
A developer claims Microsoft fired the AI team at King, the Candy Crush developer, after they spent years building tools to automate level design. This highlights the tension between long-term AI R&D and corporate cost-cutting.
Emergent AI Launches Work Stress Copilot, Integrates with Slack & Teams
Emergent AI has launched a new 'Work Stress Copilot' agent that integrates with Slack and Microsoft Teams to autonomously manage calendar scheduling, email triage, and meeting prep. The tool aims to directly reduce cognitive load by automating repetitive administrative work.
Replace Claude Code's Context-Stuffing with git-semantic for Team-Wide Semantic Search
A new tool, git-semantic, lets teams build and share a semantic search index of their codebase via Git, eliminating redundant API calls and enabling faster, more accurate Claude Code queries.
OpenAI Unbundles Codex API, Launches Metered Pilot with Usage-Based Pricing
OpenAI has unbundled its Codex code-generation model from ChatGPT Business, making it available as a standalone, usage-metered product. This allows teams to pilot Codex without purchasing full ChatGPT seats and ties costs directly to coding output.
AI-Powered 'Vibe-Coded' Companies Emerge as AI Collapses Traditional Staffing Models
Entrepreneur Matthew Gallagher used AI to automate core business functions—coding, marketing, support—allowing his company to scale without building a large managerial team. This demonstrates AI's current strength: drastically reducing coordination costs to enable solo or small teams to execute like corporations.
DeepMind Secretly Assembled ~20-Person Team to Train AI for High-Frequency Trading, Aiming at Renaissance
Demis Hassabis formed a covert ~20-researcher team within DeepMind to develop AI-powered high-frequency trading algorithms, reportedly targeting rival Renaissance Technologies. Google leadership disapproved, leading to the project's quiet termination.
Requestly Launches Git-Synced API Client to Replace Scattered Postman Setups
Requestly has launched an AI-powered API client that automatically syncs team collections through Git, eliminating stale docs and configuration drift. The tool directly targets the collaboration pain points of Postman and Insomnia users.
LeCun's Team Publishes LeWorldModel: A 15M-Parameter World Model That Mathematically Prevents Training Collapse
Yann LeCun's team has open-sourced LeWorldModel, a 15M-parameter world model that uses a novel SIGReg regularizer to make representation collapse mathematically impossible. It trains on a single GPU in hours and enables efficient physical prediction for robotics and autonomous systems.
OpenAI Shifts Sora Team to World-Model Research, Reportedly Cancels Video Model for Compute
A report claims OpenAI has redirected its Sora team to focus on world-model research for robotics and canceled the video model to free compute for a new, powerful LLM codenamed 'Spud.'
AI Product Teams: How Luxury Brands Can 10x Development Velocity with Autonomous Agents
A developer built a full deal intelligence platform in one week using two AI agents as team members. This structured approach—43 sprints, 6,800-line strategy—demonstrates how luxury brands can accelerate digital innovation with AI-powered product development.
The End of 'Who Has the Latest Version?': How AI-Powered Real-Time Collaboration is Transforming Development
AI-driven real-time shared workspaces are eliminating traditional development bottlenecks like version conflicts and sync errors. These platforms enable entire teams to work from a single, live state, fundamentally changing how developers collaborate.
Clinical LLM Rejection Predictor Hits AUROC 0.719 in 4.5-Month Study
Clinical LLM rejection predictor achieves AUROC 0.719 in 4.5-month study using deployment-specific context to forecast user rejection before response generation.
og-local: The Local Privacy Proxy That Redacts Secrets Before They Reach
og-local is a local proxy that redacts PII/secrets from Claude Code API calls using an ONNX model. Install via curl, run ogl claude. No cloud round-trip, no data leaks.
Claude Code's June 15 Agentic Credit Split: How to Avoid Hitting the $20 Wall
Claude Code's June 15 agentic credit split moves `claude -p` and CI workflows to a separate $20/month bucket on Pro. Upgrade to Max 5x or switch to direct API for production pipelines.
MIT Spinoff's Nuclear-Inspired Cooling Targets Data Center Water Use
MIT spinoff Infinite Cooling unveiled a nuclear-inspired cooling system that recycles data center heat and water, targeting 40% water use reduction. The tech faces competition from liquid cooling but offers retrofits for existing towers.
GrubMarket Launches AI Agent for Food Distributor Sales Teams
GrubMarket launches an AI agent for food distributor sales teams, offering real-time data and automated recommendations to boost efficiency. This applies directly to retail and luxury supply chain sales operations.
Anthropic Splits --print Into $100/Mo Credit, Torpedoes CLI Workarounds
Anthropic splits `claude --print` and Agent SDK usage into separate monthly credits. Pro gets $20/mo, Max gets $100-$200/mo. Credits don't roll over.
Shopify Drops Redis for MySQL in Inventory Reservations, Scales 10x
Shopify replaced Redis with MySQL for inventory reservations, achieving 10x scalability and handling 50,000 writes per second.
Claude Code Plugin Deploys 17-Agent SDLC Team With Orchestrator
Team-of-agents plugin adds 17 specialist AI agents with an orchestrator to Claude Code, using confidence signals to gate output quality.
MIT Hackathon Team Builds Wearable AI for Physical Movement Guidance
MIT hackathon team builds wearable AI for real-time physical movement guidance via sensors and on-device inference, demoed by @kimmonismus.
Embedding distance predicts VLM typographic attack success (r=-0.93)
A new study shows that embedding distance between image text and harmful prompt strongly predicts attack success rate (r=-0.71 to -0.93). The researchers introduce CWA-SSA optimization to recover readability and bypass safety alignment without model access.
SSL: Structured Skill Language Boosts Skill Discovery MRR to 0.707
Researchers propose SSL, a three-layer typed JSON representation for AI agent skills, replacing unstructured SKILL.md prose. Using an LLM normalizer, SSL improves Skill Discovery MRR from 0.573 to 0.707 and Risk Assessment macro F1 from 0.744 to 0.787 on a newly released 6,184-skill corpus.
LLM-Based Customer Digital Twins Predict Preferences with 87.7% Accuracy
A new arXiv paper proposes using LLM-based 'customer digital twins' (CDTs) — agents built from individual Reddit review histories via RAG — to perform conjoint analysis. The CDTs predict actual user preferences with 87.73% accuracy in a computer monitor case study, offering a scalable alternative to traditional market research.
RedParrot: Semantic Caching Speeds Up NL-to-DSL for Business Analytics by
Xiaohongshu researchers propose RedParrot, a framework that caches normalized structural patterns of natural language queries to bypass expensive LLM pipelines, achieving 3.6x speedup and 8.26% accuracy improvement on enterprise datasets.