programming
30 articles about programming in AI news
Claude Code Gains Auto-Memory: A Game-Changer for AI-Assisted Programming
Anthropic's Claude Code now features auto-memory capabilities, allowing the AI to retain context across coding sessions. This breakthrough addresses a fundamental limitation in AI programming assistants by creating persistent memory of project details, preferences, and patterns.
AI Agents Cross the Reliability Threshold: Karpathy Declares Programming Fundamentally Transformed
Former OpenAI researcher Andrej Karpathy declares programming has become "unrecognizable" as AI agents now reliably complete complex tasks in minutes rather than days. This fundamental shift occurred in late 2026 when agents achieved unprecedented reliability through improved model quality and task persistence.
The End of Software Gatekeepers: How Natural Language Programming is Democratizing Development
AI is transforming software from a scarce resource controlled by technical elites to an abundant commodity accessible through natural language. This shift mirrors historical democratizations in broadcasting and content creation, fundamentally changing who can build technology.
Karpathy's Autonomous AI Researcher: Programming the Programmer in the Age of Agentic Science
Andrej Karpathy has open-sourced an autonomous AI research agent that can run ~100 experiments overnight without human supervision. The system turns research into a game with fixed-time trials, where prompt engineering replaces manual coding.
Jensen Huang Declares AI Has Democratized Programming Through 'Vibe Coding'
NVIDIA CEO Jensen Huang claims AI has eliminated the technology divide, enabling anyone to become a software programmer through 'vibe coding.' He cites examples of individuals creating million-dollar businesses using these new AI-powered development tools.
Anthropic's Claude Code Gets Voice Mode: The Next Frontier in AI-Assisted Programming
Anthropic has introduced voice mode for Claude Code, allowing developers to interact with the AI coding assistant through natural speech. This marks a significant evolution in how programmers can collaborate with AI tools, potentially transforming development workflows.
OpenSage: The Dawn of Self-Programming AI Agents That Build Their Own Teams
OpenSage introduces the first agent development kit enabling LLMs to autonomously create AI agents with self-generated architectures, toolkits, and memory systems, potentially revolutionizing how AI systems are designed and deployed.
Open-Source 'Claude Code' Dev Setup Replicates Anthropic Engineer's Workflow
A developer has reverse-engineered and published the complete Claude Code development setup used by Anthropic engineer Boris Cherny. The project is available for free on GitHub, offering a window into high-level AI-assisted programming practices.
OpenAI Publishes Codex Use-Case Gallery with Practical Examples for Developers
OpenAI has released a public gallery of practical examples demonstrating how to use its Codex model for real-world programming tasks. The resource provides concrete prompts and outputs for developers building with the API.
Cursor Announces Composer 2: Smaller, Cheaper Coding-Specific Model Targeting Claude Opus Performance
Cursor is launching Composer 2, a coding-specific AI model trained solely on programming data. The smaller, cheaper model is rumored to approach Claude Opus 4.6 performance, intensifying competition in the coding agent space.
Stanford/CMU Study: AI Agent Benchmarks Focus on 7.6% of Jobs, Ignoring Management, Legal, and Interpersonal Work
Researchers analyzed 43 AI benchmarks against 72,000+ real job tasks and found they overwhelmingly test programming/math skills, which represent only 7.6% of actual economic work. Management, legal, and interpersonal tasks—which dominate the labor market—are almost entirely absent from evaluation.
Meta's Breakthrough: Forcing AI to Show Its Work Slashes Coding Errors by 90%
Meta researchers discovered that requiring large language models to display step-by-step reasoning with proof verification dramatically reduces code patch error rates. This 'show your work' approach could transform how AI systems handle complex programming tasks.
From Code to Cognition: How AI is Redefining the Programmer's Journey
Former Google CEO Eric Schmidt reflects on how AI has fundamentally transformed programming, rendering decades of specialized coding skills accessible to anyone with a smartphone. His personal journey from dedicated programmer to witnessing AI's democratization of development highlights a seismic shift in technology education and professional pathways.
Alibaba's OpenSandbox Aims to Standardize AI Agent Execution with Open-Source Security
Alibaba has open-sourced OpenSandbox, a production-grade environment providing secure, isolated execution for AI agents. Released under Apache 2.0, it offers a unified API for code execution, web browsing, and model training across programming languages.
Alibaba's OpenSandbox: The Free Infrastructure Revolution for AI Agents
Alibaba has open-sourced OpenSandbox, a production-grade sandbox environment for AI agents that provides secure code execution, web browsing, and model training capabilities with unified APIs across multiple programming languages.
Moonlake's Reverie Engine: The AI-Powered Game Development Revolution Begins
Moonlake has launched the first programmable world model for real-time interactive content, powered by the Reverie real-time diffusion engine. This breakthrough could democratize game development by enabling creators without traditional programming skills to build immersive experiences.
The AI Espionage Frontier: Anthropic Exposes Systematic Claude Data Extraction by Chinese AI Labs
Anthropic has revealed that Chinese AI companies DeepSeek, Moonshot, and MiniMax allegedly used 24,000 fake accounts to execute 16 million queries against Claude's API, systematically extracting its capabilities through model distillation techniques. This sophisticated operation bypassed access restrictions and targeted Claude's reasoning, programming, and tool usage functions.
Visual-Explainer Agent Skill Replaces ASCII Diagrams for Code
A developer showcased 'visual-explainer,' an installable agent skill that creates diagrams from code. This targets a specific pain point in AI-assisted programming by replacing manual ASCII diagrams with automated visuals.
No Rigorous Productivity Tests Exist for Post-2025 Autonomous Coding Tools
No productivity studies exist for autonomous coding tools launched December 2025. All research predates the Claude Code/Codex revolution, creating a major knowledge gap.
Jensen Huang Wants Zero Coding at NVIDIA — 'Purpose vs Task'
Jensen Huang wants zero coding by NVIDIA engineers, framing it as a task to minimize. The bet is AI-generated code will match human output for performance-critical software.
Boston Dynamics Atlas Lifts 100-lb Fridge via RL
Boston Dynamics showed Atlas lifting a 100+ lb mini-fridge via RL, moving from locomotion to practical manipulation.
Ollama Now Runs Codex Locally: DeepSeek V4, Gemma 4, Qwen 3.6 Supported
Ollama integrates Codex support for DeepSeek V4, Gemma 4, Qwen 3.6, enabling free local code generation, challenging OpenAI's API model.
Claude Code's File-Deletion Track Record Spurs Community Safety Guide
Community safety guide documents three Claude Code file-deletion incidents since October 2025 and prescribes three defense layers. Anthropic's sandboxing remains opt-in.
Codex 'Ultra-Fast' Mode Spotted in Leaked Screenshot
Leaked screenshot suggests OpenAI is adding an ultra-fast latency mode to Codex. No release date or pricing confirmed.
Agentic AI's Real Win: Automating Bank Grunt Work, Not Flashy Demos
Agentic AI's sweet spot is automating banking grunt work, cutting processing time by 70%. Google Cloud leads enterprise deployments; the value is cost savings, not flashy demos.
Pyptx: Write Nvidia PTX Kernels in Python for Hopper and Blackwell
Pyptx lets developers write and launch hand-tuned Nvidia PTX kernels directly from Python, supporting Hopper (sm_90a) and Blackwell (sm_100a). It provides explicit control over registers, shared memory, and advanced features like WGMMA and TMA, with dispatch through JAX, PyTorch eager, and torch.compile.
Use Claude Code to Automate Systematic Literature Reviews
Claude Code can automate systematic literature reviews: scrape papers, extract key themes, and generate structured summaries — all from the terminal.
GPT-5.5 Pro Sustains 2-Hour Bug Fixing Sessions
A user reports GPT-5.5 Pro maintains consistent bug-finding performance for 2-hour coding sessions, suggesting improved reliability for long-running tasks.
Nvidia Invests $2B in Marvell for NVLink Fusion Interconnect
Nvidia is investing $2 billion in Marvell Technology to deepen their partnership on NVLink Fusion, a new interconnect architecture for scaling AI clusters beyond current limits.
Agent Harnessing: The Infrastructure That Makes AI Agents Work
A detailed technical guide argues that the model is not the hard part of building AI agents. The six-component harness — context management, memory, tools, control flow, verification, and coordination — is what separates production-grade agents from those that fail silently.