programming

30 articles about programming in AI news

Claude Code Gains Auto-Memory: A Game-Changer for AI-Assisted Programming

Anthropic's Claude Code now features auto-memory capabilities, allowing the AI to retain context across coding sessions. This breakthrough addresses a fundamental limitation in AI programming assistants by creating persistent memory of project details, preferences, and patterns.

Feb 27, 2026 | 85% relevant

AI Agents Cross the Reliability Threshold: Karpathy Declares Programming Fundamentally Transformed

Former OpenAI researcher Andrej Karpathy declares programming has become "unrecognizable" as AI agents now reliably complete complex tasks in minutes rather than days. This fundamental shift occurred in late 2025, when agents achieved unprecedented reliability through improved model quality and task persistence.

Feb 26, 2026 | 75% relevant

NVFP4 GEMM on RTX Pro Blackwell: SM12x Breaks from B200 Programming Model

NVIDIA's SM12x architecture drops tcgen05.mma for mma.sync, breaking B200 kernel compatibility. SM8x kernels port easily; developers must maintain separate codebases.

Jun 21, 2026 | 86% relevant

The End of Software Gatekeepers: How Natural Language Programming is Democratizing Development

AI is transforming software from a scarce resource controlled by technical elites to an abundant commodity accessible through natural language. This shift mirrors historical democratizations in broadcasting and content creation, fundamentally changing who can build technology.

Mar 9, 2026 | 85% relevant

Karpathy's Autonomous AI Researcher: Programming the Programmer in the Age of Agentic Science

Andrej Karpathy has open-sourced an autonomous AI research agent that can run ~100 experiments overnight without human supervision. The system turns research into a game with fixed-time trials, where prompt engineering replaces manual coding.

Mar 7, 2026 | 95% relevant

Jensen Huang Declares AI Has Democratized Programming Through 'Vibe Coding'

NVIDIA CEO Jensen Huang claims AI has eliminated the technology divide, enabling anyone to become a software programmer through 'vibe coding.' He cites examples of individuals creating million-dollar businesses using these new AI-powered development tools.

Mar 5, 2026 | 85% relevant

Anthropic's Claude Code Gets Voice Mode: The Next Frontier in AI-Assisted Programming

Anthropic has introduced voice mode for Claude Code, allowing developers to interact with the AI coding assistant through natural speech. This marks a significant evolution in how programmers can collaborate with AI tools, potentially transforming development workflows.

Mar 3, 2026 | 85% relevant

OpenSage: The Dawn of Self-Programming AI Agents That Build Their Own Teams

OpenSage introduces the first agent development kit enabling LLMs to autonomously create AI agents with self-generated architectures, toolkits, and memory systems, potentially revolutionizing how AI systems are designed and deployed.

Feb 20, 2026 | 75% relevant

Open-Source 'Claude Code' Dev Setup Replicates Anthropic Engineer's Workflow

A developer has reverse-engineered and published the complete Claude Code development setup used by Anthropic engineer Boris Cherny. The project is available for free on GitHub, offering a window into high-level AI-assisted programming practices.

Apr 13, 2026 | 77% relevant

OpenAI Publishes Codex Use-Case Gallery with Practical Examples for Developers

OpenAI has released a public gallery of practical examples demonstrating how to use its Codex model for real-world programming tasks. The resource provides concrete prompts and outputs for developers building with the API.

Mar 28, 2026 | 85% relevant

Cursor Announces Composer 2: Smaller, Cheaper Coding-Specific Model Targeting Claude Opus Performance

Cursor is launching Composer 2, a coding-specific AI model trained solely on programming data. The smaller, cheaper model is rumored to approach Claude Opus 4.6 performance, intensifying competition in the coding agent space.

Mar 19, 2026 | 85% relevant

Stanford/CMU Study: AI Agent Benchmarks Focus on 7.6% of Jobs, Ignoring Management, Legal, and Interpersonal Work

Researchers analyzed 43 AI benchmarks against 72,000+ real job tasks and found they overwhelmingly test programming/math skills, which represent only 7.6% of actual economic work. Management, legal, and interpersonal tasks—which dominate the labor market—are almost entirely absent from evaluation.

Mar 16, 2026 | 85% relevant

Meta's Breakthrough: Forcing AI to Show Its Work Slashes Coding Errors by 90%

Meta researchers discovered that requiring large language models to display step-by-step reasoning with proof verification dramatically reduces code patch error rates. This 'show your work' approach could transform how AI systems handle complex programming tasks.

Mar 8, 2026 | 85% relevant

From Code to Cognition: How AI is Redefining the Programmer's Journey

Former Google CEO Eric Schmidt reflects on how AI has fundamentally transformed programming, rendering decades of specialized coding skills accessible to anyone with a smartphone. His personal journey from dedicated programmer to witnessing AI's democratization of development highlights a seismic shift in technology education and professional pathways.

Mar 7, 2026 | 85% relevant

Alibaba's OpenSandbox Aims to Standardize AI Agent Execution with Open-Source Security

Alibaba has open-sourced OpenSandbox, a production-grade environment providing secure, isolated execution for AI agents. Released under Apache 2.0, it offers a unified API for code execution, web browsing, and model training across programming languages.

Mar 3, 2026 | 75% relevant

Alibaba's OpenSandbox: The Free Infrastructure Revolution for AI Agents

Alibaba has open-sourced OpenSandbox, a production-grade sandbox environment for AI agents that provides secure code execution, web browsing, and model training capabilities with unified APIs across multiple programming languages.

Mar 2, 2026 | 95% relevant

Moonlake's Reverie Engine: The AI-Powered Game Development Revolution Begins

Moonlake has launched the first programmable world model for real-time interactive content, powered by the Reverie real-time diffusion engine. This breakthrough could democratize game development by enabling creators without traditional programming skills to build immersive experiences.

Feb 25, 2026 | 85% relevant

The AI Espionage Frontier: Anthropic Exposes Systematic Claude Data Extraction by Chinese AI Labs

Anthropic has revealed that Chinese AI companies DeepSeek, Moonshot, and MiniMax allegedly used 24,000 fake accounts to execute 16 million queries against Claude's API, systematically extracting its capabilities through model distillation techniques. This sophisticated operation bypassed access restrictions and targeted Claude's reasoning, programming, and tool usage functions.

Feb 23, 2026 | 80% relevant

Visual-Explainer Agent Skill Replaces ASCII Diagrams for Code

A developer showcased 'visual-explainer,' an installable agent skill that creates diagrams from code. This targets a specific pain point in AI-assisted programming by replacing manual ASCII diagrams with automated visuals.

Apr 6, 2026 | 93% relevant

Microsoft's Study Proves Claude Code Boosts PR Output by 24%

Claude Code users merge 24% more PRs per Microsoft's study. Drive adoption via peer visibility, not mandates. Retention correlates with coding activity, not demographics.

Jul 13, 2026 | 70% relevant

Soofi S 30B-A3B: German open model tops English, German benchmarks

German consortium releases Soofi S 30B-A3B, an open MoE model beating OLMo 3 and Apertus 70B on English and German benchmarks while activating only 3.2B of 31.6B parameters.

Jul 13, 2026 | 100% relevant

OpenAI Finds 30% of SWE-Bench Pro Tasks Are Broken, Pulls Endorsement

OpenAI finds ~30% of SWE-Bench Pro tasks broken, pulls endorsement. Human reviewers flagged 249 flawed tasks.

Jul 9, 2026 | 95% relevant

Simplexity Ships 100 Robots 11 Months After Founding

Simplexity Robotics shipped 100 i7 Pro robots 11 months after founding. Claims all-scenario capability without reconfiguration, but lacks third-party validation.

Jul 7, 2026 | 88% relevant

Mistral's Leanstral 1.5 hits 100% on miniF2F, finds 5 real bugs

Mistral's Leanstral 1.5 scores 100% on miniF2F, solves 587 Putnam problems, and finds 5 real bugs in open-source code.

Jul 4, 2026 | 100% relevant

Zhipu AI Launches Claude Code Clone 'ZCode' with GLM-5.2

Zhipu AI launched ZCode, a Claude Code clone for GLM-5.2, with three pricing tiers and remote execution via WeChat.

Jul 1, 2026 | 83% relevant

Claude Code Steganography Flagged Chinese Users; Anthropic Rolls Back

Anthropic's Claude Code 2.1.91 used steganography to detect Chinese users. After Reddit exposure, Anthropic rolled back the feature, calling it an experiment against model distillation.

Jul 1, 2026 | 100% relevant

Qualcomm in Talks to Acquire Modular for $4B, Landing Lattner

Qualcomm nears $4B acquisition of Modular, Chris Lattner's AI infra startup. Deal targets inference software for edge and data center AI chips.

Jun 22, 2026 | 82% relevant

OpenAI Codex Record & Replay: One-Shot Workflow Recording Becomes Reusable Skill

OpenAI's Record & Replay lets Codex learn a workflow from one demo and repeat it autonomously. The feature is blocked in the EU, UK, and Switzerland.

Jun 20, 2026 | 94% relevant

Moonshot AI, State Bank Launch First AI-Native Credit Card in China

Moonshot AI's Kimi launches what it calls the world's first AI-native credit card with a state-owned bank, converting spending into compute credits.

Jun 13, 2026 | 90% relevant

GitHub Copilot App Revealed via Leaked Screenshot

A leaked screenshot reveals a GitHub Copilot mobile app, suggesting Microsoft is expanding AI coding to phones. There is no official confirmation or release date.

Jun 2, 2026 | 100% relevant
