Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

programming

30 articles about programming in AI news

Claude Code Gains Auto-Memory: A Game-Changer for AI-Assisted Programming

Anthropic's Claude Code now features auto-memory capabilities, allowing the AI to retain context across coding sessions. This breakthrough addresses a fundamental limitation in AI programming assistants by creating persistent memory of project details, preferences, and patterns.

85% relevant

AI Agents Cross the Reliability Threshold: Karpathy Declares Programming Fundamentally Transformed

Former OpenAI researcher Andrej Karpathy declares programming has become "unrecognizable" as AI agents now reliably complete complex tasks in minutes rather than days. This fundamental shift occurred in late 2026 when agents achieved unprecedented reliability through improved model quality and task persistence.

75% relevant

The End of Software Gatekeepers: How Natural Language Programming is Democratizing Development

AI is transforming software from a scarce resource controlled by technical elites to an abundant commodity accessible through natural language. This shift mirrors historical democratizations in broadcasting and content creation, fundamentally changing who can build technology.

85% relevant

Karpathy's Autonomous AI Researcher: Programming the Programmer in the Age of Agentic Science

Andrej Karpathy has open-sourced an autonomous AI research agent that can run ~100 experiments overnight without human supervision. The system turns research into a game with fixed-time trials, where prompt engineering replaces manual coding.

95% relevant

Jensen Huang Declares AI Has Democratized Programming Through 'Vibe Coding'

NVIDIA CEO Jensen Huang claims AI has eliminated the technology divide, enabling anyone to become a software programmer through 'vibe coding.' He cites examples of individuals creating million-dollar businesses using these new AI-powered development tools.

85% relevant

Anthropic's Claude Code Gets Voice Mode: The Next Frontier in AI-Assisted Programming

Anthropic has introduced voice mode for Claude Code, allowing developers to interact with the AI coding assistant through natural speech. This marks a significant evolution in how programmers can collaborate with AI tools, potentially transforming development workflows.

85% relevant

OpenSage: The Dawn of Self-Programming AI Agents That Build Their Own Teams

OpenSage introduces the first agent development kit enabling LLMs to autonomously create AI agents with self-generated architectures, toolkits, and memory systems, potentially revolutionizing how AI systems are designed and deployed.

75% relevant

Open-Source 'Claude Code' Dev Setup Replicates Anthropic Engineer's Workflow

A developer has reverse-engineered and published the complete Claude Code development setup used by Anthropic engineer Boris Cherny. The project is available for free on GitHub, offering a window into high-level AI-assisted programming practices.

77% relevant

OpenAI Publishes Codex Use-Case Gallery with Practical Examples for Developers

OpenAI has released a public gallery of practical examples demonstrating how to use its Codex model for real-world programming tasks. The resource provides concrete prompts and outputs for developers building with the API.

85% relevant

Cursor Announces Composer 2: Smaller, Cheaper Coding-Specific Model Targeting Claude Opus Performance

Cursor is launching Composer 2, a coding-specific AI model trained solely on programming data. The smaller, cheaper model is rumored to approach Claude Opus 4.6 performance, intensifying competition in the coding agent space.

85% relevant

Stanford/CMU Study: AI Agent Benchmarks Focus on 7.6% of Jobs, Ignoring Management, Legal, and Interpersonal Work

Researchers analyzed 43 AI benchmarks against 72,000+ real job tasks and found they overwhelmingly test programming/math skills, which represent only 7.6% of actual economic work. Management, legal, and interpersonal tasks—which dominate the labor market—are almost entirely absent from evaluation.

85% relevant

Meta's Breakthrough: Forcing AI to Show Its Work Slashes Coding Errors by 90%

Meta researchers discovered that requiring large language models to display step-by-step reasoning with proof verification dramatically reduces code patch error rates. This 'show your work' approach could transform how AI systems handle complex programming tasks.

85% relevant

From Code to Cognition: How AI is Redefining the Programmer's Journey

Former Google CEO Eric Schmidt reflects on how AI has fundamentally transformed programming, rendering decades of specialized coding skills accessible to anyone with a smartphone. His personal journey from dedicated programmer to witnessing AI's democratization of development highlights a seismic shift in technology education and professional pathways.

85% relevant

Alibaba's OpenSandbox Aims to Standardize AI Agent Execution with Open-Source Security

Alibaba has open-sourced OpenSandbox, a production-grade environment providing secure, isolated execution for AI agents. Released under Apache 2.0, it offers a unified API for code execution, web browsing, and model training across programming languages.

75% relevant

Alibaba's OpenSandbox: The Free Infrastructure Revolution for AI Agents

Alibaba has open-sourced OpenSandbox, a production-grade sandbox environment for AI agents that provides secure code execution, web browsing, and model training capabilities with unified APIs across multiple programming languages.

95% relevant

Moonlake's Reverie Engine: The AI-Powered Game Development Revolution Begins

Moonlake has launched the first programmable world model for real-time interactive content, powered by the Reverie real-time diffusion engine. This breakthrough could democratize game development by enabling creators without traditional programming skills to build immersive experiences.

85% relevant

The AI Espionage Frontier: Anthropic Exposes Systematic Claude Data Extraction by Chinese AI Labs

Anthropic has revealed that Chinese AI companies DeepSeek, Moonshot, and MiniMax allegedly used 24,000 fake accounts to execute 16 million queries against Claude's API, systematically extracting its capabilities through model distillation techniques. This sophisticated operation bypassed access restrictions and targeted Claude's reasoning, programming, and tool usage functions.

80% relevant

Visual-Explainer Agent Skill Replaces ASCII Diagrams for Code

A developer showcased 'visual-explainer,' an installable agent skill that creates diagrams from code. This targets a specific pain point in AI-assisted programming by replacing manual ASCII diagrams with automated visuals.

93% relevant

No Rigorous Productivity Tests Exist for Post-2025 Autonomous Coding Tools

No productivity studies exist for autonomous coding tools launched December 2025. All research predates the Claude Code/Codex revolution, creating a major knowledge gap.

72% relevant

Jensen Huang Wants Zero Coding at NVIDIA — 'Purpose vs Task'

Jensen Huang wants zero coding by NVIDIA engineers, framing it as a task to minimize. The bet is AI-generated code will match human output for performance-critical software.

77% relevant

Boston Dynamics Atlas Lifts 100-lb Fridge via RL

Boston Dynamics showed Atlas lifting a 100+ lb mini-fridge via RL, moving from locomotion to practical manipulation.

85% relevant

Ollama Now Runs Codex Locally: DeepSeek V4, Gemma 4, Qwen 3.6 Supported

Ollama integrates Codex support for DeepSeek V4, Gemma 4, Qwen 3.6, enabling free local code generation, challenging OpenAI's API model.

83% relevant

Claude Code's File-Deletion Track Record Spurs Community Safety Guide

Community safety guide documents three Claude Code file-deletion incidents since October 2025 and prescribes three defense layers. Anthropic's sandboxing remains opt-in.

100% relevant

Codex 'Ultra-Fast' Mode Spotted in Leaked Screenshot

Leaked screenshot suggests OpenAI is adding an ultra-fast latency mode to Codex. No release date or pricing confirmed.

89% relevant

Agentic AI's Real Win: Automating Bank Grunt Work, Not Flashy Demos

Agentic AI's sweet spot is automating banking grunt work, cutting processing time by 70%. Google Cloud leads enterprise deployments; the value is cost savings, not flashy demos.

92% relevant

Pyptx: Write Nvidia PTX Kernels in Python for Hopper and Blackwell

Pyptx lets developers write and launch hand-tuned Nvidia PTX kernels directly from Python, supporting Hopper (sm_90a) and Blackwell (sm_100a). It provides explicit control over registers, shared memory, and advanced features like WGMMA and TMA, with dispatch through JAX, PyTorch eager, and torch.compile.

91% relevant

Use Claude Code to Automate Systematic Literature Reviews

Claude Code can automate systematic literature reviews: scrape papers, extract key themes, and generate structured summaries — all from the terminal.

100% relevant

GPT-5.5 Pro Sustains 2-Hour Bug Fixing Sessions

A user reports GPT-5.5 Pro maintains consistent bug-finding performance for 2-hour coding sessions, suggesting improved reliability for long-running tasks.

85% relevant

Nvidia Invests $2B in Marvell for NVLink Fusion Interconnect

Nvidia is investing $2 billion in Marvell Technology to deepen their partnership on NVLink Fusion, a new interconnect architecture for scaling AI clusters beyond current limits.

100% relevant

Agent Harnessing: The Infrastructure That Makes AI Agents Work

A detailed technical guide argues that the model is not the hard part of building AI agents. The six-component harness — context management, memory, tools, control flow, verification, and coordination — is what separates production-grade agents from those that fail silently.

88% relevant