Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

code generation

30 articles about code generation in AI news

ReCUBE Benchmark Reveals GPT-5 Scores Only 37.6% on Repository-Level Code Generation

Researchers introduce ReCUBE, a benchmark isolating LLMs' ability to use repository-wide context for code generation. GPT-5 achieves just a 37.57% strict pass rate, showing the task remains highly challenging.

96% relevant

China's DeepSeek-R1: Open-Source AI Agent Runs Locally with Web Search, Code Generation, and Built-In Computer

Chinese AI company DeepSeek has released DeepSeek-R1, a fully open-source AI agent that runs locally on personal computers with web search capabilities, code generation, and built-in computer functionality. The model represents a significant move toward accessible, self-contained AI systems outside the dominant U.S. ecosystem.

99% relevant

Cursor Composer2 Launches on Fireworks AI Platform, Adds RL to Code Generation Stack

Cursor Composer2, the next iteration of Cursor's AI-powered code generation system, is now available via the Fireworks AI platform. This release introduces reinforcement learning (RL) components alongside standard inference, expanding the technical approach beyond the initial version.

85% relevant

Very Rubin Platform Launches: AI-Powered Code Generation and Debugging Tool

Very Rubin, a new AI platform for software development, has launched. It offers real-time code generation, debugging, and optimization through a browser-based interface.

85% relevant

WiseTech Cuts 2,000 Engineers, Citing AI Code Generation as Primary Driver

Logistics software giant WiseTech has laid off 2,000 engineers, stating AI now writes the code. This move highlights a strategic pivot where knowing what to build is becoming the core skill, not writing the code itself.

85% relevant

Meta's New AI Checklist Forces Models to Show Their Work, Revolutionizing Code Generation

Meta researchers have developed a mandatory checklist system that requires AI models to trace code execution line-by-line rather than making blind guesses. This breakthrough addresses fundamental reliability issues in AI-generated code by enforcing step-by-step reasoning.

85% relevant

ByteDance's CUDA Agent: The AI System Outperforming Human Experts in GPU Code Generation

ByteDance has unveiled CUDA Agent, a large-scale reinforcement learning system that generates high-performance CUDA kernels. The system achieves state-of-the-art results, outperforming torch.compile by up to 100% and beating leading AI models like Claude Opus 4.5 and Gemini 3 Pro by approximately 40% on the most challenging tasks.

95% relevant

DeepSeek-R1 Scores 79.8% on SWE-Bench Verified, Matching Claude 3.5 Sonnet in Code Generation

DeepSeek's new R1 reasoning model achieved 79.8% on SWE-Bench Verified, matching Claude 3.5 Sonnet's performance. This marks significant progress in AI's ability to solve real-world coding problems.

85% relevant

Ollama Now Runs Codex Locally: DeepSeek V4, Gemma 4, Qwen 3.6 Supported

Ollama integrates Codex support for DeepSeek V4, Gemma 4, Qwen 3.6, enabling free local code generation, challenging OpenAI's API model.

83% relevant

Paper Details Full-Stack MFM Acceleration: Quant, Spec Decode, HW Co-Design

A research paper details a full-stack approach for accelerating multimodal foundation models, combining hierarchy-aware mixed-precision quantization, structural pruning, speculative decoding, model cascading, and a specialized hardware accelerator. Demonstrated on medical and code generation tasks.

72% relevant

OpenAI Expands Codex into Desktop Agent with Vision & Memory

OpenAI has reportedly expanded its Codex model beyond code generation into a multimodal desktop agent that can see, click, type, and learn user habits. This signals a strategic move from an API tool into a proactive, personalized AI assistant.

85% relevant

The 3,167-Line Function: What Claude Code's Leaked Source Teaches Us About

Claude Code's leaked source exposes the practical risks of over-reliance on AI for code generation, highlighting a critical need for human-led refactoring and architectural guardrails.

100% relevant

OpenAI Codex Weekly Users Hit 3M, Up 50% in Under a Month

Weekly active users of OpenAI's Codex have grown from 2 million to 3 million in under a month. This 50% surge indicates accelerating enterprise integration of AI-powered code generation.

85% relevant

Sam Altman Envisions Codex Desktop Evolving into Unified AI Agent Controlling Computers

Sam Altman discussed the Codex Desktop ecosystem evolving toward a unified AI agent that can control computers, access user data, and work across multiple surfaces. This vision points toward AI systems moving beyond code generation to become proactive, cross-platform assistants.

89% relevant

OpenAI Expands Codex Plugin Ecosystem to Slack, Figma, Notion, and Gmail

OpenAI has rolled out new plugins connecting its Codex model to productivity tools like Slack, Figma, Notion, and Gmail, moving code generation beyond the IDE into broader workflows.

87% relevant

Stop Asking Claude Code to Write Code—Start Asking It to Read Your Codebase

Experienced users report the biggest productivity gain isn't code generation—it's using Claude as a superhuman codebase navigator that can trace data flows and debug across files instantly.

95% relevant

Beyond Prompt Engineering: Claude Code Emerges as a Comprehensive AI Development Platform

Anthropic's Claude Code represents a paradigm shift from simple prompt tools to full AI engineering systems, offering integrated development environments, automated workflows, and sophisticated code generation capabilities that transform how developers build software.

85% relevant

Claude Octopus: GitHub Tool Enables Claude Code to Run Gemini and Codex Simultaneously

A developer discovered Claude Octopus, a GitHub repository that allows Anthropic's Claude Code to execute prompts across Google's Gemini and OpenAI's Codex models concurrently. The tool appears to enable parallel code generation from multiple AI assistants.

89% relevant

Claude Opus 4.7 Builds AlphaZero-Style Self-Play on Consumer Hardware

Claude Opus 4.7 built AlphaZero self-play from scratch on consumer hardware in three hours, showing autonomous algorithmic code generation.

100% relevant

MiniMax M2.7 Used by AtomicBot to Generate Flappy Bird Clone

A developer used the open-source MiniMax M2.7 frontier model to generate a complete, playable desktop game from a text prompt. This demonstrates practical code generation for creative applications.

75% relevant

Developer Builds LLM Wiki 'Second Brain' for AI Coding Agents

A developer built an 'LLM Wiki' that feeds an AI coding agent's context window with a living knowledge base of a specific codebase. This aims to solve the agent's short-term memory problem, leading to more consistent and informed code generation.

87% relevant

CLAUDE.md Promises 63% Reduction in Claude Output Tokens with Drop-in Prompt File

A new prompt engineering file called CLAUDE.md claims to reduce Claude's output token usage by 63% without code changes. The drop-in file aims to make Claude's code generation more efficient by structuring its responses.

87% relevant

CMU Research Identifies 'Biggest Unlock' for Coding Agents: Strategic Test Execution

New research from Carnegie Mellon University suggests the key advancement for AI coding agents lies not in raw code generation, but in developing strategies for how to run and interpret tests. This shifts focus from LLM capability to agentic reasoning.

87% relevant

Amazon's Reinforcement Fine-Tuning Revolution: How Nova Models Learn Through Feedback, Not Imitation

Amazon introduces reinforcement fine-tuning for its Nova AI models, shifting from imitation-based learning to evaluation-driven training. This approach enables enterprises to customize models using feedback signals rather than just examples, with applications from code generation to customer service.

75% relevant

Cisco Launches DevNet MCP Server: Bring API Docs Directly to Your AI Coding Assistant

Cisco's new DevNet Content Search MCP Server integrates Cisco's API documentation directly into Claude Code and other MCP-compatible IDEs, providing context-aware code generation without browser switching.

91% relevant

Pinterest Builds Dedicated Conversion Candidate Generation Model

Pinterest details the design and deployment of a dedicated shopping conversion candidate generation model, replacing engagement-based retrieval. Key innovations include a parallel DCN v2 and MLP architecture (+11% recall) and a unified multi-task approach that boosted conversion recall by +42% over their 2023 model.

100% relevant

NVIDIA Lyra 2.0 Launches on Hugging Face for Persistent 3D World Generation

NVIDIA has released Lyra 2.0 on Hugging Face, a framework designed to generate persistent, explorable 3D worlds at scale. It specifically addresses the core technical challenges of spatial forgetting and temporal drifting in long-horizon video generation.

95% relevant

Uni-ViGU Unifies Video Generation & Understanding in Single Diffusion Model

A new paper introduces Uni-ViGU, a unified model that performs video generation and understanding within a single diffusion process via flow matching. This inverts the standard approach of separate models for each task.

85% relevant

Developer Open-Sources 'Prompt-to-3D' Tool for Instant, Navigable World Generation

A developer has released an open-source tool that creates interactive 3D worlds from text or image inputs. This moves 3D asset generation from static models to instant, explorable environments.

91% relevant

ByteDance's Helios: A 14B Parameter Video Generation Model Running at 19.5 FPS on a Single H100 GPU

ByteDance has introduced Helios, a 14-billion parameter video generation model that reportedly runs at 19.5 frames per second on a single NVIDIA H100 GPU. This represents a significant step in making high-quality, real-time video synthesis more computationally accessible.

95% relevant