Skip to content
gentic.news — AI News Intelligence Platform

gpu programming

29 articles about gpu programming in AI news

Karpathy's 'Autoresearch' Tool Democratizes AI Research: One GPU, One Night, 100 Experiments

Andrej Karpathy has open-sourced 'autoresearch,' a tool that enables AI to autonomously improve its own training code. By writing simple prompts in Markdown, researchers can have AI agents run hundreds of experiments overnight on a single GPU, dramatically accelerating the research process.

95% relevant

Karpathy's Autonomous AI Researcher: Programming the Programmer in the Age of Agentic Science

Andrej Karpathy has open-sourced an autonomous AI research agent that can run ~100 experiments overnight without human supervision. The system turns research into a game with fixed-time trials, where prompt engineering replaces manual coding.

95% relevant

Jensen Huang Declares AI Has Democratized Programming Through 'Vibe Coding'

NVIDIA CEO Jensen Huang claims AI has eliminated the technology divide, enabling anyone to become a software programmer through 'vibe coding.' He cites examples of individuals creating million-dollar businesses using these new AI-powered development tools.

85% relevant

ByteDance's CUDA Agent: The AI System Outperforming Human Experts in GPU Code Generation

ByteDance has unveiled CUDA Agent, a large-scale reinforcement learning system that generates high-performance CUDA kernels. The system achieves state-of-the-art results, outperforming torch.compile by up to 100% and beating leading AI models like Claude Opus 4.5 and Gemini 3 Pro by approximately 40% on the most challenging tasks.

95% relevant

NVIDIA's cuQuantum-DGX OS Aims to Manage Hybrid Quantum-Classical Workflows

NVIDIA announced its AI software stack is evolving into an operating system for quantum computing, aiming to manage the complex workflow between quantum processors and classical GPUs. This targets a major integration bottleneck as quantum hardware scales.

85% relevant

InCoder-32B-Thinking Hits 81.3% on LiveCodeBench, Trained on Chip & Kernel Traces

InCoder-32B-Thinking, a 32B parameter model trained on execution traces from chip design, GPU kernels, and embedded systems, scores 81.3% on LiveCodeBench V5 and an 84% compile pass rate on CAD-Coder.

92% relevant

Karpathy's Autoresearch: Democratizing AI Experimentation with Minimalist Agentic Tools

Andrej Karpathy releases 'autoresearch,' a 630-line Python tool enabling AI agents to autonomously conduct machine learning experiments on single GPUs. This minimalist framework transforms how researchers approach iterative ML optimization.

85% relevant

Google's TensorFlow 2.21 Revolutionizes Edge AI with Unified LiteRT Framework

Google has launched TensorFlow 2.21, marking LiteRT's transition to a production-ready universal on-device inference framework. This major update delivers faster GPU performance, new NPU acceleration, and seamless PyTorch edge deployment, effectively replacing TensorFlow Lite for mobile and edge applications.

75% relevant

NVIDIA's SVG Benchmark Saturation Signals New Era in AI Graphics Performance

NVIDIA CEO Jensen Huang's presentation of the next RTX 6000 GPU series reveals that SVG benchmark performance has reached saturation, indicating a major milestone in AI-accelerated graphics rendering capabilities.

85% relevant

Cerebras' Strategic Partnership Yields Breakthrough AI Training Results

Cerebras Systems' partnership with Abu Dhabi's G42 has produced remarkable AI training benchmarks, achieving results 100x faster than traditional GPU clusters. The collaboration demonstrates the viability of wafer-scale computing for large language model development.

85% relevant

Pyptx: Write Nvidia PTX Kernels in Python for Hopper and Blackwell

Pyptx lets developers write and launch hand-tuned Nvidia PTX kernels directly from Python, supporting Hopper (sm_90a) and Blackwell (sm_100a). It provides explicit control over registers, shared memory, and advanced features like WGMMA and TMA, with dispatch through JAX, PyTorch eager, and torch.compile.

77% relevant

Nvidia Invests $2B in Marvell for NVLink Fusion Interconnect

Nvidia is investing $2 billion in Marvell Technology to deepen their partnership on NVLink Fusion, a new interconnect architecture for scaling AI clusters beyond current limits.

98% relevant

Developer Achieves 395x RTFx on M5 Max with Fastest Parakeet v3 for Apple ANE

Developer @mweinbach has optimized the Parakeet v3 speech recognition model for Apple's Neural Engine, achieving a 395x real-time factor on an M5 Max chip. This represents a significant performance leap for on-device AI inference on Apple Silicon.

87% relevant

AI-Powered PS4 Emulator 'Spine' Runs Bloodborne Locally on PC

A developer has released Spine, a PS4 emulator that uses AI techniques to run Bloodborne fully on PC. This represents a major step forward in console emulation, previously considered years away.

87% relevant

Claude Code Builds Browser-Based 3D Flight Simulator in Weekend

A developer used Anthropic's Claude Code to build a complete 3D flight simulator that runs in a web browser over a weekend, demonstrating rapid AI-assisted game development.

85% relevant

Claude Code Runs 100% Locally on Mac via Native 200-Line API Server

A developer created a 200-line server that speaks Anthropic's API natively, allowing Claude Code to run entirely locally on M-series Macs at 65 tokens/second with no cloud dependency.

100% relevant

MLX-Benchmark Suite Launches as First Comprehensive LLM Eval for Apple Silicon

The MLX-Benchmark Suite has been released as the first comprehensive evaluation framework for Large Language Models running on Apple's MLX framework. It provides standardized metrics for models optimized for Apple Silicon hardware.

85% relevant

Qwen 3.6 Released: Free, Open-Weights Model for Local AI Coding

Alibaba's Qwen team released Qwen 3.6, an open-weights AI model for local deployment. This provides a free, private alternative to ID-verified models like Anthropic's Mythos and OpenAI's Codex.

100% relevant

NVIDIA Ising AI OS Cuts Quantum Calibration from Days to Hours

NVIDIA launched Ising, an open-source AI model family that acts as an OS for quantum computers. It uses a vision language model to automate calibration and a 3D neural network for error correction, reducing calibration from days to hours.

95% relevant

Claude Code's 'Shallow Thinking' Problem

Enterprise users report Claude Code sometimes skips deep analysis on complex tasks. Use specific prompting techniques and session management to ensure thorough reasoning.

87% relevant

NVIDIA CEO Jensen Huang Declares All Future Software Will Be Agentic

NVIDIA CEO Jensen Huang stated that all future software will be agentic, meaning every software company must transform into an agentic company. This vision positions AI agents as the fundamental architecture for future computing.

87% relevant

OpenAI Codex Now Translates C++, CUDA, and Python to Swift and Python for CoreML Model Conversion

OpenAI's Codex AI code generator is now being used to automatically rewrite C++, CUDA, and Python code into Swift and Python specifically for CoreML model conversion, a previously manual and error-prone process for Apple ecosystem deployment.

89% relevant

AI Engineer Henry Ndubuaku Releases Open-Source 'Maths, CS & AI Compendium' Textbook

AI engineer Henry Ndubuaku has published a free, open-source textbook compiling mathematics, computer science, and AI concepts. The resource emphasizes intuitive understanding over notation and has reportedly helped users land roles at DeepMind, OpenAI, and Nvidia.

85% relevant

Open-Source 'Manus Alternative' Emerges: Fully Local AI Agent with Web Browsing, Code Execution, and Voice Input

An open-source project has been released that replicates core features of AI agent platforms like Manus—autonomous web browsing, multi-language code execution, and voice input—while running entirely locally on user hardware with no external API dependencies.

85% relevant

NVIDIA's GTC Park to Feature 'Build-a-Claw' Workshop for Creating AI Agents

NVIDIA is launching a hands-on 'Build-a-Claw' experience at GTC Park from March 16-19, allowing attendees to create their own long-running AI agents using the fast-growing OpenClaw open-source project.

89% relevant

Open-Source LLM Course Revolutionizes AI Education: Free GitHub Repository Challenges Paid Alternatives

A comprehensive GitHub repository called 'LLM Course' by Maxime Labonne provides complete, free training on large language models—from fundamentals to deployment—threatening the market for paid AI courses with its organized structure and practical notebooks.

89% relevant

The Hidden Economics of AI: How Anthropic's Massive Subsidies Are Reshaping the Coding Assistant Market

Internal research from Cursor reveals Anthropic is subsidizing Claude Code subscriptions at staggering rates—up to $5,000 in compute costs for a $200 monthly plan. This aggressive pricing strategy highlights the fierce competition in AI coding tools and raises questions about sustainable business models in the generative AI space.

85% relevant

AWS Becomes OpenAI's Exclusive Third-Party Cloud Partner in Landmark Deal

OpenAI and Amazon have announced a multi-year strategic partnership making AWS the exclusive third-party cloud provider for OpenAI Frontier. The deal includes 2 gigawatts of Trainium capacity and co-creation of a Stateful Runtime Environment on Amazon Bedrock.

90% relevant

Beyond Nvidia: How OpenAI's Cerebras-Powered Model Redefines AI Hardware Competition

OpenAI's GPT-5.3-Codex-Spark demonstrates real-time coding capabilities on Cerebras hardware, challenging Nvidia's dominance and signaling a new era of specialized AI infrastructure.

75% relevant