Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

chip design

30 articles about chip design in AI news

InCoder-32B-Thinking Hits 81.3% on LiveCodeBench, Trained on Chip & Kernel Traces

InCoder-32B-Thinking, a 32B parameter model trained on execution traces from chip design, GPU kernels, and embedded systems, scores 81.3% on LiveCodeBench V5 and an 84% compile pass rate on CAD-Coder.

92% relevant

Jensen Huang's AI Productivity Mandate: Engineers Must Spend 50% of Salary on AI Tokens

NVIDIA CEO Jensen Huang argues that a $500K engineer should spend at least $250K annually on AI inference tokens, framing token consumption as essential as CAD tools for chip design. He claims this investment eliminates perceptions of difficulty, time, and resource constraints in development.

85% relevant

Google TPU 'Broadfly' Topology Scales Pod to 1,152 Chips

Google unveiled a Broadfly TPU topology at Cloud Next, scaling pods to 1,152 chips — 4.5x larger than Ironwood — with max 7 hops. This inference-first design challenges NVIDIA's NVLink on scale and latency.

94% relevant

Inference shift opens door for AI chip startups to challenge Nvidia

Inference shift from training to serving creates opportunities for AI chip startups. Nvidia's $20B Groq acquihire validates disaggregated compute strategies.

96% relevant

The $500B AI Chip Bottleneck: One Material, One Supplier

A single Japanese chemical company supplies 98% of the thin-film material used in every AI chip on earth. NVIDIA is paying half the capex to expand supplier fabs as lead times stretch past 6 months.

90% relevant

Google's Virgo Network Links 134,000 TPU v8 Chips with 47 Pbps Fabric

Google unveiled its Virgo networking stack for TPU v8, capable of linking 134,000 chips in a single fabric with 47 petabits/sec of bi-sectional bandwidth. This represents a massive scale-up in interconnect technology for large-scale AI model training.

100% relevant

VMLOps Publishes NLP Engineer System Design Interview Guide

VMLOps has published 'The NLP Engineer's System Design Interview Guide,' a detailed resource covering architecture, scaling, and trade-offs for real-world NLP systems. It provides a structured framework for both interviewers and candidates.

75% relevant

Google, Marvell in Talks to Co-Develop New AI Chips, Including TPU-Optimized MPU

Google is reportedly in talks with Marvell Technology to co-develop two new AI chips: a memory processing unit (MPU) to pair with TPUs and a new, optimized TPU. This move is a direct effort to bolster Google's custom silicon stack and compete with Nvidia's dominance.

95% relevant

Nvidia Invests $2B in Marvell to Expand NVLink Fusion Chip Partnership

Nvidia is investing $2 billion in Marvell Technology to deepen their partnership on NVLink Fusion, a chip-to-chip interconnect crucial for scaling AI training clusters. This strategic move aims to secure supply and accelerate development of high-bandwidth links between GPUs and custom AI accelerators.

84% relevant

TSMC's $56B 2026 CapEx Fuels AI Chip Race with 22 New Fabs

TSMC is constructing up to 22 advanced semiconductor fabs simultaneously, backed by a $52–56 billion capital expenditure plan for 2026. This unprecedented manufacturing scale is critical for producing the 2nm-and-below chips required by next-generation AI models.

85% relevant

Canada's AI Compute Gap: Google Cloud Montreal Offers 2017-Era Chips

A technical developer's attempt to rent modern AI compute in Canada revealed a stark infrastructure gap, with major providers offering chips as old as 2017, undermining national AI ambitions.

85% relevant

Anthropic Considers Custom AI Chips, Following Google & OpenAI

Anthropic is reportedly considering developing custom AI chips, a strategic move to gain control over its compute infrastructure and reduce costs. This follows similar initiatives by Google, Amazon, and OpenAI.

85% relevant

Broadcom to Manufacture Google TPU Chips in Foundry Partnership

Google has licensed its Tensor Processing Unit (TPU) intellectual property to Broadcom for chip fabrication. This allows Google to earn from its IP while Broadcom manages the complex hardware build and networking integration.

85% relevant

DeepSeek V4 to Run on Huawei Ascend 950PR Chips, Sparking 20% Price Surge

DeepSeek's anticipated V4 model will be powered by Huawei's Ascend 950PR chips, with Alibaba, ByteDance, and Tencent stockpiling hundreds of thousands of units ahead of launch. This has driven chip prices up approximately 20% in recent weeks.

91% relevant

Elon Musk Says Global Chip Fabs Supply Only 2% of Tesla's AI Compute Needs, Driving Terafab Build

Elon Musk stated current global chip fabrication capacity can supply only about 2% of Tesla's AI compute requirements, necessitating the construction of a 'terafab' even if suppliers expand.

87% relevant

AWS Commits 2 Gigawatts of Trainium Capacity to OpenAI, Reveals 1.4 Million Chips Deployed

Amazon's $50B OpenAI deal includes a 2-gigawatt commitment of Trainium computing capacity. AWS disclosed 1.4 million Trainium chips are deployed, with over 1 million Trainium2 chips running Anthropic's Claude.

95% relevant

Nvidia's Jensen Huang Dismisses Custom AI Chip Threat: 'Science Projects' Versus 'AI Factories'

Nvidia CEO Jensen Huang confidently dismissed concerns about custom AI chips challenging Nvidia's dominance, framing competitors' efforts as 'science projects' while Nvidia builds revenue-generating 'AI factories' with a complete platform approach.

85% relevant

Nvidia's Groq Ramps Up AI Chip Production with Samsung in Major Partnership Expansion

Nvidia's recent acquisition Groq has significantly expanded its partnership with Samsung, increasing chip orders from 9,000 to 30,000 wafers. This massive production boost signals accelerated development of Groq's specialized AI inference processors amid growing market demand.

85% relevant

Nvidia's $2B Nebius Bet: Chip Giant Doubles Down on AI Infrastructure Empire

Nvidia will invest $2 billion in AI cloud specialist Nebius Group NV, expanding its strategic investments in companies that build data centers using its chips. The partnership aims to deploy over 5 gigawatts of AI-optimized data center capacity by 2030, equivalent to powering 4 million U.S. households.

81% relevant

Nvidia's Strategic Shift: Merging Groq Hardware in New AI Chip Targeting OpenAI

Nvidia is reportedly developing a new AI chip that combines its GPU technology with hardware from Groq, with OpenAI potentially becoming a major customer. This move signals Nvidia's recognition of specialized AI hardware beyond traditional GPUs.

95% relevant

AI's Insatiable Appetite: Nvidia's Rubin Chip Demands 288GB Memory, Sparking Global Shortage Crisis

Nvidia's upcoming Rubin AI chip requires 288GB of RAM—800% more than top desktop computers—creating unprecedented memory demand. Massive purchases by OpenAI and Alphabet have depleted supply, driving DDR4 prices up 2352% and causing a global memory chip shortage.

95% relevant

The Invisible Dance: How AI Chip Manufacturing Relies on Microscopic Wire Bonding

High-speed semiconductor wire bonding creates thousands of electrical connections per minute using ultra-fine 25-micron wires. This critical but often overlooked process enables the AI chips powering today's most advanced systems.

85% relevant

Google's $1.9 Trillion Vertical Integration Strategy: Building an AI Empire from Chips to Power Grid

Google is investing $1.9 trillion over the next decade to control every layer of the AI stack, from custom TPU chips to power infrastructure. This vertical integration strategy creates a competitive moat that could reshape the entire AI industry landscape.

95% relevant

China's Semiconductor Leaders Rally for National AI Chip Alliance Amid Tech War Escalation

China's top semiconductor executives have issued an unprecedented public call for a consolidated national effort to build AI chips, signaling a strategic shift toward self-reliance as U.S. export controls tighten. This coordinated push represents China's most direct response yet to technological containment efforts.

85% relevant

Broadcom's $100 Billion AI Chip Forecast Signals Industry Transformation

Broadcom CEO Hock Tan projects the company's AI chip sales will exceed $100 billion in 2027, challenging Nvidia's dominance. This forecast comes alongside Anthropic resuming Pentagon talks about military AI applications.

75% relevant

SEval-NAS: The Flexible Framework That Could Revolutionize Hardware-Aware AI Design

Researchers propose SEval-NAS, a search-agnostic evaluation method that decouples metric calculation from the Neural Architecture Search process. This allows AI developers to easily introduce new performance criteria, especially for hardware-constrained devices, without redesigning their entire search algorithms.

75% relevant

DeepSeek's Blackwell Training Exposes Critical Gaps in US Chip Export Controls

Chinese AI startup DeepSeek reportedly trained its latest model on Nvidia's restricted Blackwell chips, challenging US export controls. The development reveals significant loopholes in semiconductor restrictions amid escalating AI competition.

90% relevant

Meta's $100B AMD Gamble: The AI Chip War Enters Its Most Strategic Phase

Meta has secured a landmark deal to purchase up to $100 billion worth of AMD AI chips, receiving a massive stock warrant in return. This unprecedented agreement signals Meta's aggressive push to diversify its AI infrastructure beyond Nvidia while pursuing ambitious 'personal superintelligence' goals.

90% relevant

MatX Secures $500M War Chest to Challenge Nvidia's AI Chip Dominance

AI chip startup MatX, founded by ex-Google semiconductor engineers, has raised over $500 million to develop hardware that directly competes with Nvidia. This massive funding round signals growing investor confidence in alternatives to the current AI chip market leader.

80% relevant

DeepSeek's Blackwell Gambit: How a Chinese AI Firm Reportedly Circumvented U.S. Chip Export Controls

Chinese AI company DeepSeek reportedly trained its upcoming model using Nvidia's restricted Blackwell chips, potentially clustered in an Inner Mongolia data center. This development highlights the escalating tech rivalry and challenges of enforcing export controls in the AI arms race.

95% relevant