custom chips

30 articles about custom chips in AI news

Anthropic Considers Custom AI Chips, Following Google & OpenAI

Anthropic is reportedly considering developing custom AI chips, a strategic move to gain control over its compute infrastructure and reduce costs. This follows similar initiatives by Google, Amazon, and OpenAI.

Apr 11, 202685% relevant

Google Opens TPU Sales to Select Customers, Raises Capex Forecast

Google sells TPUs to select customers, raising capex forecast for Q1 FY2026, monetizing in-house chips beyond Cloud.

Apr 30, 2026100% relevant

Google, Marvell in Talks to Co-Develop New AI Chips, Including TPU-Optimized MPU

Google is reportedly in talks with Marvell Technology to co-develop two new AI chips: a memory processing unit (MPU) to pair with TPUs and a new, optimized TPU. This move is a direct effort to bolster Google's custom silicon stack and compete with Nvidia's dominance.

Apr 20, 202695% relevant

Nvidia's Jensen Huang Dismisses Custom AI Chip Threat: 'Science Projects' Versus 'AI Factories'

Nvidia CEO Jensen Huang confidently dismissed concerns about custom AI chips challenging Nvidia's dominance, framing competitors' efforts as 'science projects' while Nvidia builds revenue-generating 'AI factories' with a complete platform approach.

Mar 12, 202685% relevant

Google's $1.9 Trillion Vertical Integration Strategy: Building an AI Empire from Chips to Power Grid

Google is investing $1.9 trillion over the next decade to control every layer of the AI stack, from custom TPU chips to power infrastructure. This vertical integration strategy creates a competitive moat that could reshape the entire AI industry landscape.

Mar 8, 202695% relevant

Anthropic Explores Custom AI Chip with Samsung

Anthropic is discussing a custom AI chip with Samsung, per The Information. The move follows OpenAI's Jalapeño chip and signals growing vertical integration in AI hardware.

Jul 2, 202686% relevant

Amazon Opens Trainium Chips to Outside Data Centers, Targeting Nvidia's Core Business

AWS AI chief Peter DeSantis confirmed Amazon is negotiating to sell Trainium chips externally for the first time, backed by Andy Jassy's estimate of a $50B annual revenue potential. With Trainium3 sold out, Trainium4 pre-booked, and Anthropic and OpenAI already running gigawatts of Trainium capacity

Jun 18, 2026100% relevant

Qualcomm Launches AI Data Center Program With Hyperscaler Customer

Qualcomm launched an AI data center program with a major hyperscaler customer, targeting inference workloads. Financial terms and partner identity undisclosed.

Jun 17, 202685% relevant

Google’s Virgo network interconnects 134K TPUv8t chips at 47 Pbps

Google's Virgo network interconnects 134,400 TPUv8t chips at 47 Pbps, targeting large-scale training clusters.

Jun 3, 2026100% relevant

Qualcomm Ships Hyperscaler Custom Silicon by December 2026

Qualcomm is developing custom silicon for an unnamed hyperscaler, with shipments expected December 2026, marking its most concrete data-center comeback move.

May 1, 202676% relevant

SemiAnalysis: NVIDIA's Customer Data Drives Disaggregated Inference, LPU Surpasses GPU

SemiAnalysis states NVIDIA's direct customer feedback is leading the industry toward disaggregated inference architectures. In this model, specialized LPUs can outperform GPUs for specific pipeline tasks.

Apr 22, 202685% relevant

Google's Virgo Network Links 134,000 TPU v8 Chips with 47 Pbps Fabric

Google unveiled its Virgo networking stack for TPU v8, capable of linking 134,000 chips in a single fabric with 47 petabits/sec of bi-sectional bandwidth. This represents a massive scale-up in interconnect technology for large-scale AI model training.

Apr 22, 2026100% relevant

Broadcom to Manufacture Google TPU Chips in Foundry Partnership

Google has licensed its Tensor Processing Unit (TPU) intellectual property to Broadcom for chip fabrication. This allows Google to earn from its IP while Broadcom manages the complex hardware build and networking integration.

Apr 8, 202685% relevant

Apple's Neural Engine Jailbroken: Researchers Unlock Full Training Capabilities on M-Series Chips

Security researchers have reverse-engineered Apple's Neural Engine, bypassing private APIs to enable full neural network training directly on ANE hardware. This breakthrough unlocks 15.8 TFLOPS of compute previously restricted to inference-only operations across all M-series devices.

Mar 5, 202695% relevant

Google Splits TPU Line: 8t for Training, 8i for Inference

At Cloud Next 2026, Google introduced two new AI chips — TPU 8t for training and TPU 8i for inference — splitting its custom silicon for the first time. OpenAI, Anthropic, and Meta are buying multi-gigawatt TPU capacity, signaling a crack in NVIDIA's 81% market share.

Apr 27, 2026100% relevant

AWS CEO: All Latest Anthropic Models Trained on Amazon Trainium

Amazon Web Services CEO Matt Garman stated that all of Anthropic's latest AI models are trained on AWS's custom Trainium chips. This confirms the deepening technical and strategic integration between the AI lab and its primary cloud investor.

Apr 9, 202689% relevant

Amazon’s Alexa Now Shows 365-Day Price History for Shopping

Amazon expanded Alexa for Shopping to show 30, 90, and 365 days of price history. Over 50 million customers have used the feature since 2024, enhancing deal confidence.

Jun 30, 202678% relevant

Etched Hits $5B Valuation, $1B in Orders for AI Inference Chip

Etched hits $5B valuation with $1B in orders for TSMC-made inference chips, raising $500M from top investors. The startup targets Nvidia's dominance.

Jun 30, 2026100% relevant

Qualcomm Taps TSMC for 3nm/2nm Dragonfly C100 CPUs, AI300 Accelerators

TSMC to fab Qualcomm's Dragonfly C100 and AI300 chips on 3nm/2nm nodes. The move challenges NVIDIA in data center AI, but timelines and performance remain undisclosed.

Jun 26, 2026100% relevant

OpenAI, Broadcom Unveil Jalapeño ASIC for LLM Inference

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, targeting volume deployment by late 2026. No performance metrics were disclosed.

Jun 24, 2026100% relevant

Qualcomm in Talks to Acquire Modular for $4B, Landing Lattner

Qualcomm nears $4B acquisition of Modular, Chris Lattner's AI infra startup. Deal targets inference software for edge and data center AI chips.

Jun 22, 202682% relevant

Amazon Launches Generative AI Search Tool That Creates Real-Time Images

Amazon launched a generative AI search tool that creates real-time images from text descriptions to improve product discovery. This leverages Amazon Bedrock and Trainium chips, marking a shift toward AI-driven visual search in e-commerce.

Jun 21, 202672% relevant

Cerebras Claims Performance Parity With Nvidia H100 on AI Training

Cerebras claims wafer-scale chips match Nvidia H100 on AI training performance per watt, challenging Nvidia's dominance.

Jun 13, 202692% relevant

Cerebras Reengineers Mechanical Playbook for Wafer-Scale Chip Cooling

Cerebras disclosed three mechanical innovations—vertical power delivery, flexible interposers, and direct-impingement cooling—to prevent wafer-scale chips from cracking, rewriting engineering fundamentals.

Jun 4, 202688% relevant

ByteDance Builds In-House AI CPUs for TikTok-Scale Agent Inference

ByteDance builds custom AI CPUs for inference at TikTok scale, targeting scarce server supply. The move signals agent workload shift from training to inference hardware.

May 31, 202685% relevant

Nvidia Networking Revenue Hits $14.8B, Up 199% as AI Spending Shifts Beyond GPUs

Nvidia's Q1 FY2027 networking revenue surged 199% to $14.8B, signaling AI infrastructure spending is moving beyond GPUs into full-system networking. New reporting splits into Hyperscale and ACIE segments reflect a broadening customer base beyond hyperscalers.

May 21, 2026100% relevant

Buffett Invests in Google After SemiAnalysis TPU Deep Dive

Berkshire Hathaway invested in Google in Q3 2025, after Buffett studied TPU v5p architecture. He compared it to railroads, citing 8,960 chips and 4.8 Tbps links.

May 19, 202685% relevant

Cerebras IPO Challenges GPU Scaling Orthodoxy

Cerebras filed for IPO on April 21, betting wafer-scale chips can disrupt Nvidia's GPU cluster model for AI workloads.

May 14, 202698% relevant

Nvidia Invests $2B in Marvell to Deepen NVLink Fusion Tie-Up

Nvidia invested $2B in Marvell to deepen NVLink Fusion partnership, integrating Marvell custom silicon into AI interconnect fabric.

Apr 30, 202687% relevant

AWS Never Retired an A100 Server, CEO Says Amid Chip Shortage

AWS CEO Matt Garman stated that A100 servers are completely sold out and never retired, as demand for older chips outpaces supply. This underscores the prolonged GPU shortage and the value of legacy hardware in cloud AI.

Apr 26, 202687% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety