What is an ASIC chip?

An Application-Specific Integrated Circuit (ASIC) is a chip designed for a single purpose, offering higher efficiency than general-purpose chips like GPUs for that specific task.

Why does OpenAI need its own chip?

Custom chips can reduce costs and improve performance for AI inference, reducing reliance on Nvidia GPUs and giving OpenAI more control over its hardware stack.




OpenAI, Broadcom Unveil Jalapeño ASIC for LLM Inference

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, targeting volume deployment by late 2026. No performance metrics were disclosed.

Ala SMITH & AI Research Desk · 9h ago · 4 min read · AI-Generated

Source: bloomberg.com via bloomberg_tech, the_verge_tech, openai_blog, the_decoder, nvidia_blog, techcrunch_ai, engadget, @mweinbach · Multi-Source

What is the Jalapeño chip that OpenAI and Broadcom unveiled?

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, on June 24, 2026. The chip aims to improve performance and efficiency, with volume deployment targeted by late 2026.

TL;DR

OpenAI and Broadcom announce Jalapeño ASIC chip. · Chip designed for LLM inference, not training. · Volume deployment targeted by late 2026.

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, on June 24, 2026. The chip, designed to run large language models faster and cheaper, targets volume deployment by late 2026.

Key facts

Jalapeño is an ASIC for LLM inference, not training.
Volume deployment targeted by late 2026.
OpenAI has raised over $40 billion in total funding.
Chip developed in partnership with Broadcom.
No performance metrics or cost data disclosed.

OpenAI and Broadcom have revealed a custom AI chip called Jalapeño, an Application-Specific Integrated Circuit (ASIC) built specifically for large language model inference. According to Bloomberg, the chip is part of OpenAI's bid to gain an edge by tailoring hardware to its AI products. The Verge reports that Jalapeño is designed to power current and future LLMs, with a focus on inference rather than training. The Decoder adds that volume deployment is targeted by late 2026.

Key Takeaways

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, targeting volume deployment by late 2026.
No performance metrics were disclosed.

Why an ASIC for Inference?

OpenAI and Broadcom unveil LLM-optimized inference chip | OpenAI

Jalapeño is an ASIC, meaning it is hardwired for a specific task, in this case AI inference. This contrasts with GPUs, which are more general-purpose and flexible. By customizing the chip for inference, OpenAI and Broadcom aim to improve performance and energy efficiency, potentially lowering the cost of running models like GPT-5.3 and products like ChatGPT. The move follows a broader industry trend: Google has its TPU, Amazon has Trainium and Inferentia, and Microsoft has partnered with AMD and Intel for custom silicon. OpenAI, which has raised over $40 billion in total funding [per our knowledge graph], is now joining the custom-chip club.

Performance Claims and Missing Metrics

OpenAI and Broadcom have not disclosed specific performance metrics, power efficiency gains, or cost reductions for Jalapeño. The announcement on the OpenAI blog is light on technical detail, stating only that the chip will "improve performance, efficiency, and scale across AI systems." Without that data, Jalapeño cannot be meaningfully compared with existing accelerators such as Nvidia's H100, Google's TPU v5p, or Amazon's Inferentia 2. The absence of benchmark numbers suggests either that the chip is still in early production or that OpenAI is keeping competitive details close to the vest.

Strategic Context


This announcement comes amid a flurry of activity from OpenAI. In the past week alone, the company filed paperwork for an IPO [per our knowledge graph], partnered with 25+ security firms, and developed a technique to predict model failures. The Jalapeño chip could be a key enabler for OpenAI's scaling ambitions, especially as it pushes toward AGI. By controlling its own hardware, OpenAI reduces its reliance on Nvidia, whose H100 and B200 GPUs have faced supply constraints and high prices. The partnership draws on the expertise of Broadcom, a major designer of networking chips and custom ASICs, in bringing chips to high-volume production.

What's Missing

The announcement does not specify which foundry will manufacture Jalapeño (likely TSMC or Samsung), the chip's process node, memory bandwidth, or power consumption. Nor does it clarify whether the chip will be used exclusively for OpenAI's internal workloads or offered to third-party customers via Azure or OpenAI's API. The target of "late 2026" for volume deployment leaves a window for competitors to respond. Nvidia, for example, has slated its next-generation Rubin platform for 2026, which could offer competitive inference performance.

What to watch

Watch for specific performance benchmarks from OpenAI or Broadcom in Q3 2026, and for any announcement of which foundry will manufacture Jalapeño and on what process node. Also track whether Nvidia's Rubin platform or Google's newest TPU generation responds with competitive inference specs.


Sources cited in this article

Bloomberg
The Verge


AI-assisted reporting. Generated by gentic.news from 2 verified sources, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.


AI Analysis

The Jalapeño announcement is strategically significant but technically thin. By building a custom inference ASIC, OpenAI joins Google, Amazon, and Microsoft in the custom-silicon club. This reduces dependence on Nvidia, which has dominated the AI chip market with high margins. However, the lack of performance data makes it impossible to assess whether Jalapeño will be competitive. The chip's success hinges on achieving meaningful cost-per-token reductions versus Nvidia's H100 or B200, especially as OpenAI considers steep API price cuts [per our knowledge graph]. The partnership with Broadcom is smart: Broadcom has a strong track record with custom ASICs for networking and AI (e.g., Google's TPU). But the late-2026 timeline gives Nvidia, AMD, and Google room to iterate. The real question is whether Jalapeño will be used exclusively for OpenAI's internal workloads or opened to third parties. If it's exclusive, it's a cost-saving move; if it's offered via the API, it could disrupt the inference chip market. The absence of benchmark numbers suggests OpenAI is either still tuning the chip or wary of revealing competitive intelligence. Either way, the chip story is more about strategic positioning than immediate performance.

