Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

Listen to today's AI briefing

Daily podcast — 5 min, AI-narrated summary of top stories

Two engineers in cleanroom suits examine a glowing blue silicon wafer etched with chip designs, a monitor displaying…
Products & LaunchesBreakthroughScore: 100

OpenAI, Broadcom Unveil Jalapeño ASIC for LLM Inference

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, targeting volume deployment by late 2026. No performance metrics were disclosed.

·9h ago·4 min read··18 views·AI-Generated·Report error
Share:
Source: bloomberg.comvia bloomberg_tech, the_verge_tech, openai_blog, the_decoder, nvidia_blog, techcrunch_ai, engadget, @mweinbachMulti-Source
What is the Jalapeño chip that OpenAI and Broadcom unveiled?

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, on June 24, 2026. The chip aims to improve performance and efficiency, with volume deployment targeted by late 2026.

TL;DR

OpenAI and Broadcom announce Jalapeño ASIC chip. · Chip designed for LLM inference, not training. · Volume deployment targeted by late 2026.

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, on June 24, 2026. The chip, designed to run large language models faster and cheaper, targets volume deployment by late 2026.

Key facts

  • Jalapeño is an ASIC for LLM inference, not training.
  • Volume deployment targeted by late 2026.
  • OpenAI has raised over $40 billion in total funding.
  • Chip developed in partnership with Broadcom.
  • No performance metrics or cost data disclosed.

OpenAI and Broadcom have revealed a custom AI chip called Jalapeño, an Application-Specific Integrated Circuit (ASIC) built specifically for large language model inference. According to Bloomberg, the chip is part of OpenAI's bid to gain an edge by tailoring hardware to its AI products. The Verge reports that Jalapeño is designed to power current and future LLMs, with a focus on inference rather than training. The Decoder adds that volume deployment is targeted by late 2026.

Key Takeaways

  • OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, targeting volume deployment by late 2026.
  • No performance metrics were disclosed.

Why an ASIC for Inference?

OpenAI and Broadcom unveil LLM-optimized inference chip | OpenAI

Jalapeño is an ASIC, meaning it is hardwired for a specific task — in this case, AI inference. This contrasts with GPUs, which are general-purpose and more flexible. By customizing the chip for inference, OpenAI and Broadcom aim to improve performance and energy efficiency, potentially lowering the cost of running models like GPT-5.3 and ChatGPT. The move follows a broader industry trend: Google has its TPU, Amazon has Trainium and Inferentia, and Microsoft has partnered with AMD and Intel for custom silicon. OpenAI, which has raised over $40 billion in total funding [per our knowledge graph], is now joining the custom-chip club.

Performance Claims and Missing Metrics

OpenAI and Broadcom have not disclosed specific performance metrics, power efficiency gains, or cost reductions for Jalapeño. The announcement per the OpenAI blog is light on technical detail, stating only that the chip will "improve performance, efficiency, and scale across AI systems." This lack of data makes it difficult to compare Jalapeño to existing inference chips like Nvidia's H100, Google's TPU v5p, or Amazon's Inferentia 2. The absence of benchmark numbers suggests the chip may still be in early production or that OpenAI is keeping competitive details close to the vest.

Strategic Context

OpenAI and Broadcom unveil LLM-optimized inference chip | OpenAI

This announcement comes amid a flurry of activity from OpenAI. In the past week alone, the company filed paperwork for an IPO [per our knowledge graph], partnered with 25+ security firms, and developed a technique to predict model failures. The Jalapeño chip could be a key enabler for OpenAI's scaling ambitions, especially as it pushes toward AGI. By controlling its own hardware, OpenAI reduces reliance on Nvidia, which has faced supply constraints and high prices for its H100 and B200 GPUs. The partnership with Broadcom, a major networking and ASIC designer, leverages Broadcom's expertise in high-volume chip production.

What's Missing

The announcement does not specify which foundry will manufacture Jalapeño (likely TSMC or Samsung), the chip's process node, memory bandwidth, or power consumption. Nor does it clarify whether the chip will be used exclusively for OpenAI's internal workloads or offered to third-party customers via Azure or OpenAI's API. The target of "late 2026" for volume deployment leaves a window for competitors to respond. Nvidia, for example, is expected to release its next-generation Blackwell Ultra GPU in 2026, which could offer competitive inference performance.

What to watch

Watch for specific performance benchmarks from OpenAI or Broadcom in Q3 2026, and for any announcements about which foundry and process node will manufacture Jalapeño. Also track whether Nvidia's Blackwell Ultra or Google's TPU v6 respond with competitive inference specs.


Source: bloomberg.com


Sources cited in this article

  1. The Verge
Source: gentic.news · · author= · citation.json

AI-assisted reporting. Generated by gentic.news from 2 verified sources, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.

Following this story?

Get a weekly digest with AI predictions, trends, and analysis — free.

AI Analysis

The Jalapeño announcement is strategically significant but technically thin. By building a custom inference ASIC, OpenAI joins Google, Amazon, and Microsoft in the custom-silicon club. This reduces dependence on Nvidia, which has dominated the AI chip market with high margins. However, the lack of performance data makes it impossible to assess whether Jalapeño will be competitive. The chip's success hinges on achieving meaningful cost-per-token reductions versus Nvidia's H100 or B200, especially as OpenAI considers steep API price cuts [per our knowledge graph]. The partnership with Broadcom is smart: Broadcom has a strong track record with custom ASICs for networking and AI (e.g., Google's TPU). But the late-2026 timeline gives Nvidia, AMD, and Google room to iterate. The real question is whether Jalapeño will be used exclusively for OpenAI's internal workloads or opened to third parties. If it's exclusive, it's a cost-saving move; if it's offered via the API, it could disrupt the inference chip market. The absence of benchmark numbers suggests OpenAI is either still tuning the chip or wary of revealing competitive intelligence. Either way, the chip story is more about strategic positioning than immediate performance.
This story is part of
The AI Infrastructure War Shifts from Chips to Developer Tools
Nvidia's enterprise pivot and AWS's OpenAI bet collide with Cursor's quiet ascent
Compare side-by-side
OpenAI vs Broadcom

Mentioned in this article

Enjoyed this article?
Share:

AI Toolslive

Five one-click lenses on this article. Cached for 24h.

Pick a tool above to generate an instant lens on this article.

Related Articles

From the lab

The framework underneath this story

Every article on this site sits on top of one engine and one framework — both built by the lab.

More in Products & Launches

View all