sakutto
Generative AI

What Is OpenAI Jalapeño? Broadcom's Custom Inference Chip

OpenAIBroadcomAI chipinference
What Is OpenAI Jalapeño? Broadcom's Custom Inference Chip

What Is OpenAI Jalapeño?

How Jalapeño's development is divided

OpenAI
Designs the chip itself from scratch (architecture, kernels, memory, serving systems)
Broadcom
Silicon implementation, high-speed networking (Tomahawk), and large-scale production
Celestica
Board, rack, and system integration

OpenAI Jalapeño is the first in-house AI chip from OpenAI, co-developed with semiconductor giant Broadcom. Announced on June 24, 2026, it was delivered to OpenAI CEO Sam Altman and President Greg Brockman by Broadcom's Hock Tan and Charlie Kawwas. It marks a turning point in vertical integration — OpenAI moving to own the very foundation that runs its inference.

View official source →
"OpenAI and Broadcom (NASDAQ: AVGO) today unveiled Jalapeño, OpenAI's first Intelligence Processor: an accelerator architected around OpenAI's vision for the future of LLM inference." — from the Jalapeño announcement

Jalapeño's definition (OpenAI's first "Intelligence Processor")

Jalapeño is what OpenAI calls an "Intelligence Processor" — its own term for a self-designed accelerator (a compute chip specialized for AI). It was designed from a blank slate for modern LLM inference, not a general-purpose part adapted after the fact, a point OpenAI stresses throughout the announcement.

View official source →
"Jalapeño is a blank-slate design for modern LLM inference, not a general-purpose accelerator adapted from earlier AI workloads." — from the Jalapeño announcement

What "inference-only" means (vs. training)

It helps to clarify "inference" first. AI work splits into two stages: "training," which builds a model from vast data, and "inference," which runs the finished model to produce answers. Jalapeño targets the latter — the moment ChatGPT answers a question. Inference is where AI reaches people, so making it faster, cheaper, and more stable feeds directly into how good the service feels.

Engineering samples are already running in the lab at production target frequency and power, executing AI workloads that include OpenAI's GPT-5.3-Codex-Spark.

View official source →
"Engineering samples of the Jalapeño chip are running ML workloads in the lab at production target frequency and power, including GPT-5.3-Codex-Spark." — from the Jalapeño announcement

How OpenAI, Broadcom, and Celestica divide the work

As the figure above shows, OpenAI designs the chip, Broadcom handles silicon implementation, high-speed networking (Tomahawk), and production, and Celestica handles board, rack, and system integration. OpenAI pours its model insights into the design, and the semiconductor and networking specialists turn that into an industrial product.

Compared with a general-purpose high-performance accelerator, Jalapeño sits like this.

AspectConventional general-purpose acceleratorJalapeño (inference-only)
Design philosophyBroad: training and inference alikeBlank-slate design focused on LLM inference
Optimization targetGeneral compute performanceKernels, memory movement, networking, serving
Who leads the designChip vendorOpenAI, guided by its model insights
Main goalBroad supply to the whole marketRunning OpenAI's own inference efficiently

Performance and the Nine-Month Build

Key facts about Jalapeño

Type
OpenAI's first in-house AI chip (Intelligence Processor)
Purpose
LLM inference only (a blank-slate design, not a repurposed part)
Performance
Performance per watt substantially better than current state of the art (measurement still in progress)
Build time
~9 months from design to tape-out (among the fastest ASIC cycles)
In the lab
Running GPT-5.3-Codex-Spark and more at production target frequency and power
Deployment
From the end of 2026, at gigawatt scale with Microsoft and others

What draws attention is not only the novelty of the goal, but the projected performance and the build speed. Here is what has been disclosed.

Performance per watt: "substantially better" (measurement in progress)

In early testing, OpenAI says Jalapeño's performance per watt is on track to substantially exceed the current state of the art. That said, final measurement is still in progress, and the detailed technical report is due in the coming months. Since the numbers aren't final, they are best read as projections.

View official source →
"While OpenAI is still measuring final performance, early testing shows that Jalapeño will deliver performance per watt substantially better than current state-of-the-art." — from the Jalapeño announcement

The basis for the performance is the architecture. By cutting down on data movement and balancing compute, memory, and networking, it is designed to reach realized performance close to the theoretical peak.

View official source →
"The architecture reduces data movement and balances compute, memory, and networking resources to achieve realized utilization much closer to theoretical peak performance." — from the Jalapeño announcement

From design to tape-out in "just nine months"

The build speed stands out too. From the initial design to the manufacturing milestone, tape-out, took just nine months. OpenAI calls it what it believes to be the fastest ASIC development cycle ever achieved in high-performance advanced semiconductors. This kind of chip development usually takes years, which makes nine months strikingly fast.

View official source →
"Jalapeño was co-developed from initial design to manufacturing tape-out in just nine months, and the custom AI accelerator program represents what we believe to be the fastest ASIC development cycle ever achieved in high-performance advanced semiconductors." — from the Jalapeño announcement

OpenAI's own AI models sped up the design

That speed was helped by OpenAI's own AI models. On top of tight software-hardware co-development, the company used its own models for parts of the design and optimization. The same models served to users are helping build the foundation that will run the next generation — a self-reinforcing loop.

View official source →
"The same models served to users are helping improve the infrastructure used to run future models." — from the Jalapeño announcement

Faster, cheaper inference flows straight back into how services like ChatGPT feel. For the differences between the major models and the bigger picture, see our guide to ChatGPT (GPT-5).

Free ToolURL to Markdown ConverterConvert any public web page URL to Markdown. Preserves headings, tables, lists, and links — perfect for LLM and RAG preprocessing, research notes, and archiving web articles.Try it now →

Why OpenAI Is Building Its Own Chip

What OpenAI now designs in-house (the full stack)

Chip
Jalapeño (the newly added bottom layer)
Base software
Kernels, memory, networking, scheduling
Models
The GPT family of models
Products
ChatGPT, Codex, the API

Jalapeño's real significance lies less in one chip's performance and more in how far OpenAI wants to own its own stack. Here are three angles on the backdrop.

A "full-stack" strategy: designing everything from chip to product

OpenAI has built frontier models and the products on top of them. Now it is stepping into the layer below — the chips and networking that run those models. The strength of vertical integration is that every layer can be optimized toward one goal: making the models faster, more reliable, and more affordable. President Greg Brockman frames Jalapeño as part of that long-term full-stack strategy.

View official source →
"Jalapeño is part of our long-term full-stack infrastructure strategy to make compute more abundant, resulting in AI which is faster, more reliable, more affordable for people and businesses." — from the Jalapeño announcement

The "first-generation chip" of the ten-gigawatt partnership

Jalapeño didn't come out of nowhere. Back in October 2025, OpenAI and Broadcom had already announced a partnership to co-deploy ten gigawatts of OpenAI-designed custom AI accelerators, and Jalapeño is the first generation of that multi-generation plan.

View official source →
"Broadcom will deploy racks of AI accelerator and network systems targeted to start in the second half of 2026, to complete by end of 2029." — from the strategic-collaboration announcement

The view that it reduces Nvidia reliance

Moves like this are often described as a way to reduce reliance on Nvidia's GPUs. Serving its own inference on its own chip does loosen concentration on any single vendor. That said, OpenAI's official materials never claim to "replace Nvidia." What it puts forward is the full-stack story of making compute more abundant and AI more broadly available; the "reduced Nvidia reliance" reading comes from outside observers.

When Jalapeño Ships and What It Means

The OpenAI × Broadcom timeline

Oct 2025
Announce the ten-gigawatt custom AI accelerator partnership
Jun 2026
Unveil the first-generation chip, Jalapeño
End 2026
Begin initial deployment (gigawatt scale, with Microsoft and others)
End 2029
Targeted completion of the rack rollout

Finally, here is when Jalapeño starts running and how it touches everyday AI users.

Deploying from the end of 2026 with Microsoft and others

As the first step of a multi-generation platform, Jalapeño is slated for initial deployment from the end of 2026. Broadcom CEO Hock Tan says that co-developing the silicon directly with OpenAI enables gigawatt-scale data centers with Microsoft and other partners beginning in 2026.

View official source →
"By co-developing our industry-leading silicon directly with OpenAI, we are enabling the deployment of gigawatt scale data centers with Microsoft and other partners beginning in 2026." — from the Jalapeño announcement

What it means for everyday AI users

Jalapeño isn't something individuals buy and use directly. Even so, the benefits circle back into everyday AI. If inference gets cheaper and faster, that shows up as quicker ChatGPT answers, more stable access at peak times, and cheaper-to-build API products. OpenAI itself stresses that inference is where AI reaches people, and that each improvement there changes how the experience feels.

Official announcements like this are often long English texts, and there are times you'll want AI to summarize the key points. Converting a web page into clean Markdown keeps the heading and table structure intact, which tends to improve how accurately AI reads it.

Free ToolURL to Markdown ConverterConvert any public web page URL to Markdown. Preserves headings, tables, lists, and links — perfect for LLM and RAG preprocessing, research notes, and archiving web articles.Try it now →

The Bottom Line on OpenAI Jalapeño

OpenAI Jalapeño is the company's first in-house AI chip, built with Broadcom. It is purpose-built from scratch for inference — running trained models — and its performance per watt is projected to substantially beat the current state of the art. It was finished at an unusual pace, from design to its manufacturing milestone in nine months. Behind it sits a full-stack strategy of owning everything from chip to product, plus the ten-gigawatt partnership struck in October 2025. OpenAI doesn't claim to "replace Nvidia" — but the move to serve its own inference on its own chip is clearly accelerating.

For more on the latest models and availability, see our explainers on why you can't use GPT-5.6 yet and OpenAI Daybreak.

FAQ

Q. What is OpenAI Jalapeño?
It is OpenAI's first in-house AI chip, co-developed with Broadcom. It is purpose-built from scratch for inference — running trained AI models to produce responses. It was announced on June 24, 2026.
OpenAI Official (Jalapeño announcement)
OpenAI and Broadcom (NASDAQ: AVGO) today unveiled Jalapeño, OpenAI's first Intelligence Processor: an accelerator architected around OpenAI's vision for the future of LLM inference. OpenAI Official (Jalapeño announcement)
Q. When will Jalapeño be deployed?
Initial deployment is planned for the end of 2026. It will expand gradually with Microsoft and other partners across gigawatt-scale data centers. It is the first generation of a multi-generation compute platform.
OpenAI Official (Jalapeño announcement)
Jalapeño is the first step in a multi-generation compute platform designed for initial deployment by the end of 2026 and expanding in the years ahead. OpenAI Official (Jalapeño announcement)
Q. Does Jalapeño replace Nvidia's GPUs?
OpenAI does not officially say it 'replaces Nvidia.' The stated goal is a full-stack strategy — designing everything from chip to product — to make compute more abundant and deliver AI faster and more affordably. Serving its own inference on its own chip does ease reliance on any single GPU supplier, but that framing comes from outside observers, not OpenAI.
OpenAI Official (Greg Brockman statement)
Jalapeño is part of our long-term full-stack infrastructure strategy to make compute more abundant, resulting in AI which is faster, more reliable, more affordable for people and businesses. OpenAI Official (Greg Brockman statement)

Related Tools

Related Tool Categories

Articles