When will Jalapeño be deployed?

Initial deployment is planned for the end of 2026. It will expand gradually with Microsoft and other partners across gigawatt-scale data centers. It is the first generation of a multi-generation compute platform.

Does Jalapeño replace Nvidia's GPUs?

OpenAI does not officially say it 'replaces Nvidia.' The stated goal is a full-stack strategy — designing everything from chip to product — to make compute more abundant and deliver AI faster and more affordably. Serving its own inference on its own chip does ease reliance on any single GPU supplier, but that framing comes from outside observers, not OpenAI.

What Is OpenAI Jalapeño? Broadcom's Custom Inference Chip

What is OpenAI Jalapeño?

It is OpenAI's first in-house AI chip, co-developed with Broadcom. It is purpose-built from scratch for inference — running trained AI models to produce responses. It was announced on June 24, 2026.

Article summary

Jalapeño is OpenAI's first in-house AI chip, co-developed with Broadcom. It is purpose-built from scratch for inference — running trained models.
Work is split three ways: OpenAI handles the design, Broadcom the silicon and networking, and Celestica the board and rack. It is a "blank-slate design," not a repurposed general-purpose part.
OpenAI says its performance per watt is substantially better than the current state of the art (though final measurement is still in progress, with a detailed report due in the coming months).
It went from design to its manufacturing milestone (tape-out) in just nine months, and OpenAI's own AI models helped speed up part of the design.
It is the first-generation chip of the October 2025 ten-gigawatt partnership, deploying from the end of 2026 at gigawatt scale with Microsoft and others.
The aim is vertical integration (the full stack) from chip to product. This does ease reliance on a single GPU supplier, but OpenAI does not claim to "replace Nvidia."

What Is OpenAI Jalapeño?

How Jalapeño's development is divided

OpenAI: Designs the chip itself from scratch (architecture, kernels, memory, serving systems)
Broadcom: Silicon implementation, high-speed networking (Tomahawk), and large-scale production
Celestica: Board, rack, and system integration

OpenAI Jalapeño is the first in-house AI chip from OpenAI, co-developed with semiconductor giant Broadcom. Announced on June 24, 2026, it was delivered to OpenAI CEO Sam Altman and President Greg Brockman by Broadcom's Hock Tan and Charlie Kawwas. It marks a turning point in vertical integration — OpenAI moving to own the very foundation that runs its inference.

OpenAI Official (Jalapeño announcement)

"OpenAI and Broadcom (NASDAQ: AVGO) today unveiled Jalapeño, OpenAI's first Intelligence Processor: an accelerator architected around OpenAI's vision for the future of LLM inference." — from the Jalapeño announcement

Jalapeño's definition (OpenAI's first "Intelligence Processor")

Jalapeño is what OpenAI calls an "Intelligence Processor," its own term for a self-designed accelerator (a compute chip specialized for AI). It was designed from a blank slate for modern LLM inference rather than adapted from a general-purpose part after the fact, a point OpenAI stresses throughout the announcement.

OpenAI Official (Jalapeño announcement)

"Jalapeño is a blank-slate design for modern LLM inference, not a general-purpose accelerator adapted from earlier AI workloads." — from the Jalapeño announcement

What "inference-only" means (vs. training)

It helps to clarify "inference" first. AI work splits into two stages: "training," which builds a model from vast data, and "inference," which runs the finished model to produce answers. Jalapeño targets the latter — the moment ChatGPT answers a question. Inference is where AI reaches people, so making it faster, cheaper, and more stable feeds directly into how good the service feels.
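The two stages can be sketched with a toy example. This is a one-parameter linear model with made-up data, nothing like an LLM or OpenAI's own code, but it shows the shape of the split: training loops over known data to fit a weight, and inference simply applies that weight to a new input.

```python
# Toy illustration only: a one-parameter model y = w * x, not an LLM.

def train(samples, steps=200, lr=0.05):
    """Training: repeatedly adjust the weight to reduce error on known data."""
    w = 0.0
    for _ in range(steps):
        # Gradient of the mean squared error with respect to w
        grad = sum(2 * (w * x - y) * x for x, y in samples) / len(samples)
        w -= lr * grad
    return w


def infer(w, x):
    """Inference: apply the already-trained weight to a new input."""
    return w * x


samples = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]  # hypothetical data following y = 2x
weight = train(samples)        # costly, done once up front
answer = infer(weight, 10.0)   # cheap, repeated for every request
print(round(answer, 3))
```

Because inference runs once per request rather than once per model, its cost scales with usage, which is why a chip aimed only at that step can matter so much.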

Engineering samples are already running in the lab at production target frequency and power, executing AI workloads that include OpenAI's GPT-5.3-Codex-Spark.

OpenAI Official (Jalapeño announcement)

"Engineering samples of the Jalapeño chip are running ML workloads in the lab at production target frequency and power, including GPT-5.3-Codex-Spark." — from the Jalapeño announcement

How OpenAI, Broadcom, and Celestica divide the work

As the figure above shows, OpenAI designs the chip, Broadcom handles silicon implementation, high-speed networking (Tomahawk), and production, and Celestica handles board, rack, and system integration. OpenAI pours its model insights into the design, and the semiconductor and networking specialists turn that into an industrial product.

The table below compares Jalapeño with a conventional general-purpose high-performance accelerator.

Aspect	Conventional general-purpose accelerator	Jalapeño (inference-only)
Design philosophy	Broad: training and inference alike	Blank-slate design focused on LLM inference
Optimization target	General compute performance	Kernels, memory movement, networking, serving
Who leads the design	Chip vendor	OpenAI, guided by its model insights
Main goal	Broad supply to the whole market	Running OpenAI's own inference efficiently

Performance and the Nine-Month Build

Key facts about Jalapeño

Type: OpenAI's first in-house AI chip (Intelligence Processor)
Purpose: LLM inference only (a blank-slate design, not a repurposed part)
Performance: Performance per watt substantially better than current state of the art (measurement still in progress)
Build time: ~9 months from design to tape-out (among the fastest ASIC cycles)
In the lab: Running GPT-5.3-Codex-Spark and more at production target frequency and power
Deployment: From the end of 2026, at gigawatt scale with Microsoft and others

What draws attention is not only the novelty of the goal, but the projected performance and the build speed. Here is what has been disclosed.

Performance per watt: "substantially better" (measurement in progress)

In early testing, OpenAI says Jalapeño's performance per watt is on track to substantially exceed the current state of the art. That said, final measurement is still in progress, and the detailed technical report is due in the coming months. Since the numbers aren't final, they are best read as projections.
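Performance per watt is simple arithmetic: work done per second divided by power drawn. The sketch below uses tokens per second as the unit of work, which is an assumption for illustration, and every figure in it is hypothetical. None of them are Jalapeño measurements, which OpenAI has not yet published.

```python
def perf_per_watt(tokens_per_second: float, watts: float) -> float:
    """Tokens per second per watt, which is the same as tokens per joule."""
    if watts <= 0:
        raise ValueError("watts must be positive")
    return tokens_per_second / watts


# Hypothetical figures for illustration; not Jalapeño or any real product.
baseline = perf_per_watt(tokens_per_second=10_000, watts=1_000)
candidate = perf_per_watt(tokens_per_second=12_000, watts=800)

# Ratio of the two efficiencies: above 1.0 means the candidate does more per watt.
improvement = candidate / baseline
print(baseline, candidate, improvement)
```

A chip can improve this ratio by raising throughput, lowering power, or both, so a single "better per watt" claim does not say which lever moved.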

OpenAI Official (Jalapeño announcement)

"While OpenAI is still measuring final performance, early testing shows that Jalapeño will deliver performance per watt substantially better than current state-of-the-art." — from the Jalapeño announcement

The projected efficiency rests on the architecture. By cutting down on data movement and balancing compute, memory, and networking, it is designed to bring realized utilization much closer to the theoretical peak.
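One common way to reason about why data movement limits realized performance is the roofline model. It is a general analysis tool, not something OpenAI says it used, and the peak and bandwidth numbers below are hypothetical. The sketch shows that a workload doing few operations per byte moved is capped by memory bandwidth rather than by the compute peak.

```python
def attainable_flops(peak_flops: float, mem_bandwidth: float, intensity: float) -> float:
    """Roofline bound: the lower of the compute ceiling and bandwidth * FLOPs-per-byte."""
    return min(peak_flops, mem_bandwidth * intensity)


def utilization(peak_flops: float, mem_bandwidth: float, intensity: float) -> float:
    """Fraction of theoretical peak that the roofline bound allows."""
    return attainable_flops(peak_flops, mem_bandwidth, intensity) / peak_flops


# Hypothetical accelerator for illustration; not Jalapeño specifications.
PEAK = 1000e12      # 1000 TFLOP/s compute ceiling
BANDWIDTH = 4e12    # 4 TB/s memory bandwidth

memory_bound = utilization(PEAK, BANDWIDTH, intensity=50)    # few FLOPs per byte moved
compute_bound = utilization(PEAK, BANDWIDTH, intensity=500)  # many FLOPs per byte moved
print(memory_bound, compute_bound)
```

In this framing, reducing data movement raises the effective operations per byte, which moves a workload toward the compute-bound end where utilization approaches the peak.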

OpenAI Official (Jalapeño announcement)

"The architecture reduces data movement and balances compute, memory, and networking resources to achieve realized utilization much closer to theoretical peak performance." — from the Jalapeño announcement

From design to tape-out in "just nine months"

The build speed stands out too. Going from initial design to tape-out, the manufacturing milestone, took just nine months. OpenAI says it believes this is the fastest ASIC development cycle ever achieved in high-performance advanced semiconductors. Chip development of this kind usually takes years, which makes nine months strikingly fast.

OpenAI Official (Jalapeño announcement)

"Jalapeño was co-developed from initial design to manufacturing tape-out in just nine months, and the custom AI accelerator program represents what we believe to be the fastest ASIC development cycle ever achieved in high-performance advanced semiconductors." — from the Jalapeño announcement

OpenAI's own AI models sped up the design

That speed was helped by OpenAI's own AI models. On top of tight software-hardware co-development, the company used its own models for parts of the design and optimization. The same models served to users are helping build the foundation that will run the next generation — a self-reinforcing loop.

OpenAI Official (Jalapeño announcement)

"The same models served to users are helping improve the infrastructure used to run future models." — from the Jalapeño announcement

Faster, cheaper inference flows straight back into how services like ChatGPT feel. For the differences between the major models and the bigger picture, see our guide to ChatGPT (GPT-5).

Why OpenAI Is Building Its Own Chip

What OpenAI now designs in-house (the full stack)

Chip: Jalapeño (the newly added bottom layer)
Base software: Kernels, memory, networking, scheduling
Models: The GPT family of models
Products: ChatGPT, Codex, the API

Jalapeño's real significance lies less in one chip's performance and more in how far OpenAI wants to own its own stack. Here are three angles on the backdrop.

A "full-stack" strategy: designing everything from chip to product

OpenAI has built frontier models and the products on top of them. Now it is stepping into the layer below — the chips and networking that run those models. The strength of vertical integration is that every layer can be optimized toward one goal: making the models faster, more reliable, and more affordable. President Greg Brockman frames Jalapeño as part of that long-term full-stack strategy.

OpenAI Official (Greg Brockman statement)

"Jalapeño is part of our long-term full-stack infrastructure strategy to make compute more abundant, resulting in AI which is faster, more reliable, more affordable for people and businesses." — from the Jalapeño announcement

The "first-generation chip" of the ten-gigawatt partnership

Jalapeño didn't come out of nowhere. Back in October 2025, OpenAI and Broadcom had already announced a partnership to co-deploy ten gigawatts of OpenAI-designed custom AI accelerators, and Jalapeño is the first generation of that multi-generation plan.

OpenAI Official (ten-gigawatt partnership, October 2025)

"Broadcom will deploy racks of AI accelerator and network systems targeted to start in the second half of 2026, to complete by end of 2029." — from the strategic-collaboration announcement

The view that it reduces Nvidia reliance

Moves like this are often described as a way to reduce reliance on Nvidia's GPUs. Serving its own inference on its own chip does loosen concentration on any single vendor. That said, OpenAI's official materials never claim to "replace Nvidia." What it puts forward is the full-stack story of making compute more abundant and AI more broadly available; the "reduced Nvidia reliance" reading comes from outside observers.

When Jalapeño Ships and What It Means

The OpenAI × Broadcom timeline

Oct 2025: Announce the ten-gigawatt custom AI accelerator partnership
Jun 2026: Unveil the first-generation chip, Jalapeño
End 2026: Begin initial deployment (gigawatt scale, with Microsoft and others)
End 2029: Targeted completion of the rack rollout

Finally, here is when Jalapeño starts running and how it touches everyday AI users.

Deploying from the end of 2026 with Microsoft and others

As the first step of a multi-generation platform, Jalapeño is slated for initial deployment from the end of 2026. Broadcom CEO Hock Tan says that co-developing the silicon directly with OpenAI enables the deployment of gigawatt-scale data centers with Microsoft and other partners beginning in 2026.

OpenAI Official (Hock Tan statement)

"By co-developing our industry-leading silicon directly with OpenAI, we are enabling the deployment of gigawatt scale data centers with Microsoft and other partners beginning in 2026." — from the Jalapeño announcement

What it means for everyday AI users

Jalapeño isn't something individuals buy and use directly. Even so, the benefits circle back into everyday AI. If inference gets cheaper and faster, that shows up as quicker ChatGPT answers, more stable access at peak times, and cheaper-to-build API products. OpenAI itself stresses that inference is where AI reaches people, and that each improvement there changes how the experience feels.

Official announcements like this are often long English texts, and there are times you'll want AI to summarize the key points. Converting a web page into clean Markdown keeps the heading and table structure intact, which tends to improve how accurately AI reads it.

The Bottom Line on OpenAI Jalapeño

OpenAI Jalapeño is the company's first in-house AI chip, built with Broadcom. It is purpose-built from scratch for inference — running trained models — and its performance per watt is projected to substantially beat the current state of the art. It was finished at an unusual pace, from design to its manufacturing milestone in nine months. Behind it sits a full-stack strategy of owning everything from chip to product, plus the ten-gigawatt partnership struck in October 2025. OpenAI doesn't claim to "replace Nvidia" — but the move to serve its own inference on its own chip is clearly accelerating.

For more on the latest models and availability, see our explainers on why you can't use GPT-5.6 yet and OpenAI Daybreak.
