OpenAI and Broadcom unveil Jalapeño inference chip

25 June 2026, 23:31·1 min read

OpenAI and Broadcom unveiled Jalapeño, OpenAI’s first Intelligence Processor and the first accelerator in a multi-generation compute platform aimed at LLM inference. The chip was designed around OpenAI’s model roadmap, serving systems, kernels, and product needs, with Broadcom contributing silicon implementation and networking technologies and Celestica supporting board, rack, and system integration.

Engineering samples are running ML workloads in the lab at production target frequency and power, including GPT‑5.3‑Codex‑Spark. OpenAI said final measurements are still underway, but early testing shows performance per watt substantially better than current state-of-the-art. The architecture is intended to reduce data movement and balance compute, memory, and networking resources so workloads run closer to theoretical peak performance.

Jalapeño was co-developed from initial design to manufacturing tape-out in just nine months, with OpenAI models used to accelerate parts of the design and optimization process. The platform is designed for initial deployment by the end of 2026 and is expected to expand over multiple generations, including gigawatt scale data centers with Microsoft and other partners.

Originally reported by openai.comRead the source →

Related coverage

Chips

OpenAI and Broadcom unveil Jalapeño inference chip

OpenAI and Broadcom unveil Jalapeño AI chip

AI advances push reasoning, edge computing and quantum hybrids

OpenAI aims for $100 billion ad business by 2030

AI boom races ahead as costs and risks mount