On June 24, OpenAI and Broadcom walked out on stage and handed Sam Altman a piece of silicon. They called it Jalapeño — OpenAI’s first custom chip, built to run large language models rather than train them. Just to serve them to you when you ask ChatGPT a question. The press cycle did what press cycles do. Broadcom’s stock moved. Everyone wrote the “OpenAI takes on Nvidia” headline.
And the chip is the least interesting thing about the whole announcement.
I don’t mean that dismissively. A purpose-built inference processor is serious engineering, and the people who designed it are very good at their jobs. But “company builds custom chip to cut costs” is a story we’ve seen before. Google did it with the TPU. Amazon did it. Meta’s doing it. A new logo joining that club is not the part worth your attention.
The part worth your attention sits in a single sentence of the announcement, and almost nobody is talking about it: OpenAI used its own models to help design the chip. They compressed what is normally a multi-year process into nine months — blank page to tape-out — partly by pointing their AI at the problem of building better AI hardware.
We’ve spent two years arguing about whether AI can write a decent email or replace a junior developer. Meanwhile, quietly, it just helped design the physical processor that will run the next generation of itself. The thing serving your prompts had a hand in shaping the thing that will serve them faster. That’s not a productivity hack. That’s a loop closing.
This is what I’d flag to anyone building or leading in this space: the most important AI capabilities aren’t showing up in the demos. They’re showing up in the supply chain. Chip design, materials science, protein folding, logistics — the unglamorous, deeply technical work where a small acceleration compounds into an enormous one. We keep measuring AI by how well it mimics us. The sharper question is what happens when it gets good at the things we were never fast enough to do ourselves.
There’s an old line that the best place to hide something is in plain sight. The self-improving loop everyone has theorized about — the one that lives in essays with ominous titles — didn’t arrive with a manifesto. It showed up as a footnote in a hardware press release.
Now the part where I keep us honest, because that’s the whole point of this newsletter.
We do not actually know how good this chip is. OpenAI says it delivers significantly better performance-per-watt than the current state of the art. Bloomberg reported it could cut inference costs by roughly half. Both numbers would be a very big deal if they hold. But notice the verbs. Says. Could. The detailed performance report is “coming in the months ahead,” which is corporate for “we haven’t shown our work yet.” What we have today is engineering samples running in a lab and a set of claims nobody outside the building can check.
So treat the 50% with the skepticism it deserves. A chip designed for one company’s exact workloads will always look spectacular on that company’s exact workloads. The real test is independent benchmarks, deployment at scale, and whether the economics survive contact with reality. We’ll start finding out in late 2026. Until then, the cost cut is a promise, not a result.
But the design-loop story doesn’t depend on the benchmark. Whether Jalapeño turns out to be 50% cheaper or 20% cheaper, the method that produced it — AI shrinking the timeline for building its own infrastructure — is already real, already shipped, and already repeatable. That’s the thing I can’t stop thinking about.
The chip will get faster. The next one will be designed faster still, by better models, in less time. The interesting number was never the performance-per-watt. It was the nine months.
Keep your eye on the loop, not the logo.
Discover more from Akshat Uniyal
Subscribe to get the latest posts sent to your email.
