Skip to content
agenticnick

The Inference Cost Curve Is Bending

Every step down the cost curve quietly makes a new category of product economic.

Where this goes

Open weights redrew the competitive map. A capable model you can run yourself reframes every build-versus-buy decision, and the frontier labs are well aware of it.

Inference cost is falling faster than most forecasts assumed. Each step down the curve unlocks a class of product that was uneconomic the month before, which is why the application layer keeps surprising people.

Margins at the model layer are thinner than the headlines suggest. Training is a capital expense; serving is a recurring one. The business that wins is rarely the one with the largest model.

Every software company now describes itself as an AI company. Most of that is positioning. The ones that mean it have rebuilt a core workflow, not bolted a chat box onto the dashboard.

Open weights redrew the competitive map. A capable model you can run yourself reframes every build-versus-buy decision, and the frontier labs are well aware of it.

Revision history

  • 2026-05-15 Added a note on batch inference economics.
  • 2026-05-10 Updated the cost figures for current pricing.

Related

Search

Esc

Search is available on the live site.