OpenAI

GPT-4o mini cost & gross margin per customer

GPT-4o mini is OpenAI's cheap, fast model. Its low token price makes it one of the easiest models to keep profitable per customer, even at low subscription prices and high usage.

GPT-4o mini is cheap enough that cost per customer is almost a rounding error at typical usage. The risk isn't a single heavy user — it's volume: at scale, even a fraction of a cent per request compounds across millions of calls.

Input

$0.15 /Mtok

Output

$0.6 /Mtok

Margin per customer by usage & plan price

How GPT-4o mini margin holds up as a customer's usage rises, across common subscription prices.

Usage / moLLM cost$19/mo$29/mo$49/mo$79/mo
Light$0.0399.8%99.9%99.9%100%
Typical$0.1699.2%99.4%99.7%99.8%
Heavy$0.6696.5%97.7%98.7%99.2%
Power user$2.7085.8%90.7%94.5%96.6%

Margin % per customer at each plan price. Token prices indicative, as of 2026-06.

Of the $0.17 a typical customer costs on GPT-4o mini, output tokens are $0.09 (53%) and input $0.08. Output is priced at $0.6/Mtok — 4× the input rate — so the more your product generates per request, the faster a customer's margin slips.

Worked example

Take a power user on your $49/mo plan sending 8M input / 2.5M output tokens a month. On GPT-4o mini that's $2.70 in tokens — that's still comfortable at 94.5% ($46.30) — even a heavy user leaves you firmly in the black on most plan prices.

How to keep GPT-4o mini profitable

  • Trim and cache input context — long system prompts and re-sent chat history are pure, repeated cost.
  • Cap output length and stop generation early where you can: at roughly 4× the input price, every extra generated token is where GPT-4o mini hurts most.
  • Route easy requests to a cheaper model and reserve GPT-4o mini for the hard ones that actually need it.
  • Set a per-customer margin alert so one heavy user can't quietly slip into the red unnoticed.

When to choose GPT-4o mini

Choose GPT-4o mini for high-volume, latency-sensitive features where 'good enough' quality keeps customers profitable. It's the safe default for free tiers and anything you run on every request.

FAQ

How much does GPT-4o mini cost per customer?
At a typical 500k input / 150k output tokens per customer per month, GPT-4o mini costs about $0.16 per customer (input 0.15/Mtok, output 0.6/Mtok).
Is GPT-4o mini profitable for a $49/mo AI SaaS?
At typical usage, yes — margin is about 99.7% ($48.84 per customer). It erodes as usage rises; heavy and power users are where GPT-4o mini can turn unprofitable.
What's a good gross margin for an AI SaaS using GPT-4o mini?
Most AI products target a 60–80% gross margin. With GPT-4o mini at typical usage you're around 99.7% on a $49 plan — comfortable — but your blended margin depends on the heavy users, which is the number worth watching.
At what usage does GPT-4o mini stop being profitable on a $29 plan?
Around 90.6M input / 27.2M output tokens a month. Past that point, a $29 customer costs you more than they pay.
How do I reduce GPT-4o mini cost per customer?
Cut output tokens first (they're the priciest), cache or trim input context, route easy requests to a cheaper model, and watch the break-even point — around 153.1M input / 45.9M output tokens a $49 customer stops being profitable.

Compare this model

Other models

Key terms