OpenAI

GPT-4o cost & gross margin per customer

GPT-4o is OpenAI's flagship general-purpose model — strong quality at a mid-range price. On a flat-priced AI product it's comfortably profitable for light usage, but heavy chat or agent workloads can erode your margin fast.

GPT-4o sits in the upper-middle of the price range: input is moderate, but output at $10/Mtok adds up on chatty, generative features. It's rarely a problem for light tools, yet conversational or agentic products built on it need per-customer margin tracking from day one.

Input

$2.5 /Mtok

Output

$10 /Mtok

Margin per customer by usage & plan price

How GPT-4o margin holds up as a customer's usage rises, across common subscription prices.

Usage / moLLM cost$19/mo$29/mo$49/mo$79/mo
Light$0.5597.1%98.1%98.9%99.3%
Typical$2.7585.5%90.5%94.4%96.5%
Heavy$11.0042.1%62.1%77.6%86.1%
Power user$45.00-136.8%-55.2%8.2%43%

Margin % per customer at each plan price. Token prices indicative, as of 2026-06.

Of the $2.75 a typical customer costs on GPT-4o, output tokens are $1.50 (55%) and input $1.25. Output is priced at $10/Mtok — 4× the input rate — so the more your product generates per request, the faster a customer's margin slips.

Worked example

Take a power user on your $49/mo plan sending 8M input / 2.5M output tokens a month. On GPT-4o that's $45.00 in tokens — that's only just positive at 8.2% ($4.00) — thin, and any cheaper plan would flip it negative.

How to keep GPT-4o profitable

  • Trim and cache input context — long system prompts and re-sent chat history are pure, repeated cost.
  • Cap output length and stop generation early where you can: at roughly 4× the input price, every extra generated token is where GPT-4o hurts most.
  • Route easy requests to a cheaper model and reserve GPT-4o for the hard ones that actually need it.
  • Set a per-customer margin alert so one heavy user can't quietly slip into the red unnoticed.

When to choose GPT-4o

Choose GPT-4o when you need dependable general quality and per-customer usage is light to moderate. If your product is conversation- or agent-heavy, model the margin first — or reserve it for your premium tiers.

FAQ

How much does GPT-4o cost per customer?
At a typical 500k input / 150k output tokens per customer per month, GPT-4o costs about $2.75 per customer (input 2.5/Mtok, output 10/Mtok).
Is GPT-4o profitable for a $49/mo AI SaaS?
At typical usage, yes — margin is about 94.4% ($46.25 per customer). It erodes as usage rises; heavy and power users are where GPT-4o can turn unprofitable.
What's a good gross margin for an AI SaaS using GPT-4o?
Most AI products target a 60–80% gross margin. With GPT-4o at typical usage you're around 94.4% on a $49 plan — comfortable — but your blended margin depends on the heavy users, which is the number worth watching.
At what usage does GPT-4o stop being profitable on a $29 plan?
Around 5.3M input / 1.6M output tokens a month. Past that point, a $29 customer costs you more than they pay.
How do I reduce GPT-4o cost per customer?
Cut output tokens first (they're the priciest), cache or trim input context, route easy requests to a cheaper model, and watch the break-even point — around 8.9M input / 2.7M output tokens a $49 customer stops being profitable.

Compare this model

Other models

Key terms