Usage-based pricing for SaaS: What it is and how AI agents are breaking it

Abstract art with geometric stairs on the left and swirling yellow patterns on the right against a black background.
A heashot of Arnon Shimoni, co-founder & marketing at Paid.ai.
Arnon Shimoni 24 Oct 25

By now everyone knows usage-based pricing charges customers based on product consumption instead of flat subscription fees. In traditional SaaS, you paid per API call, per gigabyte stored, or per message sent.

For AI agents, this model is creating a margin crisis, and that's why usage-based is wrong for AI and specifically AI agents.

What is usage-based pricing?

Definition: A pricing model where costs scale directly with measurable product usage like API calls, compute hours, or data processed.

You pay for what you use. Nothing more, nothing less.

Usage-based pricing examples in SaaS

The model powered some of the biggest SaaS companies:

  • Twilio: $0.0083 per SMS, $0.0085 per voice minute
  • AWS: $0.023 per GB of S3 storage
  • SendGrid: changes, but around $0.0006 per email
  • Stripe: 2.9% + $0.30 per successful transaction

These work because costs stay relatively predictable. An API call costs roughly the same whether it's your first or your millionth (assuming you didn't pre-buy a package with overage fees).

Usage-based vs subscription pricing

Factor

Subscription

Usage-Based

Bill predictability

As predictable as you can get

Variable

Customer risk

You pay regardless of use

Usage can exceed what is planned

Revenue predictability

Guaranteed

Variable

Expansion potential

Limited

Unlimited

CFO preference

Preferred (makes forecasting easy)

Cautious (harder to forecast on)

  • Subscription: $50/month whether you use it once or 1,000 times
  • Usage-based: $0.01 per API call (100 calls = $1, 10,000 calls = $100)
  • Hybrid: $20/month base + $0.01 per API call over 1,000

Why AI agents break usage-based pricing

1. Cost variance destroys margins

Traditional SaaS: If an API call costs $0.001 and you charge $0.005, your margin holds at 80%.

compare with AI agents: A "resolved ticket" costs anywhere from $0.99 to $3.00.

One customer service platform charges $0.99 per resolved ticket. Simple questions cost them $0.04 in resources. Complex issues cost $2.80. Average margin is ~60%, but engaged customers generate losses. This variance isn't a bug. Agents assess complexity and adapt. While simple queries need one LLM call, complex ones trigger research, use memory, multiple reasoning steps, and generate more detailed responses. It's quite hard to price for average when the range is so big.

Manny Medina (our CEO who built Outreach to a $4.4B valuation) sees this playing out across the industry:

Play

2. Autonomous decisions break attribution

Traditional software waited for instructions. Click send, email goes out, charge applies.

AI agents make independent decisions. An AI SDR might send five touchpoints instead of two, adjusting based on engagement. Do you charge for two emails or five? For the outcome or the process? For successful actions or every attempt?

Cursor faces this daily. When their AI generates code, refactors it, then fixes a bug, how many "completions" was that? Developers wanted working auth. They don't know if that costs 1,000 tokens or 10,000.

The whole metering approach of "count the things, charge per thing" breaks down here, as one outcome could have many usage events.

Flowchart showing AI agent cost breakdown: searches knowledge base ($0.02), generates response ($0.08), checks policies ($0.05), refines answer ($0.12), formats output ($0.02). Total: $0.29.

3. Token pricing is a proxy for something else, obfuscates the value

Charging for tokens maps directly to infrastructure costs and obfuscates the value. Anthropic does it. OpenAI does it, but customers think in outcomes, not tokens - which is why so many people complain about their bills.

"Implement user authentication" is clear. Estimating 8,347 input tokens and 2,156 output tokens requires expertise most don't have.

This creates two BIG problems:

  1. Customers can't forecast. Finance needs predictability. Token consumption varies in non-intuitive ways.
  2. Customers self-police (optimize for saving, not using). Unpredictable bills kill usage. This destroys expansion and prevents full value realization. An AI SDR that books fewer meetings but uses fewer tokens isn't more valuable.

4. The race to zero accelerates

AI model costs drop 90% annually while better models launch constantly.

When you price based on tokens or compute, your pricing tracks the collapsing cost structure. As GPT-4 gets cheaper, customers expect matching price drops.

One document processing company started at $0.50 per page. When costs dropped and competitors entered at $0.30, then $0.15, they had no defense.

Companies charging for outcomes maintained pricing power. A reviewed contract is worth the same whether processing costs $2 or $0.20.

The illusion of simplicity

Usage-based pricing feels simple in theory - you count, charge, and you're done.

But a customer service agent doesn't just "resolve tickets." It searches your knowledge base, generates responses, checks policies, formats output, and learns from interactions.

Five usage events for one outcome. Which do you charge for? All of them? How do you explain that?

The measurement gap

Usage-based pricing assumes you can measure usage accurately and cheaply.

For traditional SaaS, counting API calls is trivial. For AI agents, you need to track LLM calls across providers, attribute costs to workflows, monitor tokens in real time, calculate per-interaction margins, and catch cost spikes before they kill profitability.

Stripe counts transactions but doesn't know workflow costs. Chargebee generates invoices but can't tell you which customers are profitable.

The infrastructure gap means flying blind while thinking you're data-driven because you're "usage-based."

Why the SaaS playbook fails

Traditional SaaS had simple economics:

  • Development costs upfront
  • Marginal costs approach zero
  • More users = more profit
  • Predictable usage patterns

AI agents invert everything:

  • Development costs ongoing
  • Marginal costs scale with complexity
  • More usage might mean less profit
  • Usage patterns are autonomous

At this point, you might be thinking: "Fine, usage-based is broken. Seats are dying. What's actually supposed to work?"...

We have a surprisingly simple answer:

Play

"You don't need seats. You don't need a ton of thinking. The moment you establish what the exchange value is going to be, then you're in good shape to price."

This sounds almost too simple. But it cuts through the complexity: figure out the value you're creating, take 20-50% of that, and build a system that lets you price differently for each customer. Not a perfect formula. Not a one-size-fits-all model. A framework for thinking about value exchange.


Martin Casado and Kyle Poyar recently wrote about why all pricing models have flaws. While some of the criticism is fair - not everything requires a PhD or is a pipe-dream. Where they are right is that usage-based pricing for AI is misaligned with the economics.

Usage-based pricing worked in SaaS because it aligned incentives. Customers paid for what was performed, and vendors captured it.

For AI agents, it creates misalignment:

  • Customers want outcomes but pay for consumption
  • Vendors need margin predictability but face 100x variance
  • Both unfortunately optimize for wrong metrics

Our SaaS to AI Agent transition report analyzed 250+ companies - from pure usage-based pricing correlates with 70% churn and negative margin to companies evolving to workflow or outcome models that maintain 94% margins.

At this point, you have to recognize that usage-based pricing optimizes for infrastructure efficiency when the world demands outcome delivery.

Build agents, get paid

Understand your margins and get paid for the value your agents create.

A background image showing a graph.

Related articles

Ready to get Paid?

Price smarter. Protect margins. Grow revenue.

Get started for free

  • Generous free tier
  • Transparent pricing
  • Stripe integration
Get started

Scale with enterprise

  • Unlimited usage, no limits
  • Multiple payment gateways
  • Hands-on engineering support
Book a demo