Usage-based pricing for SaaS: What it is and how AI agents are breaking it

Arnon Shimoni 24 Oct 25

By now everyone knows usage-based pricing charges customers based on product consumption instead of flat subscription fees. In traditional SaaS, you paid per API call, per gigabyte stored, or per message sent.

For AI agents, this model is creating a margin crisis, and that's why usage-based is wrong for AI and specifically AI agents.

What is usage-based pricing?

Definition: A pricing model where costs scale directly with measurable product usage like API calls, compute hours, or data processed.

You pay for what you use. Nothing more, nothing less.

Usage-based pricing examples in SaaS

The model powered some of the biggest SaaS companies:

Twilio: $0.0083 per SMS, $0.0085 per voice minute
AWS: $0.023 per GB of S3 storage
SendGrid: changes, but around $0.0006 per email
Stripe: 2.9% + $0.30 per successful transaction

These work because costs stay relatively predictable. An API call costs roughly the same whether it's your first or your millionth (assuming you didn't pre-buy a package with overage fees).

Usage-based vs subscription pricing

Factor	Subscription	Usage-Based
Bill predictability	As predictable as you can get	Variable
Customer risk	You pay regardless of use	Usage can exceed what is planned
Revenue predictability	Guaranteed	Variable
Expansion potential	Limited	Unlimited
CFO preference	Preferred (makes forecasting easy)	Cautious (harder to forecast on)

Subscription: $50/month whether you use it once or 1,000 times
Usage-based: $0.01 per API call (100 calls = $1, 10,000 calls = $100)
Hybrid: $20/month base + $0.01 per API call over 1,000

Why AI agents break usage-based pricing

1. Cost variance destroys margins

Traditional SaaS: If an API call costs $0.001 and you charge $0.005, your margin holds at 80%.

compare with AI agents: A "resolved ticket" costs anywhere from $0.99 to $3.00.

One customer service platform charges $0.99 per resolved ticket. Simple questions cost them $0.04 in resources. Complex issues cost $2.80. Average margin is ~60%, but engaged customers generate losses. This variance isn't a bug. Agents assess complexity and adapt. While simple queries need one LLM call, complex ones trigger research, use memory, multiple reasoning steps, and generate more detailed responses. It's quite hard to price for average when the range is so big.

Manny Medina (our CEO who built Outreach to a $4.4B valuation) sees this playing out across the industry:

2. Autonomous decisions break attribution

Traditional software waited for instructions. Click send, email goes out, charge applies.

AI agents make independent decisions. An AI SDR might send five touchpoints instead of two, adjusting based on engagement. Do you charge for two emails or five? For the outcome or the process? For successful actions or every attempt?

Cursor faces this daily. When their AI generates code, refactors it, then fixes a bug, how many "completions" was that? Developers wanted working auth. They don't know if that costs 1,000 tokens or 10,000.

The whole metering approach of "count the things, charge per thing" breaks down here, as one outcome could have many usage events.

Flowchart showing AI agent cost breakdown: searches knowledge base ($0.02), generates response ($0.08), checks policies ($0.05), refines answer ($0.12), formats output ($0.02). Total: $0.29.

3. Token pricing is a proxy for something else, obfuscates the value

Charging for tokens maps directly to infrastructure costs and obfuscates the value. Anthropic does it. OpenAI does it, but customers think in outcomes, not tokens - which is why so many people complain about their bills.

"Implement user authentication" is clear. Estimating 8,347 input tokens and 2,156 output tokens requires expertise most don't have.

This creates two BIG problems:

Customers can't forecast. Finance needs predictability. Token consumption varies in non-intuitive ways.
Customers self-police (optimize for saving, not using). Unpredictable bills kill usage. This destroys expansion and prevents full value realization. An AI SDR that books fewer meetings but uses fewer tokens isn't more valuable.

4. The race to zero accelerates

AI model costs drop 90% annually while better models launch constantly.

When you price based on tokens or compute, your pricing tracks the collapsing cost structure. As GPT-4 gets cheaper, customers expect matching price drops.

One document processing company started at $0.50 per page. When costs dropped and competitors entered at $0.30, then $0.15, they had no defense.

Companies charging for outcomes maintained pricing power. A reviewed contract is worth the same whether processing costs $2 or $0.20.

The illusion of simplicity

Usage-based pricing feels simple in theory - you count, charge, and you're done.

But a customer service agent doesn't just "resolve tickets." It searches your knowledge base, generates responses, checks policies, formats output, and learns from interactions.

Five usage events for one outcome. Which do you charge for? All of them? How do you explain that?

The measurement gap

Usage-based pricing assumes you can measure usage accurately and cheaply.

For traditional SaaS, counting API calls is trivial. For AI agents, you need to track LLM calls across providers, attribute costs to workflows, monitor tokens in real time, calculate per-interaction margins, and catch cost spikes before they kill profitability.

Stripe counts transactions but doesn't know workflow costs. Chargebee generates invoices but can't tell you which customers are profitable.

The infrastructure gap means flying blind while thinking you're data-driven because you're "usage-based."

Why the SaaS playbook fails

Traditional SaaS had simple economics:

Development costs upfront
Marginal costs approach zero
More users = more profit
Predictable usage patterns

AI agents invert everything:

Development costs ongoing
Marginal costs scale with complexity
More usage might mean less profit
Usage patterns are autonomous

At this point, you might be thinking: "Fine, usage-based is broken. Seats are dying. What's actually supposed to work?"...

We have a surprisingly simple answer:

"You don't need seats. You don't need a ton of thinking. The moment you establish what the exchange value is going to be, then you're in good shape to price."

This sounds almost too simple. But it cuts through the complexity: figure out the value you're creating, take 20-50% of that, and build a system that lets you price differently for each customer. Not a perfect formula. Not a one-size-fits-all model. A framework for thinking about value exchange.

Martin Casado and Kyle Poyar recently wrote about why all pricing models have flaws. While some of the criticism is fair - not everything requires a PhD or is a pipe-dream. Where they are right is that usage-based pricing for AI is misaligned with the economics.

Usage-based pricing worked in SaaS because it aligned incentives. Customers paid for what was performed, and vendors captured it.

For AI agents, it creates misalignment:

Customers want outcomes but pay for consumption
Vendors need margin predictability but face 100x variance
Both unfortunately optimize for wrong metrics

Our SaaS to AI Agent transition report analyzed 250+ companies - from pure usage-based pricing correlates with 70% churn and negative margin to companies evolving to workflow or outcome models that maintain 94% margins.

At this point, you have to recognize that usage-based pricing optimizes for infrastructure efficiency when the world demands outcome delivery.

Build agents, get paid

Understand your margins and get paid for the value your agents create.

Billing Book a demo

The discovery trap: Why your best SaaS sellers are leaving money on the table with AI agents

Your top performers - the ones who've been crushing SaaS quotas for years - are systematically undervaluing AI agents by 70-80%. Not because they're bad sellers. But because they're too good at what they've always done.

5 Dec 25 • Manny

Illustration of hands holding a coin with a dollar sign, surrounded by radiant sunbeams, symbolizing prosperity and financial growth.

The discovery framework for unlocking bigger budget pools

Most SaaS companies are used to selling against the $1 software and tools budget pool. The new guard of AI natives are positioning themselves as labor replacement and fishing in the $10 labor pool.

2 Dec 25 • Manny

Silhouettes of people standing on ripple-like patterns with a large, bright sun in the background, creating a dramatic contrast.

The 5 rules for winning AI proofs of value

Winning companies will be the ones proving value in the shortest possible time, locking out competitors and creating customers who succeed - and who stick.

28 Nov 25 • Manny

Illustration of a central yellow padlock with swirling, abstract black and white patterns surrounding it, creating a sense of movement.

Ready to get Paid?

Price smarter. Protect margins. Grow revenue.

Get started for free

Generous free tier
Transparent pricing
Stripe integration

Get started

Scale with enterprise

Unlimited usage, no limits
Multiple payment gateways
Hands-on engineering support

Book a demo