What Enterprises Should Demand in CaaS Contracts
Enterprise buyers considering large CaaS commitments should consider riders or contract language addressing the following issues.
1. Compute-Cost Review Rights
The contract should include a scheduled review of task pricing if the underlying cost of AI execution materially declines.
This does not require the vendor to disclose every infrastructure detail. But it should create a formal mechanism for reassessing rates when compute economics change.
Useful language:
Because the contracted services are delivered through compute-dependent AI systems, the parties agree that material reductions in the cost or resource intensity required to perform covered tasks shall trigger a good-faith pricing review.
2. Clarity-Layer Adjustment
If the customer improves its own content, workflows, metadata, provenance, or machine-readable structure, the vendor should not be the only party that benefits.
Useful language:
The customer shall not be commercially disadvantaged for improving the clarity, structure, or machine-readability of its content, data, workflows, or authority declarations. Any measurable reduction in agentic execution cost attributable to such improvements shall be considered in task pricing, renewal pricing, or usage-credit adjustments.
This is where MSP-1 becomes procurement-relevant.
It gives the buyer a concrete way to say:
We are reducing your inference burden. That reduction should be commercially recognized.
3. Failed Task and Retry Treatment
In task-based AI billing, the buyer should define what counts as a billable task.
Questions to address:
- Does a failed task count?
- Does a repeated task count?
- Does an avoidable hallucination count?
- Does a human-escalated task count at the same rate?
- Does a task caused by poor model interpretation count as billable work?
The contract should distinguish between successful task completion, partial completion, failed execution, retry caused by platform error, retry caused by customer ambiguity, and human escalation.
Without this distinction, the buyer may pay for failure as if it were value.
4. Usage and Resource Transparency
The buyer may not need raw token logs, but it should receive meaningful operational reporting.
At minimum, enterprise reporting should distinguish:
- task volume
- successful completions
- failed completions
- retry rates
- average interaction depth
- escalation rates
- source-grounding events
- policy-blocked actions
- approximate resource intensity tiers
This allows procurement teams to ask whether they are paying for useful work or for repeated attempts to overcome unclear operating conditions.
5. Benchmarking Rights
Long-term CaaS contracts should allow periodic benchmarking against market pricing, internal tests, or mutually agreed performance baselines.
The buyer should be able to ask:
- Are comparable tasks becoming cheaper elsewhere?
- Has the vendor’s model stack become more efficient?
- Has the buyer’s clarity layer reduced task complexity?
- Are renewal rates reflecting those changes?
Without benchmarking, the buyer may be locked into early-market pricing long after the underlying economics have improved.
6. Liability and Governance Alignment
Task-based pricing creates task-based accountability pressure.
If a vendor markets an AI agent as performing business work, the contract should clearly define where responsibility sits when that work fails.
Important areas include:
- audit trails
- approval gates
- human-in-the-loop thresholds
- source-grounding requirements
- policy constraints
- role permissions
- rollback procedures
- customer notification obligations
- liability allocation for erroneous agent actions
A clarity layer can support this governance structure by giving agents cleaner context before acting.
That means clarity is not only an efficiency issue.
It is a liability issue.