Core Concepts
Reasoning
Memory & Retrieval
Agent Types
Design Patterns
Training & Alignment
Frameworks
Tools
Safety & Security
Evaluation
Meta
Core Concepts
Reasoning
Memory & Retrieval
Agent Types
Design Patterns
Training & Alignment
Frameworks
Tools
Safety & Security
Evaluation
Meta
An AI Service Level Agreement (AI-SLA) is a contractual document that defines the performance standards, metrics, and remedies applicable to AI-powered services. 1) Unlike traditional SLAs that focus primarily on uptime and response time, AI-SLAs must address the unique characteristics of AI systems including model accuracy, inference latency, output quality, fairness, and drift monitoring.
Traditional SLAs measure deterministic software behavior: the service is either available or it is not, and response times are predictable. AI systems introduce stochastic behavior where outputs can vary, models can degrade over time, and quality metrics extend beyond simple availability. 2)
AI-SLAs must account for:
Standard uptime commitments remain foundational. Cloud providers typically offer 99.5 to 99.9 percent monthly uptime for AI services, with service credits for breaches. 3) Downtime calculations exclude planned maintenance windows with advance notice.
AI-SLAs commonly define tiered service levels:
When service levels are not met, AI-SLAs typically provide financial credits as the sole remedy. Credit structures are usually tiered based on the severity and duration of the breach. For example, falling below 99.9 percent uptime may trigger a 10 percent credit, while falling below 99.0 percent may trigger a 30 percent credit. 6)
Common exclusions from AI-SLA calculations include: