Thanos is an open-source project, hosted by the Cloud Native Computing Foundation (CNCF), that extends Prometheus with long-term metrics storage and a global query view. It addresses the scalability limits of standalone Prometheus deployments with a tiered storage architecture that separates real-time metrics collection from historical data retention 1).
Thanos solves the challenge of scaling Prometheus metrics collection across distributed systems. While Prometheus excels at short-term local metrics scraping and querying, it lacks native support for long-term retention and global aggregation across multiple Prometheus instances. Thanos bridges this gap by implementing a layered approach: local Prometheus instances continue handling real-time metrics with their standard retention policies, while Thanos components manage longer-term storage in object storage backends and provide unified query interfaces across multiple Prometheus clusters 2).
Thanos employs a modular architecture comprising several key components. The Sidecar runs alongside each Prometheus instance, exposing that instance's data to the rest of Thanos through the gRPC Store API and uploading completed TSDB blocks to object storage. The Query component provides a global query interface compatible with PromQL (Prometheus Query Language), allowing unified queries across multiple Prometheus instances and historical data. The Store Gateway serves metrics from object storage, enabling access to long-term historical data. The Compactor compacts and downsamples blocks in object storage over time, reducing storage costs and keeping queries over long time ranges fast 3).
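The downsampling idea behind the Compactor can be illustrated with a minimal sketch. It groups raw samples into fixed windows and keeps a few aggregates per window instead of every sample. The `Aggregate` type, the `downsample` function, the five-minute window, and the sample data are all assumptions made for illustration; this is not Thanos's implementation or its on-disk format.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Tuple


@dataclass
class Aggregate:
    """Summary of all raw samples that fall into one downsampling window."""
    count: int
    total: float
    minimum: float
    maximum: float


def downsample(samples: Iterable[Tuple[int, float]], window_ms: int) -> Dict[int, Aggregate]:
    """Group (timestamp_ms, value) samples into fixed windows keyed by window start."""
    windows: Dict[int, Aggregate] = {}
    for ts, value in samples:
        start = ts - (ts % window_ms)  # align the timestamp to its window start
        agg = windows.get(start)
        if agg is None:
            windows[start] = Aggregate(1, value, value, value)
        else:
            agg.count += 1
            agg.total += value
            agg.minimum = min(agg.minimum, value)
            agg.maximum = max(agg.maximum, value)
    return windows


FIVE_MINUTES_MS = 5 * 60 * 1000  # assumed window size for this sketch

# Hypothetical raw series: one sample every 60 seconds for 10 minutes.
raw = [(i * 60_000, float(i)) for i in range(10)]
result = downsample(raw, FIVE_MINUTES_MS)

for start, agg in sorted(result.items()):
    print(start, agg)
```

Ten raw samples collapse into two windows here. Keeping count, sum, minimum, and maximum per window still lets a query over a long range compute averages and extremes without reading every raw sample.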
The tiered storage architecture enables cost-effective long-term retention by supporting various object storage backends including Amazon S3, Google Cloud Storage, Azure Blob Storage, and OpenStack Swift. This flexibility allows organizations to choose storage solutions matching their infrastructure and compliance requirements.
A critical feature of Thanos is its PromQL compatibility, which allows existing Prometheus queries and dashboards to keep working. The Query component exposes the Prometheus HTTP query API and supports the PromQL language, including aggregation operators, mathematical functions, and time-range selectors. Organizations can therefore move from standalone Prometheus deployments to Thanos-based infrastructure without rewriting monitoring logic; Grafana dashboards typically only need their data source pointed at the Query endpoint 4).
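In practice, this compatibility means a client can send the same Prometheus-style HTTP request to a Thanos Query endpoint that it would send to a Prometheus server. The sketch below only builds the request URL for an instant query against `/api/v1/query`; the host name, port, and metric name are hypothetical, and no network call is made.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def build_instant_query_url(base_url: str, promql: str) -> str:
    """Build a Prometheus-compatible instant-query URL for the given PromQL expression."""
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': promql})}"


# Hypothetical endpoint and metric name, used only for illustration.
QUERY_ENDPOINT = "http://thanos-query.example:10902"
EXPRESSION = "sum(rate(http_requests_total[5m]))"

url = build_instant_query_url(QUERY_ENDPOINT, EXPRESSION)

# Decode the query string again to confirm the expression survives URL encoding.
decoded = parse_qs(urlsplit(url).query)
print(url)
print(decoded["query"][0])
```

Because only the base URL differs, switching an existing client from Prometheus to Thanos in this sketch amounts to changing `QUERY_ENDPOINT`.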
Enterprise platforms have built upon Thanos's foundation to create enhanced monitoring solutions. Projects such as Databricks Pantheon leverage Thanos's tiered storage architecture and PromQL compatibility while adding custom control plane implementations and operational optimizations tailored to specific use cases at scale 5).
In a typical Thanos deployment, Prometheus instances scrape metrics from application targets, Sidecar components upload blocks to object storage, and Query components provide centralized query endpoints. This architecture scales to deployments managing trillions of metric samples, supporting both short-term real-time monitoring and long-term trend analysis. Organizations deploying Thanos benefit from reduced local storage requirements on Prometheus servers, faster queries over historical data through downsampling, and lower storage costs through retention and compaction policies.
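One way to picture how a central query endpoint combines recent and historical data is to treat each backend as covering a time range and to query only the backends whose range overlaps the request. The following is a simplified sketch under that assumption; the `Store` type, the `select_stores` function, the store names, and the time ranges are hypothetical, and a real deployment involves further details that are omitted here.

```python
from dataclasses import dataclass
from typing import List

HOUR = 3600  # seconds


@dataclass(frozen=True)
class Store:
    """A queryable backend and the time range (in seconds) it can serve."""
    name: str
    min_time: int
    max_time: int


def select_stores(stores: List[Store], start: int, end: int) -> List[str]:
    """Return names of stores whose [min_time, max_time] overlaps [start, end]."""
    return [s.name for s in stores if s.min_time <= end and s.max_time >= start]


now = 1000 * HOUR  # arbitrary reference time for the example

# Hypothetical topology: two Sidecars holding recent data, one Store Gateway
# serving older blocks from object storage.
stores = [
    Store("sidecar-a", now - 6 * HOUR, now),
    Store("sidecar-b", now - 6 * HOUR, now),
    Store("store-gateway", now - 900 * HOUR, now - 2 * HOUR),
]

last_hour = select_stores(stores, now - 1 * HOUR, now)
yesterday = select_stores(stores, now - 48 * HOUR, now - 24 * HOUR)
last_half_day = select_stores(stores, now - 12 * HOUR, now)

print(last_hour)
print(yesterday)
print(last_half_day)
```

In this example a query for the last hour touches only the Sidecars, a query for the previous day touches only the Store Gateway, and a twelve-hour query spans both tiers.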
As a CNCF project, Thanos operates under the Foundation's open governance model and community-driven development processes. The project maintains active community contributions, regular releases incorporating bug fixes and performance improvements, and comprehensive documentation supporting both basic and advanced deployments.