Monitoring K8s With Prometheus Running In Federated Mode Integrated With Thanos

How to query metrics across multiple clusters?

  1. Metrics view from a single pane of glass: Imagine you have deployed Prometheus on 50 clusters. You don't want to jump between Prometheus instances to query the metrics of each cluster.
  2. Long-term metric storage: How do we store metrics for the long term? Prometheus has a default retention of 15 days, so any samples I ingest are kept in the TSDB for only 15 days. What if I want to query data from several months ago?
  3. High availability (HA): What if Prometheus goes down for any reason? What happens to our entire metrics system?
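
To make the first problem concrete, here is a minimal sketch of what "jumping between Prometheus instances" looks like when done by hand: the same PromQL expression is sent to every cluster's instant-query endpoint (`/api/v1/query`), and the answers are stitched together with a `cluster` label added by us. The cluster names, base URLs, and both helper functions are hypothetical and only serve as illustration; the sketch builds URLs and merges already-fetched results, so it makes no network calls.

```python
from urllib.parse import urlencode

# Hypothetical cluster names and Prometheus base URLs (not from the article).
CLUSTERS = {
    "cluster-a": "http://prometheus.cluster-a.example:9090",
    "cluster-b": "http://prometheus.cluster-b.example:9090",
}


def build_query_urls(promql, clusters=CLUSTERS):
    """Return one instant-query URL per cluster for the same PromQL expression."""
    # /api/v1/query is the Prometheus HTTP API endpoint for instant queries.
    return {
        name: f"{base}/api/v1/query?{urlencode({'query': promql})}"
        for name, base in clusters.items()
    }


def merge_results(per_cluster):
    """Flatten per-cluster result lists, tagging each series with its cluster."""
    merged = []
    for name, series_list in per_cluster.items():
        for series in series_list:
            # Copy the series labels and add the cluster it came from.
            labels = dict(series.get("metric", {}), cluster=name)
            merged.append({"metric": labels, "value": series.get("value")})
    return merged
```

With 50 clusters this means 50 requests and a hand-rolled merge for every query, which is exactly the bookkeeping a single global query view is meant to take off our hands.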

Kubernetes Together With Prometheus Architecture

Real Architecture Of Thanos



