- Monitor container metrics
- Send Alerts when a container is stopped / down
We are going to use the following tools.
cadvisor — https://github.com/google/cadvisor
Analyzes resource usage and performance characteristics of running containers.
Prometheus — https://prometheus.io/
Event monitoring and alerting. It records real-time metrics in a time series database built using an HTTP pull model, with flexible queries and real-time alerting.
Grafana — https://grafana.com/
Multi-platform open-source analytics and interactive visualization web application. It provides charts, graphs, and alerts.
Install cadvisor on the host you…