Rules

containers

7.048s ago

1.719ms

Rule State Error Last Evaluation Evaluation Time
alert: graphnode_down expr: absent((time() - container_last_seen{name="graph-node"}) < 10) for: 30s labels: severity: critical annotations: description: Graph Node container is down for more than 30 seconds. summary: Graph Node down ok 7.051s ago 384.1us
alert: graphnode_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="graph-node"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Graph Node CPU usage is {{ humanize $value}}%. summary: Graph Node high CPU usage ok 7.05s ago 228.3us
alert: graphnode_high_memory expr: sum(container_memory_usage_bytes{name="graph-node"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Graph Node memory consumption is at {{ humanize $value}}. summary: Graph Node high memory usage ok 7.05s ago 115.5us
alert: postgres_down expr: absent((time() - container_last_seen{name="postgres"}) < 10) for: 30s labels: severity: critical annotations: description: Postgres container is down for more than 30 seconds. summary: Postgres down ok 7.05s ago 173.3us
alert: postgres_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="postgres"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Postgres CPU usage is {{ humanize $value}}%. summary: Postgres high CPU usage ok 7.05s ago 107.8us
alert: postgres_high_memory expr: sum(container_memory_usage_bytes{name="postgres"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Postgres memory consumption is at {{ humanize $value}}. summary: Postgres high memory usage ok 7.05s ago 68.44us
alert: nginx_down expr: absent((time() - container_last_seen{name="nginx-proxy"}) < 10) for: 30s labels: severity: critical annotations: description: Nginx container is down for more than 30 seconds. summary: Nginx down ok 7.05s ago 88.04us
alert: nginx_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="nginx-proxy"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: PostNginxgres CPU usage is {{ humanize $value}}%. summary: Nginx high CPU usage ok 7.05s ago 141.2us
alert: nginx_high_memory expr: sum(container_memory_usage_bytes{name="nginx-proxy"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Nginx memory consumption is at {{ humanize $value}}. summary: Nginx high memory usage ok 7.05s ago 80.23us
alert: caddy_down expr: absent((time() - container_last_seen{name="caddy"}) < 10) for: 30s labels: severity: critical annotations: description: Caddy container is down for more than 30 seconds. summary: Caddy down ok 7.05s ago 186.1us
alert: caddy_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="caddy"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Caddy CPU usage is {{ humanize $value}}%. summary: Caddy high CPU usage ok 7.05s ago 97us
alert: caddy_high_memory expr: sum(container_memory_usage_bytes{name="caddy"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Caddy memory consumption is at {{ humanize $value}}. summary: Caddy high memory usage ok 7.05s ago 32.33us

host

1.83s ago

630.5us

Rule State Error Last Evaluation Evaluation Time
alert: high_cpu_load expr: node_load1 > 1.5 for: 30s labels: severity: warning annotations: description: Docker host is under high load, the avg load 1m is at {{ $value}}. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server under high load ok 1.83s ago 240.4us
alert: high_memory_load expr: (sum(node_memory_MemTotal_bytes) - sum(node_memory_MemFree_bytes + node_memory_Buffers_bytes + node_memory_Cached_bytes)) / sum(node_memory_MemTotal_bytes) * 100 > 85 for: 30s labels: severity: warning annotations: description: Docker host memory usage is {{ humanize $value}}%. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server memory is almost full ok 1.83s ago 252.8us
alert: high_storage_load expr: (node_filesystem_size_bytes{fstype="aufs"} - node_filesystem_free_bytes{fstype="aufs"}) / node_filesystem_size_bytes{fstype="aufs"} * 100 > 85 for: 30s labels: severity: warning annotations: description: Docker host storage usage is {{ humanize $value}}%. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server storage is almost full ok 1.829s ago 125.8us

targets

10.292s ago

1.176ms

Rule State Error Last Evaluation Evaluation Time
alert: monitor_service_down expr: up == 0 for: 30s labels: severity: critical annotations: description: Service {{ $labels.instance }} is down. summary: Monitor service non-operational ok 10.292s ago 1.168ms