Rules

containers

10.713s ago

1.949ms

Rule State Error Last Evaluation Evaluation Time
alert: graphnode_down expr: absent((time() - container_last_seen{name="graph-node"}) < 10) for: 30s labels: severity: critical annotations: description: Graph Node container is down for more than 30 seconds. summary: Graph Node down ok 10.714s ago 434.7us
alert: graphnode_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="graph-node"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Graph Node CPU usage is {{ humanize $value}}%. summary: Graph Node high CPU usage ok 10.713s ago 229.5us
alert: graphnode_high_memory expr: sum(container_memory_usage_bytes{name="graph-node"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Graph Node memory consumption is at {{ humanize $value}}. summary: Graph Node high memory usage ok 10.713s ago 129.9us
alert: postgres_down expr: absent((time() - container_last_seen{name="postgres"}) < 10) for: 30s labels: severity: critical annotations: description: Postgres container is down for more than 30 seconds. summary: Postgres down ok 10.713s ago 217.9us
alert: postgres_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="postgres"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Postgres CPU usage is {{ humanize $value}}%. summary: Postgres high CPU usage ok 10.713s ago 103.1us
alert: postgres_high_memory expr: sum(container_memory_usage_bytes{name="postgres"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Postgres memory consumption is at {{ humanize $value}}. summary: Postgres high memory usage ok 10.713s ago 40.89us
alert: nginx_down expr: absent((time() - container_last_seen{name="nginx-proxy"}) < 10) for: 30s labels: severity: critical annotations: description: Nginx container is down for more than 30 seconds. summary: Nginx down ok 10.713s ago 114us
alert: nginx_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="nginx-proxy"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: PostNginxgres CPU usage is {{ humanize $value}}%. summary: Nginx high CPU usage ok 10.713s ago 165.9us
alert: nginx_high_memory expr: sum(container_memory_usage_bytes{name="nginx-proxy"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Nginx memory consumption is at {{ humanize $value}}. summary: Nginx high memory usage ok 10.713s ago 95.03us
alert: caddy_down expr: absent((time() - container_last_seen{name="caddy"}) < 10) for: 30s labels: severity: critical annotations: description: Caddy container is down for more than 30 seconds. summary: Caddy down ok 10.713s ago 195.5us
alert: caddy_high_cpu expr: sum(rate(container_cpu_usage_seconds_total{name="caddy"}[1m])) / count(node_cpu_seconds_total{mode="system"}) * 100 > 10 for: 30s labels: severity: warning annotations: description: Caddy CPU usage is {{ humanize $value}}%. summary: Caddy high CPU usage ok 10.713s ago 136.9us
alert: caddy_high_memory expr: sum(container_memory_usage_bytes{name="caddy"}) > 1.2e+09 for: 30s labels: severity: warning annotations: description: Caddy memory consumption is at {{ humanize $value}}. summary: Caddy high memory usage ok 10.713s ago 67.79us

host

5.493s ago

654.4us

Rule State Error Last Evaluation Evaluation Time
alert: high_cpu_load expr: node_load1 > 1.5 for: 30s labels: severity: warning annotations: description: Docker host is under high load, the avg load 1m is at {{ $value}}. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server under high load ok 5.493s ago 167.3us
alert: high_memory_load expr: (sum(node_memory_MemTotal_bytes) - sum(node_memory_MemFree_bytes + node_memory_Buffers_bytes + node_memory_Cached_bytes)) / sum(node_memory_MemTotal_bytes) * 100 > 85 for: 30s labels: severity: warning annotations: description: Docker host memory usage is {{ humanize $value}}%. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server memory is almost full ok 5.493s ago 241.4us
alert: high_storage_load expr: (node_filesystem_size_bytes{fstype="aufs"} - node_filesystem_free_bytes{fstype="aufs"}) / node_filesystem_size_bytes{fstype="aufs"} * 100 > 85 for: 30s labels: severity: warning annotations: description: Docker host storage usage is {{ humanize $value}}%. Reported by instance {{ $labels.instance }} of job {{ $labels.job }}. summary: Server storage is almost full ok 5.493s ago 235us

targets

13.955s ago

1.152ms

Rule State Error Last Evaluation Evaluation Time
alert: monitor_service_down expr: up == 0 for: 30s labels: severity: critical annotations: description: Service {{ $labels.instance }} is down. summary: Monitor service non-operational ok 13.955s ago 1.143ms