29 Commits

Author SHA1 Message Date
Gregor Michels eadcf6f296 monitoring: extend ifInErrors alert to non-snmp devices
also automatically clear alarm after 2 hours because linux devices have
no way to clear the nic error counters
2023-04-18 21:00:04 +02:00
Gregor Michels 2299e3aff1 monitoring: make summary and description for snmp alarms more verbose 2023-03-23 00:07:23 +01:00
Gregor Michels d1c1f34bf8 monitoring: alert on snmp if{In,Out}Errors 2023-03-22 23:53:39 +01:00
Gregor Michels 01c9fa2317 accesspoints: expose airtime information 2023-03-07 23:59:58 +01:00
Gregor Michels a837a2b916 playbook_provision_backbone: configure backbone for sax-rgs-gw-core01 2023-01-17 23:54:07 +01:00
Gregor Michels 032937c7ea add mowoe as a maintainer
welcome to the team :)
2022-12-30 20:00:47 +01:00
Gregor Michels 51a8de4299 ffl-ans-gw-core01: move offloader network hook into /usr/lib 2022-12-23 13:30:03 +01:00
Gregor Michels 1ea236b206 ffl-ans-gw-core01: finally put offloader vm setup into ansible 2022-12-23 13:22:38 +01:00
Gregor Michels 0475923590 alerting: only alarm on devices that are unreachable for at least 1m 2022-12-22 16:37:15 +01:00
Gregor Michels 69834a8d2b alerting: also alert on reboots of snmp devices 2022-12-22 16:37:15 +01:00
Gregor Michels e3b111f2c7 monitoring: monitor switches in the ANS via snmp 2022-11-21 02:58:13 +01:00
Gregor Michels 5fa5b13da7 monitoring: install snmp_exporter 2022-11-21 02:56:59 +01:00
Gregor Michels 9cfee1f384 monitoring: add alerting rules for disks running out of space 2022-11-19 01:58:14 +01:00
Gregor Michels 8389a18488 monitoring: move prometheus stack onto eae-adp-jump01
to be able to also monitor the new site.

custom grafana dashboard broke while transferring stack.
will fix next
2022-11-17 00:35:57 +01:00
Gregor Michels 8370f150a6 add lodrich as a maintainer 2022-11-12 21:48:27 +01:00
Gregor Michels 8d4fc76a81 playbook_provision_backbone: configure backbone for ffl-ans-gw-core01 2022-11-10 02:06:52 +01:00
Gregor Michels ec917a24c6 monitoring: add alarm "PublicWifiUpstreamLost" 2022-10-19 02:05:32 +02:00
Gregor Michels f0115625f6 monitoring: add end to end tests to monitor internet reachability
via icmp (blackbox exporter)

There are two exporters.
One lives inside `monitoring01` and uses the "normal" route into the
internet without a vpn (job: `e2e_default_v4`).

The other one lives inside `mon-e2e-clients01` and routes into the
internet via the vpn (job: `e2e_clients_v4`).
2022-09-14 03:12:22 +02:00
Gregor Michels 6623cc0e09 monitoring: alert on node reboots 2022-09-14 02:16:15 +02:00
Gregor Michels ba014a64d0 wifi.lua: add wifi_clients metric 2022-07-25 02:00:56 +02:00
Gregor Michels 7b223d7053 add vanilla wifi.lua
from `prometheus-node-exporter-lua-wifi` package
2022-07-25 01:59:52 +02:00
Gregor Michels 5a21b2cd88 monitoring: prometheus: add simple alerting rule 2022-07-13 01:27:07 +02:00
Gregor Michels e3210198ff eae-adp-jump01: install prometheus node_exporter 2022-07-03 02:14:44 +02:00
Gregor Michels 8b5ff0aeed pass: add clarifying notes about gpg keys 2022-07-01 02:24:06 +02:00
Gregor Michels 29b790931c add ssh keys of @katzenparadoxon 2022-07-01 02:15:11 +02:00
Gregor Michels 2de716a405 poc for tunnel provisioning 2022-06-28 21:59:22 +02:00
Gregor Michels dbe8978987 add vm eae-adp-jump01
with a basic playbook for configuration
2022-06-28 00:11:01 +02:00
Gregor Michels 153c835b1a add install.conf for eae-adp-jump01 2022-06-26 22:46:45 +02:00
Gregor Michels 71f4ee9c5f initial commit 2022-06-22 02:05:55 +02:00