Commit graph

89 commits

Author SHA1 Message Date
e00cc2d4dd
fix typos and file layout for yamllint 2024-10-07 09:19:54 +02:00
665dce8506
fix prometheus values 2024-10-03 20:13:09 +02:00
6a60c2e48e
prometheus changes 2024-10-03 14:38:40 +02:00
d2a37063b3
try to fix prometheus helm values 2024-10-03 12:27:41 +02:00
ee0909b968
add thanos as datasource to grafana 2024-10-02 22:01:03 +02:00
a88ac3b91a
add remotewrite to prometheus 2024-10-02 21:17:25 +02:00
a3acde7f63
change longhorn storageclass 2024-09-30 22:39:58 +02:00
39fd1f7d0f
prometheus: add k8s resources to kustomize 2024-09-30 22:04:39 +02:00
262ae950ff
add secrets (what could go wrong lol) 2024-09-30 21:52:57 +02:00
5f9fbb1a83
set Grafana Role via SSO/ProxyAuth 2024-09-29 16:03:50 +02:00
7bd1d487f7
grafana: set whitelist for auth 2024-09-29 15:41:25 +02:00
40ed383da8
reconfigure grafana for sso 2024-09-29 15:23:05 +02:00
04b745d0d6
add SSO to prometheus, longhorn, alertmanager 2024-09-29 14:10:34 +02:00
4bf5c5b9dd
prometheus/values.yaml: change settings for 63.x compatibility 2024-09-28 23:54:11 +02:00
e1ed098915
Adjust ingress tls values for cert-manager 2024-05-28 17:44:20 +02:00
ceed3ed4bd
delete manual rule overrides and handle it with helm values instead 2024-03-17 12:36:18 +01:00
ae67aa88e9
prometheus/values.yaml: Fix pod antiaffinity 2024-03-16 13:06:47 +01:00
7e20805c34
change grafana volume to RWX 2024-02-23 13:00:56 +01:00
285c0b53d3
fix update strategy 2024-02-23 12:51:54 +01:00
986cdef4d6
set Updatestrategy for grafana (this should fix ) 2024-02-23 12:46:11 +01:00
a181eb3fec
Revert "try to fix prometheus"
This reverts commit d4727c0923.
2024-02-20 20:31:36 +01:00
d4727c0923
try to fix prometheus 2024-02-20 20:26:06 +01:00
8f15467a36
try to fix 3 2024-02-18 06:59:06 +01:00
bce6e8f315
switch to traefik 2 2024-02-18 06:17:03 +01:00
5654baa437
add pod affinity for alertmanager 2024-02-09 04:41:59 +01:00
878a2de21d
change prometheus metric storage values 2024-02-06 22:27:49 +01:00
3c02a16714
Prometheus: remove waiting time for KubeNodeUnreachable Alert 2024-02-01 22:44:57 +01:00
921306dcdc
PROMETHEUS: move alerts to this repo to allow modifications 2024-02-01 22:00:37 +01:00
a2a306c195
add inhibition rule to alertmanager 2024-01-31 16:19:42 +01:00
30b7c96833
add ECC alert (closes ) 2024-01-29 19:28:16 +01:00
0be2949c50
rework storage to reduce backup load 2024-01-26 13:39:13 +01:00
dad89f524c
prometheus/values.yaml: Prevent all replicas on the same node 2023-12-25 10:19:03 +01:00
5a9bb1850e
change alert inhibition rules 2023-12-18 17:33:47 +01:00
4c6bf59f9e
prometheus/values.yaml: avoid all pods on the same node 2023-11-26 20:41:43 +01:00
11f471a711
prometheus/alerts.yaml: increase temperature limit to 90 2023-11-25 18:21:45 +01:00
cf76be1d39
add longhorn monitoring 2023-11-24 20:32:50 +01:00
a441ff630b
Prometheus: change DiskspaceLow Alert 2023-11-23 20:35:46 +01:00
2207baf8e2
fix type error 2023-10-23 18:32:49 +02:00
8c5f6beca7
add label to prometheus namespace 2023-10-23 18:31:47 +02:00
e5cd0a214f
Tell Prometheus to only pick up rules from namespaces with label "prometheus: yolokube" 2023-10-23 18:05:29 +02:00
53be807c0b prometheus/ingress.yaml aktualisiert 2023-09-20 22:15:41 +02:00
d22605c1d9
fix alertmanager 2023-09-15 01:43:41 +02:00
94c2a34aac
try to fix prometheus 2 2023-08-31 00:29:12 +02:00
778306127f
try to fix prometheus
try to fix prometheus 2

try to fix prometheus 3
2023-08-30 22:56:03 +02:00
ffaf6a079e
put alertmanager config back into helm values 2023-08-30 21:27:13 +02:00
69dde5d035
enable persistence for grafana 2023-06-29 12:02:54 +02:00
deba86906d revert memory rule changes (back to 80%)
Signed-off-by: Tom Neuber <tomneuber@web.de>
2023-06-24 18:50:24 +02:00
812cd1efa6
Alerting: edit rules for storage low 2023-06-24 09:56:07 +02:00
78793ed440
Monitoring: change prometheus values to prevent sync-loop in argo 2023-06-24 07:28:58 +02:00
c4033903b4
Monitoring: add node tag to node-exporter metrics 2023-06-23 19:19:48 +02:00