|
/etc/config/alerting_rules.yml > BlackBox Alerts
|
| Labels |
State |
Active Since |
Value |
|
alertname="Probe failure"
instance="https://compliance-association.spk8uat1eus2-spirion.com/api/v1/association/healthcheck"
job="compliance-service-check"
|
firing |
2026-02-10 19:30:38.926874181 +0000 UTC |
0 |
| Annotations |
- summary
- The service compliance-service-check is unreachable or down. please check the cluster for further information.
|
|
alertname="Probe failure"
instance="https://compliance-subjectrequest.spk8uat1eus2-spirion.com/api/v2/healthcheck"
job="compliance-service-check"
|
firing |
2026-02-10 19:29:38.926874181 +0000 UTC |
0 |
| Annotations |
- summary
- The service compliance-service-check is unreachable or down. please check the cluster for further information.
|
|
alertname="Probe failure"
instance="https://svc-sdmtranslator-i.spk8uat1eus2-spirion.com/api/healthcheck"
job="dpm-service-check"
|
firing |
2026-02-10 19:22:38.926874181 +0000 UTC |
0 |
| Annotations |
- summary
- The service dpm-service-check is unreachable or down. please check the cluster for further information.
|
|
alertname="Probe failure"
instance="https://compliance-datasubjectrequest.spk8uat1eus2-spirion.com/api/datasubjectrequests/healthcheck"
job="compliance-service-check"
|
firing |
2026-02-10 19:29:38.926874181 +0000 UTC |
0 |
| Annotations |
- summary
- The service compliance-service-check is unreachable or down. please check the cluster for further information.
|
|
|
|
|
/etc/config/alerting_rules.yml > MSSQL Alerts
|
| Labels |
State |
Active Since |
Value |
|
alertname="KubernetesPodNotHealthy"
namespace="uat1"
pod="compliance-association-f88564b-kldx5"
severity="critical"
|
firing |
2025-12-16 18:17:46.936640537 +0000 UTC |
1 |
| Annotations |
- description
- Pod has been in a non-ready state for longer than 15 minutes.
VALUE = 1
LABELS = map[namespace:uat1 pod:compliance-association-f88564b-kldx5]
- summary
- Kubernetes Pod not healthy (instance compliance-association-f88564b-kldx5)
|
|
alertname="KubernetesPodNotHealthy"
namespace="uat1"
pod="compliance-runtimeetl-974d4dfb6-zr9hw"
severity="critical"
|
firing |
2025-12-16 18:17:46.936640537 +0000 UTC |
1 |
| Annotations |
- description
- Pod has been in a non-ready state for longer than 15 minutes.
VALUE = 1
LABELS = map[namespace:uat1 pod:compliance-runtimeetl-974d4dfb6-zr9hw]
- summary
- Kubernetes Pod not healthy (instance compliance-runtimeetl-974d4dfb6-zr9hw)
|
|
alertname="KubernetesPodNotHealthy"
namespace="uat1"
pod="mssql-backup-29534470-6x595"
severity="critical"
|
firing |
2026-02-26 14:53:46.936640537 +0000 UTC |
1 |
| Annotations |
- description
- Pod has been in a non-ready state for longer than 15 minutes.
VALUE = 1
LABELS = map[namespace:uat1 pod:mssql-backup-29534470-6x595]
- summary
- Kubernetes Pod not healthy (instance mssql-backup-29534470-6x595)
|
|
alertname="KubernetesPodNotHealthy"
namespace="uat1"
pod="job-licensestatuscheck-0.0.9.228.0-hbl5l"
severity="critical"
|
firing |
2026-03-13 00:12:46.936640537 +0000 UTC |
1 |
| Annotations |
- description
- Pod has been in a non-ready state for longer than 15 minutes.
VALUE = 1
LABELS = map[namespace:uat1 pod:job-licensestatuscheck-0.0.9.228.0-hbl5l]
- summary
- Kubernetes Pod not healthy (instance job-licensestatuscheck-0.0.9.228.0-hbl5l)
|
|
alertname="KubernetesPodNotHealthy"
namespace="velero"
pod="node-agent-fqrgl"
severity="critical"
|
firing |
2025-12-16 18:03:46.936640537 +0000 UTC |
1 |
| Annotations |
- description
- Pod has been in a non-ready state for longer than 15 minutes.
VALUE = 1
LABELS = map[namespace:velero pod:node-agent-fqrgl]
- summary
- Kubernetes Pod not healthy (instance node-agent-fqrgl)
|
|
alert: KubernetesVolumeOutOfDiskSpace
expr: sum without(beta_kubernetes_io_arch, beta_kubernetes_io_instance_type, failure_domain_beta_kubernetes_io_region, kubernetes_azure_com_cluster, kubernetes_azure_com_node_image_version, kubernetes_azure_com_role, kubernetes_io_arch, kubernetes_io_hostname, kubernetes_io_os, kubernetes_io_role, node_kubernetes_io_instance_type, topology_kubernetes_io_region, topology_kubernetes_io_zone, failure_domain_beta_kubernetes_io_zone) (kubelet_volume_stats_available_bytes / kubelet_volume_stats_capacity_bytes * 100 < 10)
for: 2m
labels:
severity: warning
annotations:
description: |-
Volume is almost full (< 10% left)
VALUE = {{ $value }}
LABELS = {{ $labels }}
summary: Kubernetes Volume out of disk space (instance {{ $labels.pod }})
| Labels |
State |
Active Since |
Value |
|
agentpool="mssql"
alertname="KubernetesVolumeOutOfDiskSpace"
beta_kubernetes_io_os="linux"
eks_amazonaws_com_capacityType="ON_DEMAND"
eks_amazonaws_com_nodegroup="eks-node-mssql-20240815132817056800000031"
eks_amazonaws_com_nodegroup_image="ami-0b831bfe98b29906c"
eks_amazonaws_com_sourceLaunchTemplateId="lt-067594b81d5cc5c61"
eks_amazonaws_com_sourceLaunchTemplateVersion="1"
instance="ip-10-0-6-116.ec2.internal"
job="kubernetes-nodes"
k8s_io_cloud_provider_aws="d0cf132161bf4419c1a119ccef83b551"
kubernetes_azure_com_agentpool="mssql"
namespace="uat1"
persistentvolumeclaim="mssql-linux-backups"
sdp_node_role="mssql"
severity="warning"
topology_ebs_csi_aws_com_zone="us-east-1a"
topology_k8s_aws_zone_id="use1-az1"
|
firing |
2026-02-26 00:32:46.936640537 +0000 UTC |
2.1822023047674595 |
| Annotations |
- description
- Volume is almost full (< 10% left)
VALUE = 2.1822023047674595
LABELS = map[agentpool:mssql beta_kubernetes_io_os:linux eks_amazonaws_com_capacityType:ON_DEMAND eks_amazonaws_com_nodegroup:eks-node-mssql-20240815132817056800000031 eks_amazonaws_com_nodegroup_image:ami-0b831bfe98b29906c eks_amazonaws_com_sourceLaunchTemplateId:lt-067594b81d5cc5c61 eks_amazonaws_com_sourceLaunchTemplateVersion:1 instance:ip-10-0-6-116.ec2.internal job:kubernetes-nodes k8s_io_cloud_provider_aws:d0cf132161bf4419c1a119ccef83b551 kubernetes_azure_com_agentpool:mssql namespace:uat1 persistentvolumeclaim:mssql-linux-backups sdp_node_role:mssql topology_ebs_csi_aws_com_zone:us-east-1a topology_k8s_aws_zone_id:use1-az1]
- summary
- Kubernetes Volume out of disk space (instance )
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|