Predefined PrometheusRules

There are few pre-defined PrometheusRules that come with the platform. You can use existing rules to forward alerts to your preferred medium of choice.

Following are the rules along their descriptions.

Kubernetes Apps

NameDescription
KubePodCrashLoopingPod Namespace/Pod is restarting N times / 5 minutes.
KubePodNotReadyPod Namespace/Pod has been in a non-ready state for longer than 15 minutes
KubeDeploymentGenerationMismatchDeployment generation for Namespace/Deployment does not match, this indicates that the Deployment has failed but has not been rolled back.
KubeDeploymentReplicasMismatchDeployment Namespace/Deployment has not matched the expected number of replicas for longer than 15 minutes.
KubeStatefulSetReplicasMismatchStatefulSet Namespace/StatefulSet has not matched the expected number of replicas for longer than 15 minutes
KubeStatefulSetGenerationMismatchStatefulSet generation for Namespace/StatefulSet does not match, this indicates that the StatefulSet has failed but has not been rolled back.
KubeStatefulSetUpdateNotRolledOutStatefulSet Namespace/StatefulSet update has not been rolled out.
KubeDaemonSetRolloutStuckDaemonSet Namespace/DaemonSet has not finished or progressed for at least 15 minutes.
KubeContainerWaitingPod Namespace/Pod container Container has been in waiting state for longer than 1 hour.
KubeDaemonSetNotScheduledPods of DaemonSet Namespace/DaemonSet are not scheduled.
KubeJobCompletionJob Namespace/Job is taking more than 12 hours to complete.
KubeJobFailedJob Namespace/Job failed to complete. Removing failed job after investigation should clear this alert.
KubeHpaReplicasMismatchHPA (Horizontal Pod Autoscaler) Namespace/HPA has not matched the desired number of replicas for longer than 15 minutes.
KubeHpaMaxedOutHPA (Horizontal Pod Autoscaler) Namespace/HPA has been running at max replicas for longer than 15 minutes.

Kubernetes Storage

NameDescription
KubePersistentVolumeFillingUpThe PersistentVolume claimed by PersistentVolume in Namespace Namespace is only Percentage % free.
KubePersistentVolumeErrorsThe persistent volume PersistentVolume has status Failed/Pending.