You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We had a cluster where someone cordoned a few nodes and we have found out only when a node was restarted. I think that this should be part of the alert because a node failure may cause the cluster to be unable to schedule all pods.
The text was updated successfully, but these errors were encountered:
Seems like a genuine concern, not sure if KubeMemoryOvercommit is the right alert for this but some alert which says "cluster is unable to schedule all pods" sounds useful.
We had a cluster where someone cordoned a few nodes and we have found out only when a node was restarted. I think that this should be part of the alert because a node failure may cause the cluster to be unable to schedule all pods.
The text was updated successfully, but these errors were encountered: