Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

KubeMemoryOvercommit (and family) does not cordoning nodes into consideration #770

Open
mladedav opened this issue Jun 9, 2022 · 2 comments
Labels
keepalive Use to prevent automatic closing

Comments

@mladedav
Copy link

mladedav commented Jun 9, 2022

We had a cluster where someone cordoned a few nodes and we have found out only when a node was restarted. I think that this should be part of the alert because a node failure may cause the cluster to be unable to schedule all pods.

Copy link

This issue has not had any activity in the past 30 days, so the
stale label has been added to it.

  • The stale label will be removed if there is new activity
  • The issue will be closed in 7 days if there is no new activity
  • Add the keepalive label to exempt this issue from the stale check action

Thank you for your contributions!

@github-actions github-actions bot added the stale label Sep 26, 2024
@skl skl added the keepalive Use to prevent automatic closing label Sep 26, 2024
@skl
Copy link
Collaborator

skl commented Sep 26, 2024

Seems like a genuine concern, not sure if KubeMemoryOvercommit is the right alert for this but some alert which says "cluster is unable to schedule all pods" sounds useful.

@skl skl removed the stale label Oct 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
keepalive Use to prevent automatic closing
Projects
None yet
Development

No branches or pull requests

2 participants