[bug] Some dashboards are broken on high pods count #129

maxpain · 2024-11-09T07:44:50Z

Describe the bug

Hello. We have a lot of short-lived pods in our clusters. It's also a problem for frequent CronJobs.

Screen.Recording.2024-11-09.at.10.35.49.mov

How to reproduce?

No response

Expected behavior

No response

Additional context

No response

EladAviczer · 2024-11-21T11:32:17Z

Have you tried increasing the CPU resources allocated to Prometheus?

maxpain · 2024-11-21T12:40:43Z

@EladAviczer did you watch the video?

EladAviczer · 2024-11-21T12:52:03Z

*victoriaMetrics

maxpain · 2024-11-21T12:53:18Z

*victoriaMetrics

The problem is not in Prometheus/VictoriaMetrics, but in the grafana dashboard itself.

EladAviczer · 2024-11-21T13:15:07Z

The dashboard uses VictoriaMetrics to query the data, you get 422 Unprocessable Content error when calling the promql/metricsQL query.

I don't say that i'm 100% sure that it is a VictoriaMetrics problem but it could be and you should check it too. : )

maxpain · 2024-11-21T13:17:48Z

The dashboard uses VictoriaMetrics to query the data, you get 422 Unprocessable Content error when calling the promql/metricsQL query.

The problem is that there are a lot of pods (because CronJob running every minute), and this dashboard tries to pass the array of pod names (1440 pods for last 24 hours), which will fail on any installation (Prometheus or VictoriaMetrics)

dotdc · 2024-11-22T06:22:03Z

Hi @maxpain,

The created_by variable was introduced to enable filtering on deployments, but if there are too many pods, you'll up end with a 422 Unprocessable Entity error as you just experienced.

I’ll check if there’s a better solution, but removing the created_by variable might work better in your case.

maxpain added the bug Something isn't working label Nov 9, 2024

maxpain assigned dotdc Nov 9, 2024

maxpain mentioned this issue Nov 9, 2024

[victoria-metrics-k8s-stack] bug: Some dashboards are broken on high pods count VictoriaMetrics/helm-charts#1729

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bug] Some dashboards are broken on high pods count #129

[bug] Some dashboards are broken on high pods count #129

maxpain commented Nov 9, 2024

EladAviczer commented Nov 21, 2024

maxpain commented Nov 21, 2024

EladAviczer commented Nov 21, 2024

maxpain commented Nov 21, 2024

EladAviczer commented Nov 21, 2024 •

edited

Loading

maxpain commented Nov 21, 2024

dotdc commented Nov 22, 2024

[bug] Some dashboards are broken on high pods count #129

[bug] Some dashboards are broken on high pods count #129

Comments

maxpain commented Nov 9, 2024

Describe the bug

How to reproduce?

Expected behavior

Additional context

EladAviczer commented Nov 21, 2024

maxpain commented Nov 21, 2024

EladAviczer commented Nov 21, 2024

maxpain commented Nov 21, 2024

EladAviczer commented Nov 21, 2024 • edited Loading

maxpain commented Nov 21, 2024

dotdc commented Nov 22, 2024

EladAviczer commented Nov 21, 2024 •

edited

Loading