-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Determine reliability of (Promotheus) metrics #447
Comments
How would we recreate this event? Turn off ingress in staging and turn it back on? |
I tried to recreate in Staging by enabling and disabling the ingress. I did see a spike but nothing quite like this. Let's keep an eye on it and see if it happens again. I'm wondering if it's possibly related to the performance issues that were fixed by adding an ingress. |
After reviewing the dashboard it seems that 5xx have reduced, however there have been spikes on the heartbeat. We'll keep this open for now |
Another possible complication here:
Is this affecting us now? |
/cc @ahoneiser |
On the 9th Nov at 16h30, it was reported in PROD that the load balancer heartbeat was called 10M times per minute (184K RPS). Normal heartbeat 48K RPS. And the
/stripe_from_pubsub
14K RPS, which lead to 1378 error responses 5XX per min.Are these numbers real?
The text was updated successfully, but these errors were encountered: