Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to send metrics using googlecloudexporter to custom endpoint with incorrect errors #36502

Open
Mona3087 opened this issue Nov 22, 2024 · 1 comment
Labels
bug Something isn't working exporter/googlecloud needs triage New item requiring triage

Comments

@Mona3087
Copy link

Component(s)

exporter/googlecloud

What happened?

Description

We are using otel collector to collect custom metrics from our Kubernetes cluster and export it to Google Cloud Monitoring for alerts. When have lots of pods in our cluster, our clusters could be generating more than 200k time series per minute. Otel at that point starts logging error and is not able to push metrics in cloud. The error is talking about hitting 200 quota limit and it is misleading. Ref: https://cloud.google.com/monitoring/quotas

Steps to Reproduce

Create sample pod/app that pushes more than 200k time series on single monitored resource type.

Expected Result

Error should specify that which quota limit otel exporter is hitting.

Actual Result

2024-11-19T17:18:39.278Z error internal/queue_sender.go:92 Exporting failed. Dropping data. {"kind": "exporter", "data_type": "metrics", "name": "googlecloud", "error": "rpc error: code = ResourceExhausted desc = One or more TimeSeries could not be written: timeSeries[0-53]: write for resource=k8s_cluster{location:us-east1,cluster_name:ctl-gke-4501} failed with: Monitored resource has too many time series (custom metrics).\nerror details: name = Unknown desc = total_point_count:54 errors:{status:{code:8} point_count:54}", "dropped_items": 54}

Collector version

v0.114.0

Environment information

Environment

OS: (e.g., "Ubuntu 20.04")
Compiler(if manually compiled): (e.g., "go 14.2")
GKE 1.31.1-gke.2105000

OpenTelemetry Collector configuration

processors: 
  batch:       
    send_batch_size: 200       
    send_batch_max_size: 200
    timeout: 200ms
  memory_limiter:      
    check_interval: 1s       
    limit_mib: 1600       
    spike_limit_mib: 320
exporters:    
  googlecloud:      
    project: ""      
    metric:       
      prefix: "custom.googleapis.com"
service:  
  pipelines:    
    metrics:     
      recievers: []    
      processors: [batch, memory_limiter]     
      exporters: [googlecloud]

Log output

2024-11-19T17:18:39.278Z	error	internal/queue_sender.go:92	Exporting failed. Dropping data.	{"kind": "exporter", "data_type": "metrics", "name": "googlecloud", "error": "rpc error: code = ResourceExhausted desc = One or more TimeSeries could not be written: timeSeries[0-53]: write for resource=k8s_cluster{location:us-east1,cluster_name:ctl-gke-4501} failed with: Monitored resource has too many time series (custom metrics).\nerror details: name = Unknown  desc = total_point_count:54  errors:{status:{code:8}  point_count:54}", "dropped_items": 54}

Additional context

No response

@Mona3087 Mona3087 added bug Something isn't working needs triage New item requiring triage labels Nov 22, 2024
Copy link
Contributor

Pinging code owners:

See Adding Labels via Comments if you do not have permissions to add labels yourself.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working exporter/googlecloud needs triage New item requiring triage
Projects
None yet
Development

No branches or pull requests

1 participant