[exporter/loadbalancing] Add the ability to load balance by Metric stream ID #32513

RichieSams · 2024-04-18T12:31:12Z

Component(s)

exporter/loadbalancing

Is your feature request related to a problem? Please describe.

The LoadBalancing exporter is a great idea. It currently exposes a number of ways to route metrics coming in:

const (
	traceIDRouting routingKey = iota
	svcRouting
	metricNameRouting
	resourceRouting
)

However, there are many situation where these routing methods can lead to non-uniform distributions of metrics. For example, if datapoints use lots of labels, or med to high cardinality labels. The finest grain routing we can currently do is metric name routing. But if the majority of the datapoints are all the same metric name, then this doesn't help.

Describe the solution you'd like

I would like to propose adding a new value to the routing enum: streamIDRouting. This would route individual datapoints based on their unique ID. We could use the new internal/exp/metrics/identity package: https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/main/internal/exp/metrics/identity/stream.go#L29

I'm willing to be the implementer

Describe alternatives you've considered

No response

Additional context

No response

The text was updated successfully, but these errors were encountered:

github-actions · 2024-04-18T12:31:26Z

Pinging code owners:

exporter/loadbalancing: @jpkrohling

See Adding Labels via Comments if you do not have permissions to add labels yourself.

JaredTan95 · 2024-04-19T12:51:46Z

@RichieSams Hi, Thanks for the proposal. It's yours

skandragon · 2024-05-03T16:57:22Z

I am also needing something similar to this, but likely need to dig a bit deeper and make this per-metric-attribute.

My use case is, I have a custom processor that decorates telemetry with an attribute, placed either on the log, metric, or span. This key is what I want to partition on, and send to each back-end to be gathered and ultimately written to the same s3 output file.

What is the recommended way to add this, as the PR associated with this change seems to be rejected? I know the PR is not sufficient as it only looks at resource attributes, and I would need to look deeper, and I assume split deeper as well.

jpkrohling · 2024-05-07T11:35:33Z

I believe @RichieSams will work on this, the PR was closed in favor of breaking it into smaller chunks of work.

RichieSams · 2024-05-07T11:39:24Z

Correct. I already split up the commits, I just got busy at work last week. I'll create new PRs today

RichieSams · 2024-05-07T14:30:40Z

This is the first PR. Just a trivial helper function: #32794

Then there are these 4 commits that will be applied on top. @jpkrohling How do you want me to do them? I can serially make each a PR, get it merged, then go on to the next. Or do you want to combine any?

chore: Add str const variables for the routingKeys RichieSams@bb473e1
[exporter/loadbalacing] Refactor how metrics are split and then re-joined after load-balacing RichieSams@89a14af
[exporter/loadbalancer] Refactor the metrics export benchmarks RichieSams@891bdfa
[exporter/loadbalancer] Add a new routing key: streamID RichieSams@5314b93

**Description:** This will merge the metrics in mdB into mdA, trying to re-use resourceMetrics, scopeMetrics, and metric values as possible. This will be used to help implement the new feature for: #32513 **Link to tracking Issue:** #32513 / #32690 **Testing:** I created a unit test which tests various scenarios of how merge behavior should happen **Documentation:** The exported function is documented using standard golang style. And there are comments inside the code to explain what is going on and why --------- Co-authored-by: Ziqi Zhao <zhaoziqi9146@gmail.com>

github-actions · 2024-07-08T03:32:21Z

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

exporter/loadbalancing: @jpkrohling

See Adding Labels via Comments if you do not have permissions to add labels yourself.

RichieSams · 2024-07-09T13:32:13Z

This issue is still valid. Currently waiting for this PR to be merged, and then I can create the final PR that will close this issue.

#33676

jpkrohling · 2024-07-11T08:14:38Z

The dependent PR has been merged!

**Description:** This adds a new routing option for metrics: streamID. This routes datapoints based on their streamID. That's the unique hash of all it's attributes, plus the attributes and identifying information of its resource, scope, and metric data **Link to tracking Issue:** #32513 **Testing:** I added to the existing testing suites, testing the new routing, as well as adding to the benchmark suite **Documentation:** I updated the README to describe the new routingKey: `metricID`, and how it works

github-actions · 2024-09-10T03:32:21Z

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

exporter/loadbalancing: @jpkrohling

See Adding Labels via Comments if you do not have permissions to add labels yourself.

jpkrohling · 2024-09-10T08:59:47Z

@RichieSams , can this be closed?

RichieSams · 2024-09-10T17:23:42Z

Yup! Closed as of 12d071d. Thanks for the reminder

RichieSams added enhancement New feature or request needs triage New item requiring triage labels Apr 18, 2024

github-actions bot added the exporter/loadbalancing label Apr 18, 2024

RichieSams changed the title ~~Add the ability to Load balance by Metric stream ID~~ [exporter/loadbalancing] Add the ability to Load balance by Metric stream ID Apr 18, 2024

RichieSams changed the title ~~[exporter/loadbalancing] Add the ability to Load balance by Metric stream ID~~ [exporter/loadbalancing] Add the ability to load balance by Metric stream ID Apr 18, 2024

JaredTan95 removed the needs triage New item requiring triage label Apr 19, 2024

JaredTan95 assigned RichieSams Apr 19, 2024

This was referenced Apr 25, 2024

[exporter/loadbalacing] Add support for new streamID routing #32690

Closed

[internal/exp/metrics] Add functions to merge metrics #32794

Merged

github-actions bot added the Stale label Jul 8, 2024

github-actions bot removed the Stale label Jul 10, 2024

RichieSams mentioned this issue Jul 16, 2024

[exporter/loadbalancer] Add a new routing key: streamID #34086

Merged

github-actions bot added the Stale label Sep 10, 2024

RichieSams closed this as completed Sep 10, 2024

github-actions bot mentioned this issue Dec 2, 2024

Link Checker Report signalfx/splunk-otel-collector#5658

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[exporter/loadbalancing] Add the ability to load balance by Metric stream ID #32513

[exporter/loadbalancing] Add the ability to load balance by Metric stream ID #32513

RichieSams commented Apr 18, 2024

github-actions bot commented Apr 18, 2024

JaredTan95 commented Apr 19, 2024

skandragon commented May 3, 2024

jpkrohling commented May 7, 2024

RichieSams commented May 7, 2024

RichieSams commented May 7, 2024

github-actions bot commented Jul 8, 2024

RichieSams commented Jul 9, 2024

jpkrohling commented Jul 11, 2024

github-actions bot commented Sep 10, 2024

jpkrohling commented Sep 10, 2024

RichieSams commented Sep 10, 2024 •

edited

Loading

[exporter/loadbalancing] Add the ability to load balance by Metric stream ID #32513

[exporter/loadbalancing] Add the ability to load balance by Metric stream ID #32513

Comments

RichieSams commented Apr 18, 2024

Component(s)

Is your feature request related to a problem? Please describe.

Describe the solution you'd like

Describe alternatives you've considered

Additional context

github-actions bot commented Apr 18, 2024

JaredTan95 commented Apr 19, 2024

skandragon commented May 3, 2024

jpkrohling commented May 7, 2024

RichieSams commented May 7, 2024

RichieSams commented May 7, 2024

github-actions bot commented Jul 8, 2024

RichieSams commented Jul 9, 2024

jpkrohling commented Jul 11, 2024

github-actions bot commented Sep 10, 2024

jpkrohling commented Sep 10, 2024

RichieSams commented Sep 10, 2024 • edited Loading

RichieSams commented Sep 10, 2024 •

edited

Loading