-
Notifications
You must be signed in to change notification settings - Fork 894
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sampling based on URL #1597
Comments
I think this was fixed recently in Java: open-telemetry/opentelemetry-java-instrumentation#388. The spec currently says in https://github.com/open-telemetry/opentelemetry-specification/blob/v1.1.0/specification/trace/api.md#span-creation:
which would include HTTP headers too (makes sense, what if you have an "X-My-Company-Loadtest" header?). Of course this means that sampling will not prevent initial attribute collection, for this problem see #620. |
Thanks @Oberon00, could we extend the spec and explicitly mention that the URL should be present at the sampling time (if it is known)? |
The spec does leave room for the sampler to access attributes that are provided during span creation (not after the span creation), so it is not an API issue. |
The quoted spec already says that:
Of course this also applies to the URL. If we need to discriminate attributes that are more or less important to sampling (original semantic conventions generator PR had a |
There hasn't been much recent activity here. Is there a reasonable workaround that people agree on: specifically I'm asking for the go sdk. Should I ask there instead? |
Well I would ask the same for js/node sdk, does anyone figured out anything worth sharing? |
We have determined that this issue has been completed due to updates in the semantic conventions that require URL to be present at span creation for consumption by samplers. |
What are you trying to achieve?
The Hypertrace agents collect all request and response data (headers and payloads). This data is not needed for every request, hence I would like to implement a sampler that would make a decision based on the URL/endpoint pattern when this data should be collected (e.g. one in 100 requests). The decision should be propagated to downstream services (e.g. via tracestate) to make the decision coherent across the transaction.
The custom sampler would make a decision whether to collect payload and headers, the span sampling would still go through the normal sampling process (e.g. my custom sampler would delegate to it).
To fully implement this use-case these parts are needed:
tracestate
(supported by the API) or drop the header and payload attributes inSpanProcessor#onEnd
(not supported by the API).What is missing in the spec to solve my use-case
The sampler does not have access to the URL. The sampler interface (at least in Java) accepts attributes, however the instrumentation (e.g. servlet in otel-java-instrumentation) is not passing URL attribute there nor spec mandates it to do it.
The text was updated successfully, but these errors were encountered: