Establishing HTTP connection spec #1226

mateuszrzeszutek · 2023-01-30T12:01:40Z

What are you trying to achieve?

I started working on implementing the http.resend_count spec in the Java instrumentations, and I ran into an interesting issue with the current spec.

Currently (that is, without the resends), in pretty much all Java HTTP client instrumentations, we create HTTP client spans for the outermost operation, usually the top-level interface exposed by a particular HTTP client library. This means that only one HTTP span is created for the entire operation; if any resends occur during the call they'll go unnoticed, and the child SERVER spans will all receive the same parent span ID (which is why we need to modify or rewrite all the HTTP client instrumentations we have: they're all too "shallow"). The HTTP span also includes the "establishing the connection" phase (in most cases; I'll mention the exceptions later); if a connection error occurs (closed port, server not responding at all, etc.), the HTTP span ends with an error, and telemetry about the thrown exception is recorded.

This changes with the resend counter spec. As I understand it, we must now create a span for every attempt to actually send an HTTP request over the wire (and bump the counter if there's a retry). Most HTTP clients that I've read through separate the "establishing the connection" phase from the "sending the request" phase pretty cleanly, which means that whenever a DNS resolution, connection, or TLS handshake error occurs, the actual HTTP instrumentation won't catch it, because it only runs after that phase.

I find this somewhat problematic, since you'd usually want to know when this kind of situation occurs in your application.
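To make the gap concrete, here is a minimal Java sketch of per-attempt spans. It is not taken from any real instrumentation: `ResendSpanSketch`, `Connector`, `Transport`, and `Span` are hypothetical stand-ins, the retry-on-503 policy is invented for illustration, and I'm assuming `http.resend_count` is omitted on the first attempt and counts up from 1 on resends.

```java
import java.io.UncheckedIOException;
import java.net.ConnectException;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: these types model the idea only; they are not OpenTelemetry or client APIs.
final class ResendSpanSketch {

    // Minimal stand-in for a finished HTTP client span.
    static final class Span {
        final String name;
        final Map<String, Object> attributes;

        Span(String name, Map<String, Object> attributes) {
            this.name = name;
            this.attributes = attributes;
        }
    }

    // Connection phase (DNS, TCP, TLS); request details are not visible here.
    interface Connector {
        void connect(String url);
    }

    // One attempt to send the request over an established connection; returns the status code.
    interface Transport {
        int send(String url);
    }

    static int execute(String url, Connector connector, Transport transport,
                       int maxAttempts, List<Span> recorded) {
        // Runs before any per-attempt span exists, so a failure here records nothing.
        connector.connect(url);

        int status = -1;
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            Map<String, Object> attributes = new LinkedHashMap<>();
            if (attempt > 0) {
                // Assumption: the counter is omitted on the first attempt.
                attributes.put("http.resend_count", attempt);
            }
            status = transport.send(url);
            attributes.put("http.status_code", status);
            recorded.add(new Span("HTTP GET", attributes));
            if (status != 503) {
                break; // assumed retry policy: resend only on 503
            }
        }
        return status;
    }

    public static void main(String[] args) {
        // Two 503 responses followed by a 200: one span per attempt.
        List<Span> spans = new ArrayList<>();
        int[] calls = {0};
        execute("http://example.test/", url -> {}, url -> calls[0]++ < 2 ? 503 : 200, 5, spans);
        for (Span span : spans) {
            System.out.println(span.name + " " + span.attributes);
        }

        // Connection failure: the exception escapes and no span is recorded.
        List<Span> none = new ArrayList<>();
        Connector refused = url -> {
            throw new UncheckedIOException(new ConnectException("Connection refused"));
        };
        try {
            execute("http://example.test/", refused, url -> 200, 1, none);
        } catch (UncheckedIOException e) {
            System.out.println("connect failed, spans recorded: " + none.size());
        }
    }
}
```

In the second scenario the caller still sees the exception, but the list of recorded spans stays empty, which is exactly the missing telemetry described above.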

There are several low-level HTTP clients that completely separate these two phases, where details about the HTTP request are simply not accessible during the connection phase (mostly Netty and Netty-based HTTP clients). For these, we already had to introduce something to capture the connection-level details, which we've called a CONNECT span. Note that this already exists in some of our instrumentations, and predates the resend spec.

Now, to solve the problem with the missing connection-level telemetry we could do one of the following things:

1. In the top-level HTTP client interface: if there was an exception thrown, and no attempts to send an HTTP request have been made, create a "fake" HTTP span just for the connection error. (Note that this is impossible to implement in the Netty-based clients that I've briefly mentioned above.)
2. Introduce the CONNECT span to the HTTP client spec: create a CONNECT span whenever a connection is established (an actual connection; taking a connection from a pool should not emit a span), regardless of whether it was successful or not. (We've had customers ask us to implement spans that capture low-level connection details; this kind of information would be generally useful, at least for some people.)
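A rough Java sketch of the CONNECT span option, under stated assumptions: `ConnectSpanSketch`, `Pool`, the `dialer` function, and the `Span` type are all hypothetical, and the pool is deliberately simplistic (it keeps one shared connection per peer and never checks it out exclusively). The only point is where the span is emitted.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical sketch: the pool, the dialer, and the span type are illustrative only.
final class ConnectSpanSketch {

    // Minimal stand-in for a finished CONNECT span.
    static final class Span {
        final String name;
        final String peer;
        final boolean error;

        Span(String name, String peer, boolean error) {
            this.name = name;
            this.peer = peer;
            this.error = error;
        }
    }

    static final class Pool {
        final List<Span> spans = new ArrayList<>();
        private final Map<String, Object> connections = new HashMap<>();
        private final Function<String, Object> dialer; // opens a real connection; may throw

        Pool(Function<String, Object> dialer) {
            this.dialer = dialer;
        }

        Object acquire(String peer) {
            Object pooled = connections.get(peer);
            if (pooled != null) {
                return pooled; // taken from the pool: no span
            }
            try {
                Object connection = dialer.apply(peer);
                connections.put(peer, connection);
                spans.add(new Span("CONNECT", peer, false));
                return connection;
            } catch (RuntimeException e) {
                // A failed attempt to establish a connection still gets a span.
                spans.add(new Span("CONNECT", peer, true));
                throw e;
            }
        }
    }

    public static void main(String[] args) {
        Pool pool = new Pool(peer -> new Object());
        pool.acquire("example.test:443");
        pool.acquire("example.test:443"); // reused, so no second span
        System.out.println("CONNECT spans: " + pool.spans.size());
    }
}
```

Because the span is created inside the connection phase itself, this shape does not need any HTTP request details, which is why it would also fit the Netty-style clients.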


mateuszrzeszutek · 2023-01-30T12:03:52Z

CC @trask this is a part of the HTTP semconv effort

lmolkova · 2023-02-01T18:27:45Z

@mateuszrzeszutek great find!

I'd like to entertain the idea of having a top-level logical HTTP span to capture connect time, DNS time, circuit-breaking, the duration of tries with backoff, and overall operation success.

We might need to give instrumentations some freedom to instrument only the client call, only the HTTP attempts, or both:

- I heard from @tedsuo that AWS includes the full HTTP request, tracing headers included, when creating the request signature, so it can't change between tries. (On Azure, we only include x-ms-* headers or a closed set of headers in the signature.)
- Client library instrumentations instrument the public API surface anyway and don't necessarily need a top-level span to account for other things.
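As a rough illustration of that freedom, here is a small Java sketch in which an instrumentation emits a logical span for the whole call, a span per attempt, or both. Everything in it is an assumption made for the example: the `Mode` enum, the span names "HTTP call" and "HTTP attempt", and the retry-on-503 policy are hypothetical and not part of any spec.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntSupplier;

// Hypothetical sketch: span names, Mode, and the retry policy are illustrative assumptions.
final class LayeredSpanSketch {

    enum Mode { CALL_ONLY, ATTEMPTS_ONLY, BOTH }

    // Minimal stand-in for a finished span.
    static final class Span {
        final String name;
        final String parent; // null for a root span
        final int statusCode;

        Span(String name, String parent, int statusCode) {
            this.name = name;
            this.parent = parent;
            this.statusCode = statusCode;
        }
    }

    // Runs the attempts and returns the spans in the order they finish.
    static List<Span> call(Mode mode, IntSupplier attempt, int maxAttempts) {
        boolean logical = mode != Mode.ATTEMPTS_ONLY;
        boolean perAttempt = mode != Mode.CALL_ONLY;
        List<Span> finished = new ArrayList<>();
        int status = -1;
        for (int i = 0; i < maxAttempts; i++) {
            status = attempt.getAsInt();
            if (perAttempt) {
                // Attempt spans are children of the logical span only when it exists.
                finished.add(new Span("HTTP attempt", logical ? "HTTP call" : null, status));
            }
            if (status != 503) {
                break; // assumed retry policy: resend only on 503
            }
        }
        if (logical) {
            // The logical span ends last and carries the overall outcome.
            finished.add(new Span("HTTP call", null, status));
        }
        return finished;
    }

    public static void main(String[] args) {
        int[] calls = {0};
        for (Span span : call(Mode.BOTH, () -> calls[0]++ < 1 ? 503 : 200, 3)) {
            System.out.println(span.name + " parent=" + span.parent + " status=" + span.statusCode);
        }
    }
}
```

With `CALL_ONLY`, a client whose request cannot change between tries (the AWS signature case) would still get one span for the whole operation.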

mateuszrzeszutek · 2023-02-02T16:08:27Z

> We might need to give instrumentations some freedom to instrument only the client call, only the HTTP attempts, or both:
>
> - I heard from @tedsuo that AWS includes the full HTTP request, tracing headers included, when creating the request signature, so it can't change between tries. (On Azure, we only include x-ms-* headers or a closed set of headers in the signature.)
> - Client library instrumentations instrument the public API surface anyway and don't necessarily need a top-level span to account for other things.

💯

Had the same thoughts exactly. I'm pretty sure there are HTTP clients that may do retries where it's impossible to instrument each send attempt because of how the code is structured (I did not have a good example of that before; the AWS SDK is a great one), and the spec should somehow fit these cases too.

Related to https://github.com/open-telemetry/opentelemetry-specification/issues/3155 and #3234

## Changes

This PR contains the less controversial parts of #3234; it describes how the `http.resend_count` attribute should be used, and proposes two ways of instrumenting HTTP clients.

trask · 2023-04-14T16:05:27Z

@mateuszrzeszutek do you think we still need this for Java http instrumentation stability? (or now with open-telemetry/opentelemetry-specification#3290, is the Java http instrumentation default state addressed, and we just need this for opt-in behaviors)

mateuszrzeszutek · 2023-04-28T13:51:08Z

> @mateuszrzeszutek do you think we still need this for Java http instrumentation stability? (or now with open-telemetry/opentelemetry-specification#3290, is the Java http instrumentation default state addressed, and we just need this for opt-in behaviors)

Oof, I totally missed this issue 🙈
We've talked about this offline, and I don't think it's required; at this point it's an addition that might be introduced later.

github-actions bot assigned SergeyKanzhelev Jan 30, 2023

trask assigned trask and unassigned SergeyKanzhelev Jan 30, 2023

mateuszrzeszutek mentioned this issue Feb 20, 2023

HTTP client span clarification and establishing HTTP connection spec open-telemetry/opentelemetry-specification#3234

Closed

mateuszrzeszutek mentioned this issue Mar 6, 2023

HTTP client span clarification open-telemetry/opentelemetry-specification#3290

Merged

mateuszrzeszutek mentioned this issue Jun 1, 2023

Clarify HTTP client duration #70

Merged

lmolkova transferred this issue from open-telemetry/opentelemetry-specification Jul 9, 2024

github-actions bot assigned jsuereth Jul 9, 2024

lmolkova added enhancement New feature or request area:new area:http labels Jul 9, 2024

lmolkova mentioned this issue Jul 9, 2024

Add .NET network + HTTP connection spans #1192

Open

1 task

lmolkova unassigned jsuereth and trask Jul 9, 2024
