
Add suppression for input and output data #5

Merged 3 commits from cartermp/suppress into main on Aug 17, 2023
Conversation

@cartermp (Owner) commented Jul 26, 2023

Adds two config parameters:

  • suppress_response_data, which, when set to True, will NOT log response data (like chat responses) to a span
  • suppress_input_content, which, when set to True, will NOT log input data (like a corpus of text passed to a model with a large context window) to a span

This lets people avoid hitting size limits on spans. Imagine an app that passes in a large amount of data per request, or a long conversation in a chatbot where the entire conversation history is passed in as input each time -- this would likely exceed the total size a span can have.
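A rough sketch of how this could be wired up (the class name and call shape here are illustrative; only the two new parameter names are what this PR adds):

```python
# Illustrative sketch -- only the two parameter names come from this PR;
# the class name and how the flags are passed are assumptions.
from opentelemetry.instrumentation.openai import OpenAIInstrumentor

OpenAIInstrumentor().instrument(
    suppress_response_data=True,   # don't record chat responses on spans
    suppress_input_content=True,   # don't record prompt/input text on spans
)
```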

@cartermp requested a review from estib on July 26, 2023 01:37
@estib (Collaborator) left a comment


Looks good -- although I am interested in talking through one question:

Will people be able to predict when content / responses are likely to be too long at the time they are initializing the openai OTEL auto instrumentor? Or will it be something a bit more dynamic, like "sometimes I get a few really long questions / responses, but sometimes not" depending on the users?

What do you think about adding a config that defines dynamic rules for when response_data or input_content should be included? Like maybe we could let users define some kind of length limit for how long a response_data can be before it gets suppressed? This could be something we do in addition to a boolean on/off switch like what this PR adds, but it could also be something we do instead? (we could even allow users to set the length limit to 0 to suppress them entirely)
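Roughly what I have in mind, purely as an illustration (none of these names exist anywhere yet):

```python
# Illustrative sketch only -- nothing here is in this PR.
# Idea: record a value only if it is under a user-defined length limit,
# with 0 meaning "always suppress".
MAX_CONTENT_LENGTH = 4096

def maybe_set_attribute(span, key, value, limit=MAX_CONTENT_LENGTH):
    if limit == 0 or len(value) > limit:
        return  # suppressed: over the limit (or suppression forced via 0)
    span.set_attribute(key, value)
```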

I dunno, what do you think?

Two review comments on src/opentelemetry/instrumentation/openai/__init__.py (outdated, resolved)
@cartermp (Owner, Author) commented Aug 5, 2023

Mmmm, this is a good question:

Will people be able to predict when content / responses are likely to be too long at the time they are initializing the openai OTEL auto instrumentor? Or will it be something a bit more dynamic, like "sometimes I get a few really long questions / responses, but sometimes not" depending on the users?

You can certainly predict it for single requests, or at least a range of I/O sizes. But yeah, for genuine chat apps there's no predicting it, and the same goes for agents where you continually build up larger and larger context.
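To make that concrete, a toy example of the chat case (not code from the instrumentation, just arithmetic):

```python
# Toy illustration: a chat app resends the whole history on every turn,
# so the input payload attached to each span keeps growing.
history = []
for turn, user_msg in enumerate(["hi", "tell me more", "summarize everything so far"]):
    history.append({"role": "user", "content": user_msg})
    prompt_chars = sum(len(m["content"]) for m in history)
    print(f"turn {turn}: {prompt_chars} chars of input would be recorded on the span")
```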

I think a more dynamic setting that limits total inputs and/or outputs by a threshold does make sense.

@cartermp merged commit aa651de into main on Aug 17, 2023
1 check passed
@cartermp deleted the cartermp/suppress branch on August 17, 2023 19:17