[MLOB-1555] LLM Observability writers #4699

sabrenner · 2024-09-18T18:27:38Z

What does this PR do?

Adds LLM Observability writers for span events (agentless and agent proxy) as well as evaluation metrics (which write directly to our public API).

Important Notes

These writers will run on intervals separate from the main agent exporter, and in a future PR will be initialized in the appropriate spots to start those intervals (as defined in the constructor of the base writer). Because of this, these writers specifically won't interact with any tracer internal exporters, writers, or encoders.
We need to make sure unicode special characters are encoded in their \\u form in payload strings. I wasn't sure if there was a cleaner way to do this, so any input on this is appreciated!

Motivation

Merge in incremental change of LLMObs writers into the LLM Observability SDK release branch.

The timeline of changes to merge looks like (in order):

.github/workflows/llmobs.yml

github-actions · 2024-09-18T18:28:29Z

Overall package size

Self size: 7.17 MB
Deduped: 62.53 MB
No deduping: 62.81 MB

Dependency sizes

| name | version | self size | total size | |------|---------|-----------|------------| | @datadog/native-appsec | 8.1.1 | 18.67 MB | 18.68 MB | | @datadog/native-iast-taint-tracking | 3.1.0 | 12.27 MB | 12.28 MB | | @datadog/pprof | 5.3.0 | 9.85 MB | 10.22 MB | | protobufjs | 7.2.5 | 2.77 MB | 5.16 MB | | @datadog/native-iast-rewriter | 2.4.1 | 2.14 MB | 2.23 MB | | @opentelemetry/core | 1.14.0 | 872.87 kB | 1.47 MB | | @datadog/native-metrics | 2.0.0 | 898.77 kB | 1.3 MB | | @opentelemetry/api | 1.8.0 | 1.21 MB | 1.21 MB | | jsonpath-plus | 9.0.0 | 580.4 kB | 1.03 MB | | import-in-the-middle | 1.8.1 | 71.67 kB | 785.15 kB | | msgpack-lite | 0.1.26 | 201.16 kB | 281.59 kB | | opentracing | 0.14.7 | 194.81 kB | 194.81 kB | | pprof-format | 2.1.0 | 111.69 kB | 111.69 kB | | @datadog/sketches-js | 2.1.0 | 109.9 kB | 109.9 kB | | semver | 7.6.3 | 95.82 kB | 95.82 kB | | lodash.sortby | 4.7.0 | 75.76 kB | 75.76 kB | | lru-cache | 7.14.0 | 74.95 kB | 74.95 kB | | ignore | 5.3.1 | 51.46 kB | 51.46 kB | | int64-buffer | 0.1.10 | 49.18 kB | 49.18 kB | | shell-quote | 1.8.1 | 44.96 kB | 44.96 kB | | istanbul-lib-coverage | 3.2.0 | 29.34 kB | 29.34 kB | | rfdc | 1.3.1 | 25.21 kB | 25.21 kB | | tlhunter-sorted-set | 0.1.0 | 24.94 kB | 24.94 kB | | limiter | 1.1.5 | 23.17 kB | 23.17 kB | | dc-polyfill | 0.1.4 | 23.1 kB | 23.1 kB | | retry | 0.13.1 | 18.85 kB | 18.85 kB | | jest-docblock | 29.7.0 | 8.99 kB | 12.76 kB | | crypto-randomuuid | 1.0.0 | 11.18 kB | 11.18 kB | | koalas | 1.0.2 | 6.47 kB | 6.47 kB | | path-to-regexp | 0.1.10 | 6.38 kB | 6.38 kB | | module-details-from-path | 1.0.3 | 4.47 kB | 4.47 kB |

_{🤖 This report was automatically generated by heaviest-objects-in-the-universe}

…ters

pr-commenter · 2024-09-18T18:46:53Z

Benchmarks

Benchmark execution time: 2024-09-19 18:46:59

Comparing candidate commit ce7e950 in PR branch sabrenner/llmobs-writers with baseline commit 54c8eec in branch sabrenner/llmobs-sdk-release.

Found 0 performance improvements and 0 performance regressions! Performance is the same for 259 metrics, 7 unstable metrics.

Yun-Kim

LGTM from team mlobs, just some small suggestions / clarification questions

packages/dd-trace/src/llmobs/writers/evaluations.js

packages/dd-trace/src/llmobs/writers/spans/agentless.js

Yun-Kim · 2024-09-19T18:15:36Z

packages/dd-trace/src/llmobs/writers/base.js

+      if (typeof value === 'string') {
+        return encodeUnicode(value) // serialize unicode characters
+      }
+      return value


Just for clarification, can you explain what exactly's happening here? Does json.stringify() get called first then we run the encodeUnicode() helper on the result afterwards?

it gets run as JSON.stringify is happening. when passing a callback function to JSON.stringify, it'll execute that function over any values in the object. since we need to encode unicode characters (ie – → \u2013) for our decoder on ingestion, this function will make sure we encode those special characters with the correct unicode value (I think json.dumps does this for us on the Python SDK, but JSON.stringify doesn't do it by default here). There might be a better approach for this, will wait for Node.js folks input on that.

Reference: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/JSON/stringify#replacer

* [MLOB-1540] add llmobs configuration to global tracer config (#4696) add llmobs config * [MLOB-1555] LLM Observability writers (#4699) LLM Observability writers * [MLOB-1556] LLM Observability tagger (#4718) LLM Observability tagger * [MLOB-1560] LLMObs Span Processor (#4738) * span processor * tests * remove agent exporter log and do not stringify tags * remove llmobs from exporter tests * add in default unserializable value * review comments * warning log for metric * todo-ify * remove some duplicate logic * decouple llmobs span processing with a channel * use a static weakmap to store llmobs tags/annotations instead of span tags * do not register span in map if it does not have an llmobs span kind * span is passed on an object from sp publisher * re-clarify TODOs * only send span in publish * log multiple warnings and return conditional undefined * update error logic * [MLOB-1561] LLM Observability SDK API (#4773) * wip * type definitions * active + try/catch eval metric writer append * test ts * use tagger map and processor as a channel subscriber * change decorate and add in dev changes * try some api changes * add decorate to noop * fix breaking proxy tests * experimental decorators for TS docs * api changes, fix unit + e2e tests * try removing global log mocks * add some util tests * remove logger mocks * add module tests + do not enable when not specified * fix eval metric integration test * wip * memoize getFunctionArguments * move any subscriber and global writer to the module enablement level instead of sdk * should fix TS tests * add ts integration test and fix decorator * devex for ts versions * add noop typescript test * remove startSpan * remove unneeded change * dedup decorator code * Update index.d.ts Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com> * map metrics names * change validKind to validateKind and throw * tagger for metrics follow-up * review feedback * add some tests for not auto-annotating in certain cases --------- Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com> * hard fail instead of soft fail, except for `wrap` span name * add ml-observability codeowners * resolve ts test * update auto-annotation check * tagger can soft fail * using custom ASL instance and scope activation * fix test comments and remove * address review comments * remove llmobs.apiKey config, only rely on global * fix evaulations test * make llmobs storage accessible --------- Co-authored-by: Yun Kim <35776586+Yun-Kim@users.noreply.github.com>

sabrenner added 2 commits September 18, 2024 10:57

writers

0125bbf

tests

185fbe3

datadog-datadog-prod-us1 bot reviewed Sep 18, 2024

View reviewed changes

.github/workflows/llmobs.yml Show resolved Hide resolved

.github/workflows/llmobs.yml Show resolved Hide resolved

.github/workflows/llmobs.yml Show resolved Hide resolved

Merge branch 'sabrenner/llmobs-sdk-release' into sabrenner/llmobs-wri…

4a85292

…ters

sabrenner marked this pull request as ready for review September 19, 2024 14:21

sabrenner requested a review from a team as a code owner September 19, 2024 14:21

Yun-Kim approved these changes Sep 19, 2024

View reviewed changes

make agentless spans and eval metrics endpoints constants

ce7e950

sabrenner changed the title ~~[MLOB-1555] add LLMObs writers~~ [MLOB-1555] LLM Observability writers Sep 24, 2024

rochdev approved these changes Sep 25, 2024

View reviewed changes

sabrenner merged commit 5b215f6 into sabrenner/llmobs-sdk-release Sep 25, 2024
175 of 178 checks passed

sabrenner deleted the sabrenner/llmobs-writers branch September 25, 2024 17:17

sabrenner mentioned this pull request Sep 30, 2024

[MLOB-1524] feat(llmobs): Introduce LLM Observability SDK #4742

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MLOB-1555] LLM Observability writers #4699

[MLOB-1555] LLM Observability writers #4699

sabrenner commented Sep 18, 2024 •

edited

Loading

github-actions bot commented Sep 18, 2024 •

edited

Loading

pr-commenter bot commented Sep 18, 2024 •

edited

Loading

Yun-Kim left a comment

Yun-Kim Sep 19, 2024

sabrenner Sep 19, 2024

rochdev Sep 25, 2024

[MLOB-1555] LLM Observability writers #4699

[MLOB-1555] LLM Observability writers #4699

Conversation

sabrenner commented Sep 18, 2024 • edited Loading

What does this PR do?

Important Notes

Motivation

github-actions bot commented Sep 18, 2024 • edited Loading

Overall package size

pr-commenter bot commented Sep 18, 2024 • edited Loading

Benchmarks

Yun-Kim left a comment

Choose a reason for hiding this comment

Yun-Kim Sep 19, 2024

Choose a reason for hiding this comment

sabrenner Sep 19, 2024

Choose a reason for hiding this comment

rochdev Sep 25, 2024

Choose a reason for hiding this comment

sabrenner commented Sep 18, 2024 •

edited

Loading

github-actions bot commented Sep 18, 2024 •

edited

Loading

pr-commenter bot commented Sep 18, 2024 •

edited

Loading