feat: migrate decoders to protobuf #63

kruskall · 2023-06-14T15:28:36Z

Trying out a new approach for faster decoding.

Related to #52
Blocked by #62

kruskall · 2023-06-15T12:06:54Z

@axw before adding the finishing touch and marking this as ready for review, I wanted to hear your feedback on the new approach I tried out on decoding.

Instead of decoding into a struct and then mapping, we're decoding to a map[string]any directly and then mapping.

In practice this means going from:

	if from.Environment.IsSet() {
		out.Environment = from.Environment.Val
	}

to

	if environment, ok := getString(m, "environment"); ok {
		out.Environment = environment
	}

Nested values can be retrieved without increasing LOC or verbosity:

	if name, ok := getStringPaths(m, "node.configured_name"); ok {
		out.Node.Name = name
	}

We would lose the go struct representation but I think the switch would be worth it. WDYT ?

axw · 2023-06-15T13:11:56Z

@kruskall we used to do this years ago. It had poor performance due to all the allocations it incurs. Moreover, the "input model" Go structs enforce schema validation. I don't want to go back to this.

kruskall · 2023-06-16T17:57:01Z

Ah I see, sorry about that!

I pushed some changes. Should be good for review now!

axw · 2023-06-17T03:00:43Z

@kruskall thanks. That's not to say that what we have is ideal, either. Perhaps after the protobuf work is done we can revisit this.

axw

Looking good, just a handful of issues that I noticed.

I'm quite concerned about how easy it will be to introduce panics in the translation code. I think we should add some fuzz testing (#26) ASAP.

axw · 2023-06-17T03:20:27Z

input/elasticapm/internal/modeldecoder/modeldecodertest/populator.go

+	MetricType          modelpb.MetricType
+	CompressionStrategy modelpb.CompressionStrategy


Suggested change

MetricType modelpb.MetricType

CompressionStrategy modelpb.CompressionStrategy

The reflection code is a good backstop in case we miss some tests, but I think we should prefer to have specific tests, particularly for non-trivial mappings like CompressionStrategy where we need to do a text->enum. I suggest removing these fields (MetricType doesn't seem to be hit anyway?) and exclude the fields from reflection: 32d2cdb

You removed the fields from the reflection code, but still need to add an explicit test for the compression strategy translation. See 32d2cdb.

input/elasticapm/internal/modeldecoder/v2/decoder.go

…y in span test

this was conflicting with fuzz testing

kruskall · 2023-06-17T23:47:43Z

Thanks for the feedback! 🙇

I ran fuzz testing on the decoders and was able to catch a fair amount of issues. The work on fuzz testing depends on google/gofuzz#68 so we can't add it easily. If you want to test it, I had to clone the PR and add a replace directive but it's available here: https://github.com/kruskall/apm-data/tree/test/fuzz-testing

axw · 2023-06-18T05:21:36Z

I ran fuzz testing on the decoders and was able to catch a fair amount of issues. The work on fuzz testing depends on google/gofuzz#68 so we can't add it easily. If you want to test it, I had to clone the PR and add a replace directive but it's available here: https://github.com/kruskall/apm-data/tree/test/fuzz-testing

I was thinking we would use package testing from the standard library. We could either pass the corpus through the JSON decoders, or we could interpret the corpus for setting the modeldecoder struct fields directly.

The latter approach should be most efficient, since we really just care about the translation part. I haven't used it before, but https://github.com/AdaLogics/go-fuzz-headers may be helpful.

axw

LGTM, but please add a test for compression strategy

kruskall · 2023-06-18T12:44:28Z

Thanks! I opened a followup issue to discuss fuzz testing.

kruskall force-pushed the feat/migrate-decoder-error-pb branch 3 times, most recently from 66ede1c to ef83129 Compare June 15, 2023 12:00

feat: migrate decoders to protobuf

4a4acc4

kruskall force-pushed the feat/migrate-decoder-error-pb branch from ef83129 to 4a4acc4 Compare June 16, 2023 17:44

kruskall added 2 commits June 16, 2023 19:53

refactor: reduce diff noise

07acf9b

build: update missing licenses

a6252cd

kruskall changed the title ~~feat: migrate error decoder to protobuf~~ feat: migrate decoders to protobuf Jun 16, 2023

kruskall marked this pull request as ready for review June 16, 2023 17:55

kruskall requested a review from a team June 16, 2023 17:55

kruskall mentioned this pull request Jun 16, 2023

feat: migrate decoders to protobuf elastic/apm-server#11013

Merged

axw reviewed Jun 17, 2023

View reviewed changes

kruskall added 4 commits June 18, 2023 01:29

fix: add nil check to avoid nil dereference discovered by fuzz testing

2086990

test: remove complex type from populator and skip compression strateg…

bf64618

…y in span test

test: ignore non ndjson files in jsonschema test

c4b21e6

this was conflicting with fuzz testing

lint: remove unused variables

6841a5c

kruskall requested a review from axw June 17, 2023 23:49

axw approved these changes Jun 18, 2023

View reviewed changes

test: add compression strategy test

f736d25

kruskall mentioned this pull request Jun 18, 2023

Introduce fuzz testing #66

Open

kruskall merged commit 0acd047 into elastic:main Jun 18, 2023

kruskall deleted the feat/migrate-decoder-error-pb branch June 18, 2023 12:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: migrate decoders to protobuf #63

feat: migrate decoders to protobuf #63

kruskall commented Jun 14, 2023 •

edited

Loading

kruskall commented Jun 15, 2023

axw commented Jun 15, 2023

kruskall commented Jun 16, 2023

axw commented Jun 17, 2023

axw left a comment

axw Jun 17, 2023

kruskall Jun 17, 2023

axw Jun 18, 2023

kruskall commented Jun 17, 2023

axw commented Jun 18, 2023

axw left a comment

kruskall commented Jun 18, 2023

		MetricType modelpb.MetricType
		CompressionStrategy modelpb.CompressionStrategy

feat: migrate decoders to protobuf #63

feat: migrate decoders to protobuf #63

Conversation

kruskall commented Jun 14, 2023 • edited Loading

kruskall commented Jun 15, 2023

axw commented Jun 15, 2023

kruskall commented Jun 16, 2023

axw commented Jun 17, 2023

axw left a comment

Choose a reason for hiding this comment

axw Jun 17, 2023

Choose a reason for hiding this comment

kruskall Jun 17, 2023

Choose a reason for hiding this comment

axw Jun 18, 2023

Choose a reason for hiding this comment

kruskall commented Jun 17, 2023

axw commented Jun 18, 2023

axw left a comment

Choose a reason for hiding this comment

kruskall commented Jun 18, 2023

kruskall commented Jun 14, 2023 •

edited

Loading