Add support for extended clientside aggregation #1048

praboud-stripe · 2023-02-27T15:08:51Z

Summary

This adds Veneur serverside support for statsd clients using extended clientside aggregation. This allows clients to pack multiple values into the same metrics packet - see the example from the docs:

Without extended clientside aggregation:

my_distribution_metric:21|d|#all,my,tags
my_distribution_metric:43.2|d|#all,my,tags
my_distribution_metric:1657|d|#all,my,tags

With extended clientside aggregation:

my_distribution_metric:21:43.2:1657|d|#all,my,tags

NB: clients still need to enable this manually; the Veneur server does not eg: negotiate a protocol version its clients to enable this, or anything fancy like that. This PR just adds support for parsing the packets produced by clients using option.

There are three main changes made by this PR:

We're changing the interface of ParseMetric to take a func(*UDPMetric) callback called multiple times, instead of returning a single *UDPMetric value. Why do it this way? The other alternatives I considered were to either return a slice, or to publish the results back to a channel. Returning a slice requires allocating a slice - I'd rather avoid the extra GC churn if we can, since the caller would just immediately discard the slice after iterating over it. Publishing to a channel is tempting, but also has some problems. The caller is just publishing to a channel anyway, so it seems like this should work - but we determine which channel to publish to based on the Digest of the metric. Internalizing that routing logic into the parsing code seems like a layering violation. We could publish to an intermediate channel, but this requires an extra goroutine to shunt the metrics around. Overall, a callback is definitely a bit less ergonomic than the current pattern of directly returning a *UDPMetric, but it seems like the best option available (and we can add some test helpers to improve the ergonomics in our tests).
The ParseMetric code needs to be re-ordered a bit - instead of parsing the value in essentially the order it which it arrives in the packet, now that there are multiple values, it's more convenient to parse them last, immediately before the metric is published via the callback function. This has changed the exact error message produced by some of the tests checking for invalid packets. This doesn't really matter, though, for two reasons: a) the error is just logged internally - there's no API contract around which error is returned and b) since this is a case where there's more than one thing wrong with the same packet, and it's just a question of which error we hit first, both errors are equally correct.
I've also chosen to convert the ParseMetric code to call bytes.IndexByte directly, rather than using the SplitBytes helper. This isn't strictly required for the rest of this change, but it's faster. Without removing SplitBytes, there's a slight performance regression in the parsing code from adding the new multi-value logic. Net of the the SplitBytes change, this PR improves the benchmark performance of that code. I'm not that attached to this part of the PR, and can unwind it if others are opposed to this part, but I wanted to "make room" from a performance standpoint for the rest of these changes. I did do some profiling of SplitBytes itself to see if there's anything obvious we can do to improve the performance of the helper, but didn't have much luck here.

Motivation

Extended clientside aggregation avoids retransmitting the name & tag of the same metric reported multiple times; this can significantly increase the serialization performance on both the client and server, and reduces network traffic. I'd like to experiment with rolling out extended clientside aggregation to some services within Stripe, and this is the first step in doing so.

Test plan

I've added automated tests covering the new multi-value code. I've also stood up a local copy of the Veneur server & client, and verified that metrics published using extended clientside aggregation are correctly ingested by the server.

praboud-stripe · 2023-02-27T16:33:56Z

cc @bobby-stripe

bobby-stripe · 2023-02-27T17:44:28Z

samplers/parser.go

 	}
-	nameChunk := pipeSplitter.Chunk()[:startingColon]
-	valueChunk := pipeSplitter.Chunk()[startingColon+1:]
+	nameChunk := packet[:valueStart]


we can probably use bytes.Cut here to simplify the logic

Hmm, this is a good suggestion. I tried implementing this locally. It does make the code a fair bit easier to read, but it benchmarks worse than the baseline implementation on master. I'm leaning towards sticking with the bytes.IndexByte implementation.

bobby-stripe · 2023-02-27T17:46:01Z

samplers/parser.go

 	}
-	typeChunk := pipeSplitter.Chunk()
+	typeChunk := packet[typeStart+1 : tagsStart]


again bytes.Cut might be cleaner than bytes.IndexByte for this logic (but maybe not! if you don't think so please ignore)

bobby-stripe · 2023-02-27T17:46:29Z

samplers/parser.go

 	case 'd', 'h': // consider DogStatsD's "distribution" to be a histogram
-		ret.Type = "histogram"
+		metric.Type = "histogram"
 	case 'm': // We can ignore the s in "ms"


* Report allocs * Add support for multiple values in metric packet * Fix error message * Don't use SplitBytes * CHANGELOG.md entry * gofmt * Rerun tests * Rerun tests

praboud-stripe added 5 commits February 27, 2023 10:09

Report allocs

5e6185d

Add support for multiple values in metric packet

47e6a15

Fix error message

4b8012e

Don't use SplitBytes

84a4860

CHANGELOG.md entry

c7c0fb6

praboud-stripe force-pushed the praboud-multi branch from 7db3f89 to c7c0fb6 Compare February 27, 2023 15:09

praboud-stripe added 3 commits February 27, 2023 10:11

gofmt

3112feb

Rerun tests

09afe46

Rerun tests

7bc688c

praboud-stripe requested a review from arnavdugar-stripe February 27, 2023 16:07

arnavdugar-stripe approved these changes Feb 27, 2023

View reviewed changes

bobby-stripe reviewed Feb 27, 2023

View reviewed changes

praboud-stripe added 2 commits February 27, 2023 14:28

Merge branch 'master' into praboud-multi

1d31613

Merge branch 'master' into praboud-multi

64bf568

praboud-stripe merged commit b027d67 into master Feb 27, 2023

praboud-stripe deleted the praboud-multi branch February 27, 2023 19:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for extended clientside aggregation #1048

Add support for extended clientside aggregation #1048

praboud-stripe commented Feb 27, 2023

praboud-stripe commented Feb 27, 2023

bobby-stripe Feb 27, 2023

praboud-stripe Feb 27, 2023

bobby-stripe Feb 27, 2023

bobby-stripe Feb 27, 2023

Add support for extended clientside aggregation #1048

Add support for extended clientside aggregation #1048

Conversation

praboud-stripe commented Feb 27, 2023

Summary

Motivation

Test plan

praboud-stripe commented Feb 27, 2023

bobby-stripe Feb 27, 2023

Choose a reason for hiding this comment

praboud-stripe Feb 27, 2023

Choose a reason for hiding this comment

bobby-stripe Feb 27, 2023

Choose a reason for hiding this comment

bobby-stripe Feb 27, 2023

Choose a reason for hiding this comment