Implement (opt-in) WriteBatchRawV2 that can batch across namespaces #1974

richardartoul · 2019-10-01T19:41:08Z

For workloads that involve multiple namespaces the existing implementation can lead to a lot of extraneous RPC which leads to increased load / instability of the M3DB nodes. This P.R adds the ability to opt-in on the client (when working with versions of M3DB that support the new APIs) to use a new API that batches writes across namespaces transparently leading to improved performance.

codecov · 2019-10-01T19:47:11Z

Codecov Report

Merging #1974 into master will increase coverage by <.1%.
The diff coverage is 74.4%.

@@            Coverage Diff            @@
##           master    #1974     +/-   ##
=========================================
+ Coverage    63.4%    63.4%   +<.1%     
=========================================
  Files        1119     1119             
  Lines      105570   105968    +398     
=========================================
+ Hits        66969    67253    +284     
- Misses      34315    34402     +87     
- Partials     4286     4313     +27

Flag	Coverage Δ
#aggregator	`79.7% <ø> (-0.1%)`	⬇️
#cluster	`56.3% <ø> (ø)`	⬆️
#collector	`63.7% <ø> (ø)`	⬆️
#dbnode	`64.8% <74.4%> (ø)`	⬆️
#m3em	`59.6% <ø> (ø)`	⬆️
#m3ninx	`61.1% <ø> (ø)`	⬆️
#m3nsch	`51.1% <ø> (ø)`	⬆️
#metrics	`17.7% <ø> (ø)`	⬆️
#msg	`74.9% <ø> (ø)`	⬆️
#query	`68.2% <ø> (ø)`	⬆️
#x	`75% <ø> (-0.1%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 67da159...04bc245. Read the comment docs.

justinjc · 2019-10-02T17:19:08Z

src/dbnode/client/config.go

+
+	// UseV2BatchAPIs determines whether the V2 batch APIs are used. Note that the M3DB nodes must
+	// have support for the V2 APIs in order for this feature to be used.
+	UseV2BatchAPIs *bool `yaml:"useV2BatchAPIs"`


Thinking about this in the long term, is this sustainable? The config might be littered with different versions for different sets of APIs and it could get very messy. How about just a single APIVersion *string where users will put in "0.2.3" or something similar?

The plan is to deprecate the old APIs soon so I'm not super worried about it but I can make it a string. I'll probably keep it a bool in the guts of the codebase though just to keep things simple

src/dbnode/generated/thrift/rpc.thrift

justinjc · 2019-10-02T17:40:41Z

src/dbnode/generated/thrift/rpc.thrift

+struct WriteBatchRawV2RequestElement {
+	1: required binary id
+	2: required Datapoint datapoint
+	3: required i64 nameSpace


I understand that this might save a ton of memory, but might be awkward to use from a user perspective. I suppose there can be a thin wrapper around this interface to make this easier.

Why not go all out and have list<binary> ids at the request level and required i64 id here instead?

No one will interact with this directly, it all is handled transparently by the client so I think thats fine.

I've spoken with Rob before about the IDs thing and its pretty uncommon to have multiple data points for the same ID in one request so I'm gonna leave that out for now for simplicity

justinjc · 2019-10-02T17:42:16Z

src/dbnode/generated/thrift/rpc.thrift

 struct WriteTaggedBatchRawRequestElement {
 	1: required binary id
 	2: required binary encodedTags
 	3: required Datapoint datapoint
 }

+struct WriteTaggedBatchRawV2RequestElement {
+	1: required binary id


Same comment regarding list of ids at the request level here too.

same response

src/dbnode/client/write_tagged_op.go

src/dbnode/client/write_op.go

justinjc · 2019-10-02T18:14:27Z

src/dbnode/client/host_queue.go

+	writeOpBatchSize                             tally.Histogram
+	fetchOpBatchSize                             tally.Histogram
+	status                                       status
+	serverSupportsV2APIs                         bool


Similar comment to above regarding versioning. Imagining a serverSupportsV3APIs and serverSupportsV4APIs tag here later on is quite painful.

I think for this portion I'll leave as is as a bool because with an enum you need to handle the case where its an invalid value. Hopefully we can just delete this code soon

src/dbnode/client/host_queue.go

justinjc · 2019-10-02T18:34:10Z

src/dbnode/network/server/tchannelthrift/node/service.go

+		}
+
+		seriesID := s.newPooledID(ctx, elem.ID, pooledReq)
+		batchWriter.Add(


Can batchWriter be nil at this stage?

No I don't think so. It starts off as nil and it will get set in the first iteration of the loop. Everytime after that where it gets set to nil (caus a batch was written) we assign a new one

justinjc · 2019-10-02T18:35:57Z

src/dbnode/network/server/tchannelthrift/node/service.go

+		}
+
+		seriesID := s.newPooledID(ctx, elem.ID, pooledReq)
+		batchWriter.AddTagged(


Can batchWriter be nil at this stage?

same comment

justinjc

LGTM

richardartoul force-pushed the ra/write-batch-multi-ns branch from f2b5d09 to 65bf84a Compare October 1, 2019 19:57

justinjc reviewed Oct 2, 2019

View reviewed changes

Richard Artoul added 5 commits October 3, 2019 11:07

Implement WriteBatchRawV2 that can batch across namespaces

721c555

wire up usev2 flag to client YAML config

ea9e2bb

delete dead code

506e4e8

fix broken test

cb20692

factor out helpers

04bc245

richardartoul force-pushed the ra/write-batch-multi-ns branch from ae3c47d to 04bc245 Compare October 3, 2019 15:07

justinjc approved these changes Oct 3, 2019

View reviewed changes

Merge branch 'master' into ra/write-batch-multi-ns

ad068d3

richardartoul merged commit 193cc7e into master Oct 4, 2019

richardartoul deleted the ra/write-batch-multi-ns branch October 4, 2019 13:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement (opt-in) WriteBatchRawV2 that can batch across namespaces #1974

Implement (opt-in) WriteBatchRawV2 that can batch across namespaces #1974

richardartoul commented Oct 1, 2019 •

edited

Loading

codecov bot commented Oct 1, 2019 •

edited

Loading

justinjc Oct 2, 2019

richardartoul Oct 2, 2019

justinjc Oct 2, 2019

richardartoul Oct 2, 2019

justinjc Oct 2, 2019

richardartoul Oct 2, 2019

justinjc Oct 2, 2019

richardartoul Oct 2, 2019

justinjc Oct 2, 2019

richardartoul Oct 3, 2019

justinjc Oct 2, 2019

richardartoul Oct 3, 2019

justinjc left a comment

Implement (opt-in) WriteBatchRawV2 that can batch across namespaces #1974

Implement (opt-in) WriteBatchRawV2 that can batch across namespaces #1974

Conversation

richardartoul commented Oct 1, 2019 • edited Loading

codecov bot commented Oct 1, 2019 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

justinjc left a comment

Choose a reason for hiding this comment

richardartoul commented Oct 1, 2019 •

edited

Loading

codecov bot commented Oct 1, 2019 •

edited

Loading