enhancement(loki sink): Partition HTTP requests to loki by stream & re-enable concurrency #8615
Conversation
// Since all of the events are in the same partition and there is concurrency,
// then if ordering inside partitions isn't upheld, events with large lines will
// take longer to flush than events with small lines, so Loki will receive the
// small ones before the large ones, hence out-of-order events.
test_out_of_order_events(OutOfOrderAction::Drop, batch_size, events.clone(), events).await;
Hmm, this test is fundamentally racy, isn't it? Is there any way to provoke out of order events without racing?
It is.

> Is there any way to provoke out of order events without racing

Not really. There are two other ways to provoke them:
- Fail requests and force them to retry.
  - This isn't quite testing what it needs to, since the retry logic can correct the ordering.
  - A mock Loki server which we can configure to fail requests would be needed.
- Stall the start of the server so that requests pile up.
  - This is yet again racy.

What we can do is increase the number of events, which raises the chance of an out-of-order event happening; see the sketch below.
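A minimal sketch of that idea (not the PR's actual test code; helper names here are illustrative): alternating very large and very small lines makes it more likely that, with concurrent in-flight requests, a small-line batch finishes before an earlier large-line one.

```rust
// Sketch only: build a mixed set of lines so that, under concurrency, batches of
// small lines tend to flush faster than batches of large lines, making an
// out-of-order arrival at Loki more likely the more events we generate.
fn build_mixed_size_lines(count: usize) -> Vec<String> {
    (0..count)
        .map(|i| {
            if i % 2 == 0 {
                // "Large" line: the padding makes this request body slower to send.
                format!("event {:05} {}", i, "x".repeat(64 * 1024))
            } else {
                // "Small" line.
                format!("event {:05}", i)
            }
        })
        .collect()
}
```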
@ktff It'd be nice to merge this by Monday if we can to get it into the next release. I see it approved but just want to make sure you didn't want to make any other tweaks. Nice work on re-enabling concurrency here; it'll make a lot of people happy.
@ktff Hi, partitioning of requests by stream is not suitable for the aggregator role, because an aggregator sees a large number of label combinations, which eventually makes sending to Loki slow. In my case, events end up stacked in the buffer.
@wgb1990 By buffer I assume you mean the batch. Events shouldn't be in a batch for more than
If, on the other hand, you mean that throughput is low because you have a large number of streams but only a few events per stream per second, then yes, the older solution would be better for that use case. That is doable, so in that case please open a feature issue describing your use case. I imagine it could be done with a simple enum option or something more dynamic; a rough illustration follows.
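Purely as a hypothetical sketch of what such an option could look like (this type is not part of this PR or of Vector's config), a config enum could let high-cardinality aggregator deployments fall back to coarser partitioning:

```rust
use serde::Deserialize;

/// Hypothetical illustration only: a knob of this shape could select the
/// partitioning granularity for Loki requests. Requires serde's `derive` feature.
#[derive(Clone, Copy, Debug, Deserialize)]
#[serde(rename_all = "snake_case")]
enum RequestPartitioning {
    /// One request queue per tenant (the previous behaviour).
    Tenant,
    /// One request queue per (tenant, label set) pair (this PR's behaviour).
    Stream,
}
```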
Closes #6041
Closes #6932
This PR adds partitioning of requests by stream, on top of the existing partitioning by tenant. To enable this, the inner struct `PartitionBatchSink` is also extended with the ability to order requests per partition. This ordering guarantee could perhaps be useful as a configurable option on some other sinks. As a side effect, in order to test this, concurrency in the sink is also re-enabled.
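A rough sketch of the idea (field and type names here are illustrative, not the PR's exact code): the batch partition key is derived from the tenant and the full label set, so each Loki stream gets its own batches and its own ordered queue of in-flight requests.

```rust
use std::collections::BTreeMap;

// Sketch only: events hashing to the same PartitionKey are batched together and
// their requests are kept in order, while different keys can be sent concurrently.
#[derive(Clone, PartialEq, Eq, Hash)]
struct PartitionKey {
    tenant_id: Option<String>,
    // BTreeMap keeps labels sorted, so identical label sets map to the same partition.
    labels: BTreeMap<String, String>,
}
```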