server: set multiple concurrentReadTx instances share one txReadBuffer #12933
Conversation
Force-pushed a4bda87 to 34793bf.
Codecov Report
@@            Coverage Diff             @@
##           master   #12933      +/-   ##
==========================================
- Coverage   73.31%   72.84%    -0.47%
==========================================
  Files         430      430
  Lines       34182    34212       +30
==========================================
- Hits        25060    24922      -138
- Misses       7194     7350      +156
- Partials     1928     1940       +12
Force-pushed 0639cca to 2f9612b.
This is a patch inspired by #12529.
A different approach is proposed in #12935.
Here are the performance graphs comparing the latest master branch and the concurrent read tx optimization code. Data file:
Force-pushed 79ee062 to 0cbc33f.
Force-pushed 0cbc33f to 07aabea.
Test results look great.
Force-pushed 1484273 to bcc8aae.
I have to mention that the integration test with 4 CPUs failed once and passed this time. I want to make sure it is not a bug in my code that can make the integration test fail.
@jingyih, please take a look at this patch if you have time and give some suggestions.
Can someone help add …
Can we explain why serving from a potentially stale read buffer is safe?
Thank you, Gyuho, for the review comments. I updated the code per your suggestions. I like the new code logic you posted in the review and am following it, but please check my updated logic and decide whether the last if-condition is correct. 🙏
Ah, I see. Do you have an idea why the previous tests were much better? Either way, this was a great exercise. Once the script is contributed, we can look into some ways to automate it.
I was comparing my first patch with the latest updated patch, so ideally the ratio should be 1.0 because we did not change the real logic much beyond addressing the review issues; the difference should be within acceptable variance. I am going to post a comparison between the main branch and the latest shared_txReadBuffer branch soon. The files I compared were concurrentReadTx-optimized2.csv and concurrentReadTx-optimized.csv, so that plot is a little misleading. I will post a new update soon comparing the latest patch vs. the new main branch.
Hi guys, these are the performance graphs. The first graph is the latest main branch performance, the second is the performance of this patch, and the third is the comparison between the two. It looks like we can still get a similar performance improvement.
I will file another PR to check in the benchmark script.
@ptabor There are no new changes except merging with the latest code.
@wilsonwang371 The latest graph above is way better than #12933 (comment). Why is there so much delta when the logic is the same?
Oh, they are different comparisons: A (main branch) → B (initial pull-request patch) → C (latest pull-request patch). B vs. C is the graph in #12933 (comment).
This PR is rebased on the latest main branch, right?
So, this is based on the current …
The main branch result is based on commit 932d42b.
The result of this patch is rebased on commit aeb9b5f.
Shall I rebase it on the latest main branch and do the comparison again? It seems like the recent new changes do not make an observable difference in performance.
As long as it has your PR #12896, it shouldn't affect the results. That said, 46b49a6 is the recent commit around storage. If we can rebase on the latest commit rather than the one in …
This was already done a few hours ago; I did it through the GitHub UI, and the check results were generated after that merge. But generating the performance graphs with Python as done above will take around 24~48 hours.
Hi Gyuho, this is the performance result of the pull request based on 88d26c1. The performance is the same as the code used in the previous comments.
@wilsonwang371 Really solid testing data; greatly appreciated. Give me some time to go through this again, and I will make some suggestions for the comments.
Hi Gyuho and Piotr, is it still possible to have this patch merged, tested, and shipped in the 3.5 release? What is our next step before the final 3.5 release is available?
@wilsonwang371 I will get back to you by this weekend.
Can you add the requested comments to explain the logic inline?
@wilsonwang371 Can we also remove the merge commit and squash/rebase from the latest main branch? Thanks.
Force-pushed 88d26c1 to 9c82e8c.
lgtm. Thanks for the awesome work @wilsonwang371.
Can we also backport this to the release-3.5 branch?
Thank you, Wilson. +1 to backporting (I think there are 3 chained PRs to cherry-pick), and the performance impact of this change is very significant. According to https://www.kubernetes.dev/resources/release/, the k8s code freeze is …
Currently, concurrentReadTx makes a copy of the txReadBuffer inside the backend readTx. However, this buffer is never modified by concurrentReadTx, so a single copy of the txReadBuffer can be shared as long as no new changes have been committed to the backend DB. We therefore use a cached txReadBuffer to avoid excessive deep-copy operations when creating a new read transaction.
More performance testing results will be uploaded later.
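To make the idea concrete, here is a minimal, hypothetical Go sketch of the buffer-sharing scheme. The type and field names below (`backend`, `txReadBuffer`, `version`, `ConcurrentReadTx`, `commit`) are simplified stand-ins rather than the actual etcd implementation: the point is only that the deep copy happens once per commit instead of once per read transaction.

```go
package main

import (
	"fmt"
	"sync"
)

// txReadBuffer stands in for the backend's write-back read buffer; here it is
// just a map snapshot plus a version counter (hypothetical simplification).
type txReadBuffer struct {
	version uint64
	data    map[string]string
}

// deepCopy models the expensive per-transaction copy this patch avoids.
func (b *txReadBuffer) deepCopy() *txReadBuffer {
	cp := &txReadBuffer{version: b.version, data: make(map[string]string, len(b.data))}
	for k, v := range b.data {
		cp.data[k] = v
	}
	return cp
}

// backend caches one shared copy of the read buffer. The copy is refreshed
// only when the live buffer's version has advanced, i.e. after a commit.
type backend struct {
	mu           sync.Mutex
	buf          *txReadBuffer // live buffer, mutated by writes
	cachedBuf    *txReadBuffer // shared read-only copy handed to readers
	cachedBufVer uint64
}

// ConcurrentReadTx returns a read-only view. Instead of deep-copying the live
// buffer on every call, it reuses the cached copy while the version matches.
func (b *backend) ConcurrentReadTx() *txReadBuffer {
	b.mu.Lock()
	defer b.mu.Unlock()
	if b.cachedBuf == nil || b.cachedBufVer != b.buf.version {
		b.cachedBuf = b.buf.deepCopy() // re-copy only after new commits
		b.cachedBufVer = b.buf.version
	}
	return b.cachedBuf // many concurrent readers share this one copy
}

// commit simulates a batch commit that changes the buffer contents.
func (b *backend) commit(k, v string) {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.buf.data[k] = v
	b.buf.version++
}

func main() {
	be := &backend{buf: &txReadBuffer{data: map[string]string{"a": "1"}}}
	r1 := be.ConcurrentReadTx()
	r2 := be.ConcurrentReadTx()
	fmt.Println(r1 == r2) // true: both readers share the cached copy
	be.commit("b", "2")
	r3 := be.ConcurrentReadTx()
	fmt.Println(r1 == r3) // false: the commit bumped the version, forcing a fresh copy
}
```

Since readers never mutate the shared copy, sharing it is safe as long as the invalidation check catches every commit; the real invalidation condition in the PR is more involved than this version counter, which only illustrates the sharing idea.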
@gyuho @ptabor
Guys, please take a look at this and let me know if there are any concerns or issues.