Speed up uploads by uploading chunks in parallel #263
There was another request from the services team to make the uploading happen in parallel. I am a bit sceptical about this idea because we usually saturate the uplink with serial uploading. However, there might be data centers where this saturation cannot be reached because of reverse-proxy configurations disallowing a larger chunk size. I am going to introduce an option to the upload/artifact command that enables parallelism (e.g., …).
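A hypothetical shape for such an option, sketched with click (which pulp-cli is built on); the flag name `--parallel` and the command wiring here are assumptions, not actual pulp-cli code:

```python
import click

@click.command()
@click.option("--file", type=click.Path(exists=True), required=True)
@click.option(
    "--parallel",
    type=int,
    default=1,
    show_default=True,
    help="Number of chunks to upload concurrently (1 keeps the current serial behavior).",
)
def upload(file: str, parallel: int) -> None:
    # Hypothetical command body: dispatch to a serial or parallel
    # uploader depending on the requested degree of parallelism.
    ...
```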
You may want to make the number of parallel threads or processes configurable somehow, as I imagine different Pulp instances could support different levels of throughput.
Doing uploads in parallel requires adding some notion of parallel execution to the CLI codebase, which would be unprecedented. I would like to see some clear statements with numbers before gauging the need for adding this kind of complexity to the project. E.g., I cannot say whether pulp-glue is thread-safe. Do we want to rewrite it in async Python, using aiohttp instead of requests?
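For context, a minimal sketch of what thread-based parallel chunk uploads could look like on top of `requests`; the endpoint shape and `Content-Range` usage are modeled on Pulp's chunked-upload API, but everything here (base URL, auth handling, helper names) is an assumption rather than verified pulp-glue code. Each worker issues its own request instead of sharing a session, precisely because thread safety is the open question:

```python
from concurrent.futures import ThreadPoolExecutor

import requests

CHUNK_SIZE = 10 * 1024 * 1024  # 10 MB, as in the experiments below

def upload_chunk(base_url, upload_href, path, offset, size, total, auth):
    # Each worker reads its own slice of the file and PUTs it with a
    # Content-Range header over its own connection (no shared Session).
    with open(path, "rb") as fp:
        fp.seek(offset)
        data = fp.read(size)
    headers = {"Content-Range": f"bytes {offset}-{offset + len(data) - 1}/{total}"}
    response = requests.put(
        f"{base_url}{upload_href}", headers=headers, files={"file": data}, auth=auth
    )
    response.raise_for_status()

def upload_parallel(base_url, upload_href, path, total, auth, workers=4):
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [
            pool.submit(
                upload_chunk, base_url, upload_href, path, offset,
                min(CHUNK_SIZE, total - offset), total, auth,
            )
            for offset in range(0, total, CHUNK_SIZE)
        ]
        for future in futures:
            future.result()  # re-raise any upload error in the caller
```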
I made a couple of experiments locally (using oci_env) and here are the results. It appears that uploading chunks in parallel improved the performance by 50%. I did not spend much time writing quality code or using any optimization techniques besides splitting an uploaded file into 4 chunks and then uploading those chunks in sub-chunks in parallel.

Test 1: With creating an artifact (DB reset between runs, uploading one commit in a tarball, 717.7 MB, 10 MB chunks)

Test 2: Without creating an artifact (DB reset between runs, uploading one commit in a tarball, 717.7 MB, 10 MB chunks)
Changes made to pulp-glue: https://gist.github.com/lubosmj/1d736226c1816fb019430e7fb78cdd55. Changes made to pulp-cli-ostree: https://gist.github.com/lubosmj/3bc14338713ab9a55343359ff49829b1. I used processes (https://pypi.org/project/multiprocess/, for easier function pickling) to perform the uploads.
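The gists are the reference for the real changes; the following is only a structural sketch of that process-based splitting, assuming a hypothetical `put_chunk` helper in place of the actual per-chunk PUT request:

```python
from multiprocess import Pool  # pypi "multiprocess": dill-based pickling of workers

CHUNK_SIZE = 10 * 1024 * 1024  # 10 MB sub-chunks, matching the test setup

def put_chunk(path, offset, size):
    # Hypothetical helper standing in for the actual per-chunk PUT request;
    # see the gists above for the real pulp-glue changes.
    ...

def upload_range(args):
    # Worker: upload one contiguous file range in CHUNK_SIZE sub-chunks,
    # each process holding its own TCP connection.
    path, start, end = args
    for offset in range(start, end, CHUNK_SIZE):
        put_chunk(path, offset, min(CHUNK_SIZE, end - offset))

def parallel_upload(path, total_size, processes=4):
    # Split the file into `processes` contiguous ranges, one per worker.
    step = -(-total_size // processes)  # ceiling division
    ranges = [
        (path, start, min(start + step, total_size))
        for start in range(0, total_size, step)
    ]
    with Pool(processes) as pool:
        pool.map(upload_range, ranges)
```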
TCP congestion control is designed to manage the flow of data to prevent network congestion and ensure fairness among multiple connections. However, this mechanism primarily operates on a per-connection basis. This is what we are trying to bypass by uploading in parallel, right? Multiple TCP connections from a single host can then easily saturate the uplink.
The following experiment supports that theory. When uploading commits to staging, I am getting amazing results: almost 4 times better performance, judging by the upload speed and uplink utilization.

Test 1: Serial uploading (1 MB chunks, 1 TCP connection, 1.3 GB in total)

Test 2: Parallel uploading (1 MB chunks, 4 parallel processes, 4 TCP connections, 1.4 GB in total)
Tested with the following changes applied on top of the respective main branches: lubosmj@8d57381, lubosmj/pulp-cli-ostree@0c4f3ae. The OSTree commits were taken from https://autosd.sig.centos.org/AutoSD-9/nightly/ostree-repos/.
Right now, chunk uploads happen sequentially, but we could speed them up by running them in parallel.
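For illustration, the current serial pattern looks roughly like this (a simplified sketch, not the actual pulp-glue code; `put_chunk` is a hypothetical per-chunk PUT helper). Parallelism would replace this loop with concurrent workers:

```python
def serial_upload(path, total_size, chunk_size):
    # Simplified sketch of today's behavior: chunks go out one after
    # another, so a single TCP connection carries the whole file.
    with open(path, "rb") as fp:
        offset = 0
        while offset < total_size:
            data = fp.read(chunk_size)
            put_chunk(data, offset, total_size)  # hypothetical PUT helper
            offset += len(data)
```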