gh-82054: allow test runner to split test_asyncio #103859

zitterbewegung · 2023-04-25T23:09:08Z

Summary:

This runs the test_asyncio sub-tests in parallel using the sharding approach from Cinder. test_asyncio is typically a long pole in test runs because it is a package with many sub-tests that run serially. By breaking out the sub-tests as independent modules, we can run many more of them in parallel.
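A minimal sketch of the idea, not the code in this PR: the runner swaps a package name for its sub-test module names before scheduling, so each sub-test becomes its own unit of parallel work. The names `SPLIT_PACKAGES` and `expand_split_packages` and the `submodules` mapping are hypothetical; a real runner would discover the sub-test modules by listing the package directory.

```python
from typing import Iterable

# Hypothetical set of packages whose sub-tests should be scheduled individually.
SPLIT_PACKAGES = {"test_asyncio"}


def expand_split_packages(
    test_names: Iterable[str],
    submodules: dict[str, list[str]],
    split_packages: set[str] = SPLIT_PACKAGES,
) -> list[str]:
    """Replace each split package with its dotted sub-test module names.

    `submodules` maps a package name to the sub-test modules found in it
    (an assumption for this sketch; it stands in for directory discovery).
    """
    expanded: list[str] = []
    for name in test_names:
        if name in split_packages and submodules.get(name):
            # One schedulable unit per sub-test module instead of one per package.
            expanded.extend(f"{name}.{sub}" for sub in sorted(submodules[name]))
        else:
            expanded.append(name)
    return expanded


if __name__ == "__main__":
    found = {"test_asyncio": ["test_tasks", "test_events", "test_futures"]}
    print(expand_split_packages(["test_os", "test_asyncio", "test_zlib"], found))
```

Packages not listed in `split_packages`, or with no discovered sub-tests, pass through unchanged, so the rest of the test list is unaffected.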

After porting, the direct impact is large (roughly a 30% reduction in make test wall-clock time):

Without this change:
- make test takes 5 min 26 sec

With this change:
- make test takes 3 min 45 sec

The drawbacks are that the implementation is hacky, that the sorting of the tests obscures when the asyncio tests run, and that it changes CPython test infrastructure. Still, the time saved is worth it: the change is not complicated, and I think the productivity win shown above is significant.

Issue: unittest: execute tests in parallel #82054

carljm

Looks good! Just a couple comments.

Lib/test/libregrtest/runtest.py

Misc/NEWS.d/next/Tests/2023-04-25-23-56-04.gh-issue-gh-82054.3GgB66.rst

bedevere-bot · 2023-04-26T00:06:45Z

A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated.

Once you have made the requested changes, please leave a comment on this pull request containing the phrase "I have made the requested changes; please review again". I will then notify any core developers who have left a review that you're ready for them to take another look at this pull request.

carljm · 2023-04-26T16:16:48Z

@zitterbewegung in order to make the case for landing this, it would be useful to update the PR description with some current data. Copying the description from the Cinder diff isn't really useful, since it's from a much older version. What would be helpful is to run make test with and without this change, and report any improvement in overall runtime.

It also may be that test_asyncio is no longer the longest pole, and we may need to apply the same treatment to some other modules (test_multiprocessing?) in order to see a significant benefit in overall runtime of make test.
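To illustrate why the longest module bounds the parallel runtime, here is a toy scheduling model. It is a sketch under stated assumptions, not how regrtest orders or dispatches tests: jobs are handed out longest first to whichever worker frees up next, and all durations are made up.

```python
import heapq


def makespan(durations: list[float], workers: int) -> float:
    """Wall-clock time when jobs go, longest first, to the next free worker."""
    finish_times = [0.0] * workers  # min-heap of each worker's finish time
    for duration in sorted(durations, reverse=True):
        earliest = heapq.heappop(finish_times)
        heapq.heappush(finish_times, earliest + duration)
    return max(finish_times)


if __name__ == "__main__":
    # Hypothetical numbers: one 100 s module plus ten 10 s modules, 4 workers.
    unsplit = [100] + [10] * 10
    # The same work with the 100 s module split into ten 10 s pieces.
    split = [10] * 20
    print(makespan(unsplit, 4), makespan(split, 4))
```

In this model the unsplit run can never finish faster than its single longest module, while the split run spreads the same total work evenly across the workers.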

zitterbewegung · 2023-04-26T17:58:43Z

@carljm I see an overall benefit of 100 seconds compared with CPython's baseline. I have attached both runs.
To give explicit timings: before this change, make test takes 5 min 26 sec; with the change, it takes 3 min 46 sec (100 seconds saved). Adding test_multiprocessing gives us a 5-second improvement.
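For reference, the arithmetic behind these figures can be checked with a short sketch; the helper name `to_seconds` is only for illustration, and the inputs are the two timings quoted above.

```python
def to_seconds(minutes: int, seconds: int) -> int:
    """Convert a min/sec wall-clock reading to whole seconds."""
    return minutes * 60 + seconds


before = to_seconds(5, 26)   # baseline make test run
after = to_seconds(3, 46)    # make test run with this change
saved = before - after       # absolute saving in seconds
reduction = saved / before   # fraction of the baseline runtime removed
speedup = before / after     # how many times faster the patched run is

print(f"saved {saved} s, {reduction:.1%} less wall-clock time, {speedup:.2f}x speedup")
# prints: saved 100 s, 30.7% less wall-clock time, 1.44x speedup
```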

This was run on an AMD Ryzen Threadripper 3970X 32-Core Processor with 128 GB of RAM.

Patched_run_of_tests.txt
original_time_cpython.txt

carljm · 2023-04-26T22:36:38Z

@zitterbewegung Awesome, those are great results! Can you directly update the PR description/summary to eliminate the text copied from the Cinder diff and instead just summarize these results?

zitterbewegung · 2023-04-26T22:52:46Z

> @zitterbewegung Awesome, those are great results! Can you directly update the PR description/summary to eliminate the text copied from the Cinder diff and instead just summarize these results?

Updated summary with new results.

carljm · 2023-04-26T23:51:31Z

@zitterbewegung Ok, one last thing I see is that make patchcheck is failing on the Azure Pipelines CI job, due to something it doesn't like about the whitespace in runtest.py. Pretty nitpicky, but we do want to keep the CI green; can you run make patchcheck locally and try to adjust the diff such that patchcheck is happy?

zitterbewegung · 2023-04-27T11:57:48Z

> @zitterbewegung Awesome, those are great results! Can you directly update the PR description/summary to eliminate the text copied from the Cinder diff and instead just summarize these results?

Done.

zitterbewegung · 2023-04-27T15:01:11Z

@carljm I accidentally closed this PR while I was adding test_multiprocessing, which saved 6 more seconds. The new PR is #103927.

zitterbewegung · 2023-04-27T15:26:38Z

> @zitterbewegung Ok, one last thing I see is that make patchcheck is failing on the Azure Pipelines CI job, due to something it doesn't like about the whitespace in runtest.py. Pretty nitpicky, but we do want to keep the CI green; can you run make patchcheck locally and try to adjust the diff such that patchcheck is happy?

I ran make patchcheck after making the changes.

bedevere-bot mentioned this pull request Apr 25, 2023

unittest: execute tests in parallel #82054

Closed

bedevere-bot added the awaiting review label Apr 25, 2023

zitterbewegung marked this pull request as draft April 25, 2023 23:18

zitterbewegung marked this pull request as ready for review April 25, 2023 23:24

zitterbewegung changed the title ~~gh-82054: Porting execution of tests in parallel by sharding (note this only is for asyncio and compiler)~~ gh-82054: Porting parallel test execution from cinder by sharding (note this only is for asyncio and compiler) Apr 25, 2023

carljm added the skip news label Apr 26, 2023

carljm requested changes Apr 26, 2023

bedevere-bot added awaiting changes and removed awaiting review labels Apr 26, 2023

zitterbewegung changed the title ~~gh-82054: Porting parallel test execution from cinder by sharding (note this only is for asyncio and compiler)~~ gh-82054: Porting parallel test execution from cinder by sharding (note this only is for asyncio) Apr 26, 2023

carljm changed the title ~~gh-82054: Porting parallel test execution from cinder by sharding (note this only is for asyncio)~~ gh-82054: allow test runner to split test_asyncio for better parallel execution Apr 26, 2023

zitterbewegung closed this Apr 27, 2023

zitterbewegung force-pushed the main branch from 7848c65 to b701dce Compare April 27, 2023 14:32

zitterbewegung changed the title ~~gh-82054: allow test runner to split test_asyncio for better parallel execution~~ gh-82054: allow test runner to split test_asyncio and test_multiprocessing for better parallel execution Apr 27, 2023

zitterbewegung changed the title ~~gh-82054: allow test runner to split test_asyncio and test_multiprocessing for better parallel execution~~ gh-82054: allow test runner to split test_asyncio Apr 27, 2023
