Speed up parallelized CI #378

eb8680 · 2023-11-14T15:13:18Z

This PR adds MKL_NUM_THREADS=1 to the unit test run in our CI workflow. This environment variable restricts the maximum number of CPU threads that an Intel MKL implementation of a tensor operation can use. When running many small PyTorch computations in parallel processes on an Intel CPU, operation-level parallelism within kernels can interfere with process-level parallelism, resulting in significant slowdowns compared to executing operation kernels within the process that invoked them. This tweak seems to have reduced our unit test CI times by ~25%.

mkl_num_threads=1

e5bb9ed

eb8680 added testing status:WIP Work-in-progress not yet ready for review labels Nov 14, 2023

eb8680 self-assigned this Nov 14, 2023

eb8680 changed the title ~~Investigate (lack of) speedup in parallelized CI~~ Speed up parallelized CI Nov 14, 2023

eb8680 requested a review from SamWitty November 14, 2023 17:28

eb8680 added status:awaiting review Awaiting response from reviewer and removed status:WIP Work-in-progress not yet ready for review labels Nov 14, 2023

SamWitty approved these changes Nov 14, 2023

View reviewed changes

eb8680 merged commit d6a3521 into master Nov 14, 2023
4 checks passed

eb8680 deleted the eb-fix-ci-parallel branch November 14, 2023 19:30

rfl-urbaniak pushed a commit that referenced this pull request Nov 29, 2023

Speed up parallelized CI (#378)

f4d72c0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up parallelized CI #378

Speed up parallelized CI #378

eb8680 commented Nov 14, 2023 •

edited

Loading

Speed up parallelized CI #378

Speed up parallelized CI #378

Conversation

eb8680 commented Nov 14, 2023 • edited Loading

eb8680 commented Nov 14, 2023 •

edited

Loading