Stablize cargo test #277

yihuaf · 2021-09-06T22:46:21Z

The root cause of the flakiness observed in the hook, clean up file descriptors, and channel tests are the result of running the tests in parallel in the same process by cargo test. Many of the operations have side effects on the underlying OS and the process, such as closing down fds. These tests also depends on the underlying OS state such as file descriptor states. The best we can do is to mark these tests as serial and properly clean up any left over resources such as opened fds before the test ends. If the underlying OS decided to generate errors for some of the IO calls we are doing, then there is not much we can do. Not sure if retry is a valid solution here and retry generally will make the code much more complex.

Note: runc actually doesn't have unit tests on most of these operations. The other solution is to fully depend on the integration tests to test out these code path. It may be the reasonable thing to do if these unit tests prove to be hard to get right.

Note2: Currently, I ran cargo test for 100 iterations and no failed tests. There may still be flakiness, but it should be extremely rare.

codecov-commenter · 2021-09-06T22:48:50Z

Codecov Report

Merging #277 (4c5d2d6) into main (ef9a92a) will increase coverage by 0.01%.
The diff coverage is 53.84%.

@@            Coverage Diff             @@
##             main     #277      +/-   ##
==========================================
+ Coverage   69.57%   69.59%   +0.01%     
==========================================
  Files          46       46              
  Lines        5710     5651      -59     
==========================================
- Hits         3973     3933      -40     
+ Misses       1737     1718      -19

channel test parent process should call waitpid in the end. Parent should be able to wait before child process completly exits. For our purpose, rust test runner don't like multi processes. Serial is enough to work around this issue.

This reverts commit cda20ed.

utam0k · 2021-09-07T01:11:04Z

Thank you for your investigation. Can I ask you to write a clear description of the results of this investigation in the section of the test that we are using serial other tests?

The root cause of the flakiness observed in the hook, clean up file descriptors, and channel tests are the result of running the tests in parallel in the same process by cargo test. Many of the operations have side effects on the underlying OS and the process, such as closing down fds. These tests also depends on the underlying OS state such as file descriptor states. The best we can do is to mark these tests as serial and properly clean up any left over resources such as opened fds before the test ends. If the underlying OS decided to generate errors for some of the IO calls we are doing, then there is not much we can do. Not sure if retry is a valid solution here and retry generally will make the code much more complex.

yihuaf · 2021-09-07T04:32:54Z

Thank you for your investigation. Can I ask you to write a clear description of the results of this investigation in the section of the test that we are using serial other tests?

The root cause of the flakiness observed in the hook, clean up file descriptors, and channel tests are the result of running the tests in parallel in the same process by cargo test. Many of the operations have side effects on the underlying OS and the process, such as closing down fds. These tests also depends on the underlying OS state such as file descriptor states. The best we can do is to mark these tests as serial and properly clean up any left over resources such as opened fds before the test ends. If the underlying OS decided to generate errors for some of the IO calls we are doing, then there is not much we can do. Not sure if retry is a valid solution here and retry generally will make the code much more complex.

Done. I left the tty test alone because it was the original user of serial tests I think and is not part of this change.

utam0k

thanks

yihuaf added 8 commits September 7, 2021 01:04

restore some of the changes

abcd6ae

channel test parent process should call waitpid in the end. Parent should be able to wait before child process completly exits. For our purpose, rust test runner don't like multi processes. Serial is enough to work around this issue.

Revert "restore some of the changes"

eb6b4f4

This reverts commit cda20ed.

hook test no longer prints out printenv to mess up the test ooutput

3d00ac8

hook command now ignores BrokenPipe

9aaeb4f

Adds a simple script to stress test cargo test

ca5a21f

we want to know what unknown message is

fb7a8ea

Make the fd related tests serial

b821ab4

Mark hook related test serial

5b188ee

yihuaf force-pushed the yihuaf/test branch from 5e07462 to 5b188ee Compare September 6, 2021 23:05

fix fmt

08f33bf

yihuaf requested review from Furisto and utam0k September 6, 2021 23:14

Add comments to explain the use of serial

4c5d2d6

utam0k approved these changes Sep 7, 2021

View reviewed changes

utam0k merged commit f026d7e into youki-dev:main Sep 7, 2021

yihuaf deleted the yihuaf/test branch September 7, 2021 05:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stablize cargo test #277

Stablize cargo test #277

yihuaf commented Sep 6, 2021 •

edited

Loading

codecov-commenter commented Sep 6, 2021 •

edited

Loading

utam0k commented Sep 7, 2021 •

edited

Loading

yihuaf commented Sep 7, 2021 •

edited

Loading

utam0k left a comment

Stablize cargo test #277

Stablize cargo test #277

Conversation

yihuaf commented Sep 6, 2021 • edited Loading

codecov-commenter commented Sep 6, 2021 • edited Loading

Codecov Report

utam0k commented Sep 7, 2021 • edited Loading

yihuaf commented Sep 7, 2021 • edited Loading

utam0k left a comment

Choose a reason for hiding this comment

yihuaf commented Sep 6, 2021 •

edited

Loading

codecov-commenter commented Sep 6, 2021 •

edited

Loading

utam0k commented Sep 7, 2021 •

edited

Loading

yihuaf commented Sep 7, 2021 •

edited

Loading