Run all tests in the `nix-shell`; eliminate docker infrastructure #379

spencerkclark · 2023-08-28T13:59:51Z

This PR refactors the build infrastructure in this repo to eliminate the need for the Docker component. All development and testing is now done in the nix shell. This should be a quality of life improvement for anyone developing the fortran model, as it no longer requires maintaining checksums in two separate build environments.

In so doing it introduces the following changes:

New make rules are provided for compiling the model in different modes:
- build -- build executables in repro (our production mode) and debug mode.
- build_repro -- build only the repro mode executable.
- build_debug -- build only the debug mode executable.
Tests are run with each of the executables available in the local bin directory, and are tagged with the associated compile mode.
An option, check_layout_invariance, is provided to trigger regression tests be run with a 1x2 domain decomposition instead of a 1x1 domain decomposition to check invariance to the domain decomposition layout; this is used for the all the coarse-graining regression tests and replaces the previous test_run_reproduces_across_layouts test that would run in the docker image.
debug-mode and repro-mode simulations produce different answers, which is something we noticed in Bump base image to one with Ubuntu version 20.04 #364 when upgrading compiler versions as well, and so require different reference checksums.

In working on this PR, we ran the fortran model in debug mode in more contexts than we had previously, some of which turned up errors, which we currently work around by using pytest.skip (something we had implicitly already been doing before):

Working on this PR also brought my attention to the fact that pytest's tmpdir fixture does not automatically get cleaned up after each test; pytest versions older than 7.3.0 keep around directories from the last three runs of pytest, which fill up disk space quickly since running these tests requires creating 10's of run directories, each with their own initial conditions and input files (#380). For the time being I manually clean up these run directories after successful tests.

Resolves #340.

spencerkclark · 2023-08-29T14:10:41Z

FV3/gfsphysics/physics/samfdeepcnv.f

These changes are needed to fix debug-mode failures when running with the TKE-EDMF scheme active; they do not change answers. See discussion in #364 (comment) for more context.

spencerkclark · 2023-08-30T19:04:09Z

After this PR is reviewed I will update the required checks to:

Minimal fortran and wrapper tests in repro mode
Minimal fortran tests in debug mode

Makefile

tests/pytest/_regtest_outputs/test_regression.test_regression[debug-default.yml-False].out

brianhenn

Thanks @spencerkclark things worked well for me. Happy to approve if you like or can wait for another set of eyes.

README.md

tests/pytest/test_regression.py

brianhenn

Thanks @spencerkclark

This PR fixes errors in test skipping logic introduced in #379 (see [this CI run](https://app.circleci.com/pipelines/github/ai2cm/fv3gfs-fortran/2682/workflows/e71d0119-0c60-44c7-bf4f-d97f10ccbb0b/jobs/5541)). In that PR these particular tests were configured only to be attempted to be run upon merges to master, because they would be skipped anyway. This PR also now ensures that we at least attempt to run the emulation tests in debug mode in CI when the developer requests to, since it will exercise the skipping logic before merging to master (upon which all pure fortran tests are attempted to be run in both repro and debug mode). [This CI job](https://app.circleci.com/pipelines/github/ai2cm/fv3gfs-fortran/2688/workflows/0c3ca3a2-3b4d-46d6-8cca-a65e1b9a5a0f/jobs/5557) confirms that this PR worked as intended.

I noticed this when looking over this file earlier, and it would be good to correct for maximum test coverage. This was introduced in #379, so it has not been present for long.

spencerkclark marked this pull request as draft August 28, 2023 14:00

spencerkclark force-pushed the nix-debug-mode branch 3 times, most recently from 8c75df4 to ce95f2d Compare August 28, 2023 14:36

spencerkclark mentioned this pull request Aug 28, 2023

Model does not restart reproducibly in debug mode when initially starting from GFS initial conditions #381

Open

spencerkclark force-pushed the nix-debug-mode branch from 5d09131 to ea629f5 Compare August 28, 2023 20:38

spencerkclark commented Aug 29, 2023

View reviewed changes

spencerkclark force-pushed the nix-debug-mode branch 4 times, most recently from 3f048ce to cb4185c Compare August 30, 2023 16:52

spencerkclark marked this pull request as ready for review August 30, 2023 16:57

spencerkclark force-pushed the nix-debug-mode branch from cb4185c to 1e619a4 Compare August 30, 2023 17:37

spencerkclark added 6 commits August 30, 2023 17:56

Remove Docker infrastructure and related tests

3420dbb

Remove need for conftest.py and --native option

109a1b4

Add ability to run tests with debug mode executable

862f9ea

Restore tests of layout invariance

d6716d2

Update CircleCI tests to exercise debug mode code

b74d605

Update relevant README files

50de16b

spencerkclark force-pushed the nix-debug-mode branch from 1e619a4 to 50de16b Compare August 30, 2023 17:57

brianhenn reviewed Aug 31, 2023

View reviewed changes

Makefile Show resolved Hide resolved

brianhenn reviewed Aug 31, 2023

View reviewed changes

Makefile Show resolved Hide resolved

brianhenn reviewed Aug 31, 2023

View reviewed changes

tests/pytest/_regtest_outputs/test_regression.test_regression[debug-default.yml-False].out Show resolved Hide resolved

brianhenn reviewed Aug 31, 2023

View reviewed changes

README.md Show resolved Hide resolved

tests/pytest/test_regression.py Show resolved Hide resolved

Update README per review comments

8c2831b

spencerkclark force-pushed the nix-debug-mode branch from 0637993 to 8c2831b Compare August 31, 2023 20:36

brianhenn approved these changes Sep 7, 2023

View reviewed changes

spencerkclark merged commit adafc50 into master Sep 7, 2023

spencerkclark deleted the nix-debug-mode branch September 7, 2023 18:56

spencerkclark mentioned this pull request Sep 7, 2023

Fix test skipping logic in debug-mode emulation tests #382

Merged

This was referenced Sep 14, 2023

Bump base image to one with Ubuntu version 20.04 #364

Closed

Fix typo in test_regression.py #383

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run all tests in the `nix-shell`; eliminate docker infrastructure #379

Run all tests in the `nix-shell`; eliminate docker infrastructure #379

spencerkclark commented Aug 28, 2023 •

edited

Loading

spencerkclark Aug 29, 2023

spencerkclark commented Aug 30, 2023

brianhenn left a comment

brianhenn left a comment

Run all tests in the nix-shell; eliminate docker infrastructure #379

Run all tests in the nix-shell; eliminate docker infrastructure #379

Conversation

spencerkclark commented Aug 28, 2023 • edited Loading

spencerkclark Aug 29, 2023

Choose a reason for hiding this comment

spencerkclark commented Aug 30, 2023

brianhenn left a comment

Choose a reason for hiding this comment

brianhenn left a comment

Choose a reason for hiding this comment

Run all tests in the `nix-shell`; eliminate docker infrastructure #379

Run all tests in the `nix-shell`; eliminate docker infrastructure #379

spencerkclark commented Aug 28, 2023 •

edited

Loading