gix-diff make_diff_repo test fixture archive is always regenerated #1440

EliahKagan · 2024-07-04T04:09:52Z

Current behavior 😯

Overview

The generated archive gix-diff/tests/fixtures/generated-archives/make_diff_repo.tar is always regenerated on non-Windows platforms--or at least on Ubuntu and macOS--when any of the following tests in gix-diff/tests/tree/mod.rs is run, as confirmed by running them individually and then running git status:

many_different_states
maximal_difference
interesting_rename
interesting_rename_2

Those tests use the changes::to_obtain_tree::db function, which uses that fixture. But they are not the only tests in the file that call that function. These tests do not produce a change to any tracked file, and therefore either do not regenerate gix-diff/tests/fixtures/generated-archives/make_diff_repo.tar, or regenerate it with the exact same contents:

many_different_states_nested
maximal_difference_nested

I have verified that this happens, and that the above description holds, both on Ubuntu 22.04 LTS and on macOS.

History

More broadly, this is a long-standing situation, though most past information is less specific (I did not previously investigate which tests produced the problem). As described in a4bb7b9 (#1435), it precedes 55c635a (#1425) and dcab79a (#1415). My attempt to fix it in a4bb7b9 (#1435) was also unsuccessful both for Ubuntu and macOS (that commit updated other generated archives, but not this one, even though it tried to).

After that, you tried to update it in e6c8cd6, as described in #1435 (review). When I found this was also not successful, I decided to investigate further, and this issue is the outcome of that research.

The cause

I think I know the cause, but it seems odd that this seems not to have happened anywhere else in the project, and also I am not sure what the best approach to fixing it is. The four non-"nested" tests cause the fixture script to be called with different command-line arguments from the two "nested" tests. I believe that, whichever happens to run the fixture last--which is in general nondeterministic since tests are usually run in parallel--saves its archive.

To show the distinction in the arguments that are used, first consider the implementation of the db function:

https://github.com/Byron/gitoxide/blob/e2e8cf9206058365be50ea899cc4dcc22e923bef/gix-diff/tests/tree/mod.rs#L17-L24

That passes command-line arguments represented by its args argument to the fixture script. The four non-"nested" test cases call db with an args argument of None:

https://github.com/Byron/gitoxide/blob/e2e8cf9206058365be50ea899cc4dcc22e923bef/gix-diff/tests/tree/mod.rs#L156-L157

https://github.com/Byron/gitoxide/blob/e2e8cf9206058365be50ea899cc4dcc22e923bef/gix-diff/tests/tree/mod.rs#L538-L539

https://github.com/Byron/gitoxide/blob/e2e8cf9206058365be50ea899cc4dcc22e923bef/gix-diff/tests/tree/mod.rs#L641-L642

https://github.com/Byron/gitoxide/blob/e2e8cf9206058365be50ea899cc4dcc22e923bef/gix-diff/tests/tree/mod.rs#L674-L675

In contrast, the two "nested" test cases call db with an args argument of ["a"].iter().copied():

https://github.com/Byron/gitoxide/blob/e2e8cf9206058365be50ea899cc4dcc22e923bef/gix-diff/tests/tree/mod.rs#L487-L488

https://github.com/Byron/gitoxide/blob/e2e8cf9206058365be50ea899cc4dcc22e923bef/gix-diff/tests/tree/mod.rs#L588-L589

I suspect that this is a race condition and that it may cause test failures on occasion. This suspicion is based on the general situation, taken together with my recollection of having observed occasional failures of tests in this module. But this seems rare, and unfortunately I did not record the details of those failures. It is possible that they are not related to this issue; it is also possible that I am misremembering and that the failures I observed were from tests in a different module.

Expected behavior 🤔

The committed archives should be reusable, such that running any combination of tests without setting GIX_TEST_IGNORE_ARCHIVES will use the existing archives unless a change has been made to one of the scripts that generates them since the last time regenerated archives were committed.

I think this could be fixed by modifying the script so that the test repositories whose object databases are being used are separate, or splitting it into multiple scripts, or adjusting how script and archive identities are computed and compared so that multiple archives are automatically kept in this situation, or adding the generated archive to .gitignore if saving a reusable archive is infeasible. However, the last option, in addition to possibly forgoing the performance benefit of using pre-generated archives, may not fully fix this if a race condition is part of its effect.

But I also think I am probably missing some insights into the problem and how it should be fixed, because this doesn't seem like the only area in the test suite where test fixture scripts are called in different tests with different combinations of arguments.

Git behavior

Not applicable. I don't think this corresponds closely enough to the way Git's test suite works to make a comparison.

Steps to reproduce 🕹

I don't think this depends on how the individual tests are run. To run the individual tests, I ran them in VS Code (on both the Ubuntu and macOS test systems--I don't have desktop access to macOS, but I was able to use vscode-server and SSH in). This uses cargo test behind the scenes.

I can report the specific commands that the rust-analyzer extension uses to run them of needed, and/or find an alternative approach to confirm the results. Please let me know if more information would be helpful.

What I think may be more useful to report are the specific commands I ran manually to clear things out between running the individual test cases:

git restore .
gix clean -xd -m '*generated*' -e

Although I often run gix clean -xde to get a fully clean environment, using the above command is the same as you used in #1435 (review) and also is a far better choice for investigating this kind of problem, because it avoids unnecessarily forcing recompliation. Although building again is sometimes useful in testing, such as in #1361, it should not be needed here.

The text was updated successfully, but these errors were encountered:

Byron · 2024-07-04T07:10:38Z

Thanks for the investigation!

I think the cause suggested here is perfectly on point: it creates only one archive despite arguments being used which alter the contents of the archive. This paired with the non-determinism of which test runs first and we have a race and the occasionally changing content of the tar archive.
All workarounds described here would be applicable, but I'd hope archive generation can be fixed instead, maybe by appending the arguments to the name.

Interestingly, the arguments are part of the hash, which is used when storing the data on disk, so there is no race nor a clash. However, since the unique hash is used inside of the archive, there is a clash.

Let me try to fix this real quick.

… with arguments. (#1440) Previously there was a race condition that would cause archives to be created either with or without arguments, depending on which test was run first. After its creation, they wouldn't be looked at again as on disk they would already be available in their usable form.

EliahKagan · 2024-07-04T22:44:02Z

Thanks! I've verified that this lets Ubuntu, macOS, and Windows all use, and not have to regenerate, the archive. However, it also causes a new failure on Windows when GIX_TEST_IGNORE_ARCHIVES=1.

It turns out this is due to a problem in the fixture script on Windows, which causes it to produce a repository that does not meet the expectations in the test case. This PR is thus overall an improvement on all platforms including--really, especially--Windows, because the pre-generated archive is correct for the test, which now tests what it is meant to test.

I've opened #1442 about the new failure.

… with arguments. (GitoxideLabs#1440) Previously there was a race condition that would cause archives to be created either with or without arguments, depending on which test was run first. After its creation, they wouldn't be looked at again as on disk they would already be available in their usable form.

EliahKagan mentioned this issue Jul 4, 2024

Use << rather than <<- heredoc operator #1435

Merged

Byron added the acknowledged an issue is accepted as shortcoming to be fixed label Jul 4, 2024

Byron self-assigned this Jul 4, 2024

Byron mentioned this issue Jul 4, 2024

fix test-lib archive creation #1441

Merged

Byron closed this as completed in #1441 Jul 4, 2024

EliahKagan mentioned this issue Jul 4, 2024

many_different_states fails on Windows with GIX_TEST_IGNORE_ARCHIVES=1 #1442

Closed

EliahKagan mentioned this issue Sep 11, 2024

Have CI check if we need to commit regenerated archives #1590

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gix-diff make_diff_repo test fixture archive is always regenerated #1440

gix-diff make_diff_repo test fixture archive is always regenerated #1440

EliahKagan commented Jul 4, 2024 •

edited

Loading

Byron commented Jul 4, 2024

EliahKagan commented Jul 4, 2024 •

edited

Loading

gix-diff make_diff_repo test fixture archive is always regenerated #1440

gix-diff make_diff_repo test fixture archive is always regenerated #1440

Comments

EliahKagan commented Jul 4, 2024 • edited Loading

Current behavior 😯

Overview

History

The cause

Expected behavior 🤔

Git behavior

Steps to reproduce 🕹

Byron commented Jul 4, 2024

EliahKagan commented Jul 4, 2024 • edited Loading

EliahKagan commented Jul 4, 2024 •

edited

Loading

EliahKagan commented Jul 4, 2024 •

edited

Loading