fix: make bootstrap_impl=script compute correct directory when RUNFILES_MANIFEST_FILE set #2177

scasagrande · 2024-09-03T16:19:24Z

The script-based bootstrap wasn't computing the correct runfiles directory when
RUNFILES_MANIFEST_FILE was set. The path it computed stripped off the manifest
file name, but didn't re-add the .runfiles suffix to point to the runfiles
directory.

To fix, just re-append the .runfiles suffix after it removes the manifest file
name portion of the path.

Reproducing this is a bit tricky and it's difficult to reproduce the necessary build
flags in a test; all of the following must be met:

--enable_runfiles=false, but this cannot be set by transitions, only via command line
--build_runfile_manifests=true (this can be set in a transition, but see below)
Due to RUNFILES_MANIFEST_FILE is unset inside the sandbox bazel#7994, even if a manifest is created,
the RUNFILES_MANIFEST_FILE env var won't be set unless the test strategy is local
(i.e. not sandboxed, which is the default).

To work around those issues, the test just recreates the necessary envvar state and
invokes the binary. The underlying files may not exist, but that's OK for the code
paths were testing.

Fixes #2186

aignas · 2024-09-04T01:23:05Z

You can add a test in https://github.com/bazelbuild/rules_python/tree/main/tests/base_rules

scasagrande · 2024-09-04T05:15:33Z

Okay I've been looking through the existing tests and I must admit I am a little lost here.

I took a look at tests/bootstrap_impls and I see the we already have a test for a target having a data dependency on a py_binary target: https://github.com/bazelbuild/rules_python/blob/main/tests/bootstrap_impls/BUILD.bazel#L47

However my change is handling the case where RUNFILES_DIR is not defined. Instead the bootstrap script determines the runfiles dir based on RUNFILES_MANIFEST_FILE, which should be a sibling to the .runfiles folder.

Can you help me out with what you'd want to see for testing this?

scasagrande · 2024-09-04T05:34:50Z

Maybe my first step is just to make a separate minimal reproduction of the issue, and then we can boil that down into a test.

I'll see what I can do there, but I also just fixed my original issue another way. My scenario is as follows:

bazel run rust_binary target -> dep on rust_library -> data dep on py_binary

py_binary is called essentially like this

    let rf = runfiles::Runfiles::create()?;

    let path = runfiles::rlocation!(
        rf,
        [
            "_main",
            "path",
            "to",
            "target"
        ]
        .iter()
        .collect::<PathBuf>()
    );

    let output = std::process::Command::new(path)
        .env_remove("PYTHONPATH")
        .output()?;

Which results in the error. If I use my patch, then it works. Or if I modify my process call as follows, then it also works:

    let output = std::process::Command::new(path)
        .env_remove("PYTHONPATH")
        .env_remove("RUNFILES_MANIFEST_FILE")
        .output()?;

aignas · 2024-09-04T13:24:29Z

I think the second code snippet is the correct fix and no changes are needed to rules_python.

scasagrande · 2024-09-04T13:31:15Z

okay, no worries :)

although perhaps some tests are needed in this area anyways. If I can make this change and not impact any existing tests it makes me believe that there is a coverage gap

rickeylev · 2024-09-04T20:20:35Z

My initial read here is that there is a legit bug. Does this only happen with bootstrap=script and not bootstrap=system_python, or does it happen with both? The part that catches my eye is that there's an absolute local file system path to something in runfiles, but it doesn't have the binary.runfiles path component -- that seems clearly wrong.

For binaries-nested-in-binaries its hard to tell whether the problem is on the caller or callee side. In general you shouldn't have to modify the environment when calling the binary in order for it to work.

The expectation is, given e.g. a test with a binary in data, the test can invoke the binary and have it Just Work for both bazel test :outer and bazel build :outer; bazel-bin/outer. (I use a test--data-->bin as an example, any executable--data-->executable should behave similarly)

Under the hood, it's assumed that the two binary's runfile trees are merged. Hence the inner binary will find the runfiles manifest created by the outer binary, which is OK, because that manifest is the union of both binaries. Similarly, the inner binary is going to find the outer binary's runfiles directory (e.g. bazel-bin/outer.runfiles) and identify that as the runfiles root to use (because there is only one runfiles root).

scasagrande · 2024-09-04T20:33:02Z

Does this only happen with bootstrap=script and not bootstrap=system_python, or does it happen with both?

only with script

For binaries-nested-in-binaries its hard to tell whether the problem is on the caller or callee side. In general you shouldn't have to modify the environment when calling the binary in order for it to work.

agreed all around.

The expectation is, given e.g. a test with a binary in data, the test can invoke the binary and have it Just Work

agreed

aignas · 2024-09-05T14:02:28Z

Thank you @rickeylev for raising #2186.

Looking at this again with a fresh set of eyes, it does look like this is a
correct fix, but it would be also good to add a similar fix on line 50. It
does seem that all of the other code has .runfiles suffix in the echo
statements and these two were missing.

rickeylev

LGTM. I also added a test.

scasagrande · 2024-09-05T16:35:09Z

double checked your test locally, and verified that when I remove the fix from this PR that the test correctly errors with the same "unable to find python3" error:

==================== Test output for //tests/bootstrap_impls:run_binary_bootstrap_script_zip_no_test:
env: /private/var/tmp/_bazel_steven/e3e538e5f92c867c7a03c82bec7d4cfa/sandbox/darwin-sandbox/51/execroot/_main/bazel-out/darwin_arm64-fastbuild/bin/tests/bootstrap_impls/run_binary_bootstrap_script_zip_no_test/_main~python~python_3_11_aarch64-apple-darwin/bin/python3: No such file or directory
Test case failed: using RUNFILES_MANIFEST_FILE with output manifest
expected output to match: Hello
but got:\n

thank you for adding a test to cover this!

Fix finding runfiles dir

071b86d

scasagrande requested review from rickeylev and aignas as code owners September 3, 2024 16:19

scasagrande closed this Sep 4, 2024

rickeylev mentioned this pull request Sep 5, 2024

bootstrap_impl=script doesn't handle RUNFILES_MANIFEST_FILE #2186

Closed

scasagrande reopened this Sep 5, 2024

scasagrande and others added 4 commits September 5, 2024 10:11

Expand fix to cover additional case

42967f4

add test

5f1230c

fixup! add test

fa27b7f

fixup! fixup! add test

c23a174

rickeylev approved these changes Sep 5, 2024

View reviewed changes

rickeylev changed the title ~~fix: bootstrap template runfiles dir~~ fix: make bootstrap_impl=script compute correct directory when RUNFILES_MANIFEST_FILE set Sep 5, 2024

rickeylev added this pull request to the merge queue Sep 5, 2024

rickeylev removed this pull request from the merge queue due to a manual request Sep 5, 2024

rickeylev added 2 commits September 5, 2024 09:54

update changelog

5888ed3

fixup! update changelog

53862d9

rickeylev enabled auto-merge September 5, 2024 16:55

rickeylev added this pull request to the merge queue Sep 5, 2024

Merged via the queue into bazelbuild:main with commit 65d1326 Sep 5, 2024
4 checks passed

scasagrande deleted the fix/boostramp-template-runfiles-py-binary branch September 5, 2024 17:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: make bootstrap_impl=script compute correct directory when RUNFILES_MANIFEST_FILE set #2177

fix: make bootstrap_impl=script compute correct directory when RUNFILES_MANIFEST_FILE set #2177

scasagrande commented Sep 3, 2024 •

edited by rickeylev

Loading

aignas commented Sep 4, 2024

scasagrande commented Sep 4, 2024 •

edited

Loading

scasagrande commented Sep 4, 2024

aignas commented Sep 4, 2024

scasagrande commented Sep 4, 2024

rickeylev commented Sep 4, 2024

scasagrande commented Sep 4, 2024

aignas commented Sep 5, 2024

rickeylev left a comment

scasagrande commented Sep 5, 2024

fix: make bootstrap_impl=script compute correct directory when RUNFILES_MANIFEST_FILE set #2177

fix: make bootstrap_impl=script compute correct directory when RUNFILES_MANIFEST_FILE set #2177

Conversation

scasagrande commented Sep 3, 2024 • edited by rickeylev Loading

aignas commented Sep 4, 2024

scasagrande commented Sep 4, 2024 • edited Loading

scasagrande commented Sep 4, 2024

aignas commented Sep 4, 2024

scasagrande commented Sep 4, 2024

rickeylev commented Sep 4, 2024

scasagrande commented Sep 4, 2024

aignas commented Sep 5, 2024

rickeylev left a comment

Choose a reason for hiding this comment

scasagrande commented Sep 5, 2024

scasagrande commented Sep 3, 2024 •

edited by rickeylev

Loading

scasagrande commented Sep 4, 2024 •

edited

Loading