refactor/fix: store dists in parse_requirements output #1917

aignas · 2024-05-23T07:51:27Z

This moves some of the code out of the pip.bzl extension and changes
the layout of the code to prepare for multi-platform whl support.

Summary:

parse_requirements: add whls and sdists attribute, so that we can use
a function to populate the lists. Not sure if there is a better way to
do this.
parse_requirements: add an extra code to ensure that we are handling
the target platform filtering correctly.
select_whl: split the select_whl into select_whls, which filters
the whls (this can be used later in multi-platform selects) and
select_whl , which just is used get the most appropriate whl for the
host platform.
Additionally fix the logic in select_whl, which would result in
Python 3.12 wheels being selected on Python 3.11 interpreters because
we were not taking into account the interpreter tag when doing the
filtering.

Fixes #1930

This means that we can have a function that then later adds data to the parse_requirements return result.

… of a single candidate

… marker

dougthor42 · 2024-05-30T16:09:56Z

I tested #1930 this with the latest commit 552c526. Got error:

ERROR: Traceback (most recent call last):
        File "/usr/local/google/home/dthor/.cache/bazel/_bazel_dthor/dbe74c4144b5c9a438d84a119652bef9/external/rules_python~/python/private/bzlmod/pip.bzl", line 444, column 30, in _pip_impl
                _create_whl_repos(module_ctx, pip_attr, hub_whl_map, whl_overrides, hub_group_map, simpleapi_cache)
        File "/usr/local/google/home/dthor/.cache/bazel/_bazel_dthor/dbe74c4144b5c9a438d84a119652bef9/external/rules_python~/python/private/bzlmod/pip.bzl", line 197, column 37, in _create_whl_repos
                parse_requirements_add_dists(
        File "/usr/local/google/home/dthor/.cache/bazel/_bazel_dthor/dbe74c4144b5c9a438d84a119652bef9/external/rules_python~/python/private/parse_requirements_add_dists.bzl", line 61, column 31, in parse_requirements_add_dists
                whls = select_whls(
        File "/usr/local/google/home/dthor/.cache/bazel/_bazel_dthor/dbe74c4144b5c9a438d84a119652bef9/external/rules_python~/python/private/whl_target_platforms.bzl", line 172, column 27, in select_whls
                _, _, whl = sorted(any_whls)[-1]
Error in sorted: unsupported comparison: struct <=> struct
ERROR: Analysis of target '//src/pyle/dataking/system_optimization/snake_optimizer:snake_metrics_test' failed; build aborted: error evaluating module extension pip in @@rules_python~//python/extensions:pip.bzl

With this we should not get failures, but we would get different wheels, which from the user point of view might be better experience. I have also rewrote the tests to better test things and be more succinct. The complex thing here is that we want to do some filtering of the whls but we should not do too much a we are doing what the select should be doing.

dougthor42 · 2024-05-31T02:52:22Z

With 348ca06 it looks like the credential helper is no longer working:

===== stdout start =====                                                                                                                                                                                                                      
Looking in indexes: https://[redacted]/simple, https://pypi.python.org/simple                                                                                                                                       
User for us-west2-python.pkg.dev:                                                                                                                                                                                                             
===== stdout end =====

I would only see this prompt if my cred helper script was broken or I remove common --credential_helper=... from my .bazelrc.

aignas · 2024-05-31T04:02:35Z

Well credential helper not working is weird, this is definitely unintended. I thought I haven't touched the code there.

@dougthor42, could you try bazel clean to ensure that we have no PyPI index cache?

dougthor42 · 2024-05-31T04:46:30Z

For the record, all I'm doing to test these things is changing the git_override commit in my MODULE.bazel, followed by bazel build //... and if that passes, bazel test //.... The 8791cbb commit below is the commit that I've been doing all my bazel-ifying with and it works wonderfully.

git_override(
    module_name = "rules_python",
    commit = "8791cbbaa2336e24555a6b577ed6e19df24d7d88",
    remote = "https://github.com/bazelbuild/rules_python",
)

git_override(
    module_name = "rules_python_gazelle_plugin",
    commit = "8791cbbaa2336e24555a6b577ed6e19df24d7d88",
    remote = "https://github.com/bazelbuild/rules_python",
    strip_prefix = "gazelle",
)

At first I thought there might be something else between 8791cbb..348ca06 that caused the cred helper to fail, but I didn't have the issue in the previous commit 552c526 so it was definitely a change in 348ca06.

I forgot to mention it, but going from 8791cbb to 348ca06 also caused a bunch of debug statements to show up for various packages. Example:

DEBUG: /usr/local/google/home/dthor/.cache/bazel/_bazel_dthor/dbe74c4144b5c9a438d84a119652bef9/external/rules_python~/python/private/bzlmod/pip.bzl:285:22: WARNING: falling back to pip for installing the right file for tensorboard==2.16.2     --hash=sha256:9f2b4e7dad86667615c0e5cd072f1ea8403fc032a299f0072d6f74855775cc45

run bazel clean

Sure. Same result for 348ca06 (debug messages and credential helper failure)

For 0261471:

no debug message during build 🎉
No credential helper failure during build 🎉

I'm running test now, but a fresh test can take like 30 minutes (which is why we need to bazel-ify our code!)

aignas · 2024-05-31T05:27:45Z

Ah, yeah, the DEBUG statements is what I was looking for, but was not sure if they were present. Thanks for debugging, really appreciate it, I must have broken something here.

aignas · 2024-05-31T06:49:38Z

Added extra params to pip.parse:

    quiet = False,
    verbosity = "TRACE",

You can select the verbosity from INFO, DEBUG and TRACE to better get an understanding why the wheels are getting dropped.

Maybe you could later upload some of the trace/debug logs if they are not sensitive.

rickeylev

LGTM, I think. Nothing in particular stuck out to me as needing significant changes.

re: conversation about cred helper failing: I don't see why this PR would cause that. Maybe the index-url vs extra-index-url? IIRC, Doug has a private index, so maybe that is getting mixed up somewhere, somehow.

rickeylev · 2024-05-31T06:21:57Z