Make it so applying and removing patches are repeatable without errors #2502

revans2 · 2024-10-14T14:17:27Z

There are a number of issues with dealing with patches for the CUDF repo. This fixes some of them. It does not fix submodule update --init as that happens externally. It also does not fix other parts of upmerging a branch unless you have unapplied the patches before hand. Sadly it also does not provide any good way to do development work on CUDF using the submodule as your development environment.

What this does do it make it so that if you apply patches to CUDF and try to build again without removing those patches it should work.

Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>

revans2 · 2024-10-14T14:17:34Z

build

jlowe

What's here looks OK.

What about the dangerous and surprising behavior of specifying submodule.patch.skip=true as discussed in #2497 (comment)? If this is an artifact of how the validate build works, maybe we should change the name of the property to something less "inviting" for developers to try, as this won't do what they think it does.

revans2 · 2024-10-14T16:44:16Z

What's here looks OK.

What about the dangerous and surprising behavior of specifying submodule.patch.skip=true as discussed in #2497 (comment)? If this is an artifact of how the validate build works, maybe we should change the name of the property to something less "inviting" for developers to try, as this won't do what they think it does.

That is a good point. At a minimum I need to document it better. Thinking about it I am trying to understand the use case fro it. Originally I put it in as a way to skip the apply/unapply steps. But then I used it as a part of the CI process to auto upmerge CUDF, which made @gerashegalov want to lock it down so we don't accidentally build the plugin without the patches being installed. But I don't see that being the case any more. Perhaps I can move it back to what it was originally intended to be and document it better to let people know that there is a foot/gun here no matter what it does.

revans2 · 2024-10-14T17:02:40Z

build

jlowe

Minor typo, lgtm. Note that this will make the submodule sync take a bit longer because it will build everything only to rebuild it, but ccache may be able to avoid a lot of the rebuild.

I'm personally OK if we decide to keep the build-skipping as long as we update the name of the config to reflect that. submodule.patch.skip implies it's only going to skip patching submodules, not also skip all native builds (including those outside of the submodule).

pom.xml

gerashegalov · 2024-10-14T18:36:32Z

build/unapply-patches

-if [ -n "$(git status --porcelain --untracked-files=no)" ] ; then
+CHANGED_FILES=$(git status --porcelain --untracked-files=no)
+
+if [ \( -s "$FULLY_PATCHED_FILE" \) -a  \( -n "$CHANGED_FILES" \) ] ; then


I would prefer a direct test of patch applicability via git apply --check for both forward and --reverse . If it is neither of the states we are in, we can ask/warn the developer to commit changes and regenerate the patch if it is intentional

gerashegalov · 2024-10-14T18:39:03Z

build/apply-patches

+# is to save some state files about what happened. But a user could mess with CUDF directly
+# so we want to have ways to double check that they are indeed correct.
+
+FULLY_PATCHED_FILE="$CUDF_DIR/spark-rapids-jni.patch"


there may technically be a patch that applies to non-cudf portion of spark-rapids, maybe should go to the top-level dir?

Curious what's the use-case for patching outside of the submodule? I thought we would checkin the change directly rather than a patch of that change.

We only have a sole use case with the submodule patching right now. No doubt direct modifications of the source tree outside the module is preferrable. In terms of hypotheticals, there could be another submodule or some native dependency that we want consume bypassing cudf which needs patching.

Right now all patches apply only to CUDF. I don't want to try and make that more generic right now. If we do hit that situation I would rather have sub-directories under patches for each third party sub directory, and then apply/remove patches for each subdirectory separately.

ok, separate subdirs work too

Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>

revans2 · 2024-10-14T19:59:10Z

build

revans2 · 2024-10-14T21:32:27Z

@gerashegalov I added in the direct test, and you were also right that git apply is much cleaner than patch. Please take another look

gerashegalov

LGTM

NVIDIA#2502) * Make it so applying and removing patches are repeatable without errors Signed-off-by: Robert (Bobby) Evans <bobby@apache.org> * Adjust config for skipping a patch * More fixes Signed-off-by: Robert (Bobby) Evans <bobby@apache.org> --------- Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>

Make it so applying and removing patches are repeatable without errors

396eeb7

Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>

jlowe reviewed Oct 14, 2024

View reviewed changes

Adjust config for skipping a patch

3971af0

jlowe previously approved these changes Oct 14, 2024

View reviewed changes

pom.xml Outdated Show resolved Hide resolved

gerashegalov reviewed Oct 14, 2024

View reviewed changes

More fixes

0f45ab0

Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>

revans2 dismissed jlowe’s stale review via 0f45ab0 October 14, 2024 19:59

ttnghia approved these changes Oct 14, 2024

View reviewed changes

jlowe approved these changes Oct 14, 2024

View reviewed changes

gerashegalov approved these changes Oct 14, 2024

View reviewed changes

gerashegalov merged commit 2c3b60c into NVIDIA:branch-24.12 Oct 14, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make it so applying and removing patches are repeatable without errors #2502

Make it so applying and removing patches are repeatable without errors #2502

revans2 commented Oct 14, 2024

revans2 commented Oct 14, 2024

jlowe left a comment

revans2 commented Oct 14, 2024

revans2 commented Oct 14, 2024

jlowe left a comment

gerashegalov Oct 14, 2024

gerashegalov Oct 14, 2024

jlowe Oct 14, 2024

gerashegalov Oct 14, 2024

revans2 Oct 14, 2024

gerashegalov Oct 14, 2024

revans2 commented Oct 14, 2024

revans2 commented Oct 14, 2024

gerashegalov left a comment

Make it so applying and removing patches are repeatable without errors #2502

Make it so applying and removing patches are repeatable without errors #2502

Conversation

revans2 commented Oct 14, 2024

revans2 commented Oct 14, 2024

jlowe left a comment

Choose a reason for hiding this comment

revans2 commented Oct 14, 2024

revans2 commented Oct 14, 2024

jlowe left a comment

Choose a reason for hiding this comment

gerashegalov Oct 14, 2024

Choose a reason for hiding this comment

gerashegalov Oct 14, 2024

Choose a reason for hiding this comment

jlowe Oct 14, 2024

Choose a reason for hiding this comment

gerashegalov Oct 14, 2024

Choose a reason for hiding this comment

revans2 Oct 14, 2024

Choose a reason for hiding this comment

gerashegalov Oct 14, 2024

Choose a reason for hiding this comment

revans2 commented Oct 14, 2024

revans2 commented Oct 14, 2024

gerashegalov left a comment

Choose a reason for hiding this comment