Fix all XPathMatcherTest TODO's + handle nested elements with the same name #4532

DidierLoiseau · 2024-09-28T10:16:49Z

What's changed?

Fixed all existing TODO’s in `XPathMatcherTest`
Handle XML documents that contain nested elements with the same name for XPath using //
Code cleanup

What's your motivation?

Needed to use such XPaths.

Anything in particular you'd like reviewers to focus on?

I will put comments on the bits of code I cleaned up.

There are still limitations with XPaths that do not start with / or that start with //: in particular, they don’t support // in the middle, and there were no tests for that.

Have you considered any alternatives or workarounds?

I think the implementation would really still benefit from a lot of refactoring, or even a full rewrite:

There is no validation of the provided XPath
The XPath parsing is mixed with the XPath matching (e.g. extracting conditions, determining whether we are facing an attribute or a function etc.)
The implementation of XPaths starting with / is completely dissociated from those starting with // or no /, which causes different limitations for the two.

I think it would make sense to represent the XPath as a structured object instead of a String[]…
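To illustrate the idea, here is a minimal sketch of what a structured representation could look like. It is not the actual XPathMatcher code: the `XPathStepsSketch` class, its `Step` type and its `parse` method are hypothetical names, and the sketch deliberately ignores conditions in brackets and functions. It only shows how parsing and validation (empty steps, an attribute step that is not last) could be separated from matching.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not part of rewrite-xml: one object per XPath step instead of a String[].
class XPathStepsSketch {

    static final class Step {
        final boolean descendant; // true when the step was preceded by "//"
        final boolean attribute;  // true when the step was written as "@name"
        final String name;

        Step(boolean descendant, boolean attribute, String name) {
            this.descendant = descendant;
            this.attribute = attribute;
            this.name = name;
        }
    }

    // Parses simple paths such as "/a/b", "//a/b/@c" or "a/b".
    // Assumption: no conditions ("[...]") and no functions, so "/" only ever separates steps.
    static List<Step> parse(String expression) {
        List<Step> steps = new ArrayList<>();
        int pos = 0;
        while (pos < expression.length()) {
            boolean descendant = false;
            if (expression.startsWith("//", pos)) {
                descendant = true;
                pos += 2;
            } else if (expression.startsWith("/", pos)) {
                pos += 1;
            }
            int end = expression.indexOf('/', pos);
            if (end < 0) {
                end = expression.length();
            }
            String token = expression.substring(pos, end);
            if (token.isEmpty()) {
                throw new IllegalArgumentException("Empty step at position " + pos + " in " + expression);
            }
            boolean attribute = token.startsWith("@");
            steps.add(new Step(descendant, attribute, attribute ? token.substring(1) : token));
            pos = end;
        }
        // Validation happens once, at parse time: an attribute step must be the last step.
        for (int i = 0; i < steps.size() - 1; i++) {
            if (steps.get(i).attribute) {
                throw new IllegalArgumentException("Attribute step must be last in " + expression);
            }
        }
        return steps;
    }
}
```

With such a representation, the matcher would receive a list of steps that is already known to be well formed, and the same matching loop could in principle serve paths starting with /, with //, or with no slash at all.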

Any additional context

As a user, it’s not really clear what is supported and what is not without looking at the unit test…

Checklist

I've added unit tests to cover both positive and negative cases
I've read and applied the recipe conventions and best practices
I've used the IntelliJ IDEA auto-formatter on affected files

DidierLoiseau · 2024-09-28T10:20:27Z

rewrite-xml/src/main/java/org/openrewrite/xml/XPathMatcher.java

@@ -92,18 +92,8 @@ public boolean matches(Cursor cursor) {
                    if (index < 0) {
                        return false;
                    }
-                    if (part.startsWith("@")) { // is attribute selector
-                        partWithCondition = part;
-                        tagForCondition = i > 0 ? path.get(i - 1) : path.get(i);


Using i directly in path.get() was completely wrong, since path is reversed while parts is not; pathIndex should always be used.

Note that if a part starts with @, it MUST be the last part – I wanted to check it here, but then I realized all ifs actually result in the same code (and conditions matching sub-elements were not handled…).
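The index mismatch can be shown with a small standalone sketch. It does not reproduce the matcher's loop: `ReversedPathIndexSketch` and `tagForPart` are hypothetical, and the sketch assumes that the last part of the expression lines up with the innermost element, which is stored first in the reversed path.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch: parts are in expression order (outermost first),
// while the path collected from the cursor is innermost first.
class ReversedPathIndexSketch {

    // Assumes the last part aligns with reversedPath.get(0).
    static String tagForPart(String[] parts, List<String> reversedPath, int i) {
        int pathIndex = parts.length - 1 - i;
        return reversedPath.get(pathIndex);
    }

    public static void main(String[] args) {
        String[] parts = {"dependencies", "dependency", "artifactId"};
        List<String> reversedPath = Arrays.asList("artifactId", "dependency", "dependencies", "project");
        for (int i = 0; i < parts.length; i++) {
            System.out.println(parts[i]
                    + " -> via pathIndex: " + tagForPart(parts, reversedPath, i)
                    + ", via i: " + reversedPath.get(i));
        }
    }
}
```

In this example, indexing with i pairs the part "dependencies" with the element "artifactId", whereas the translated index pairs it with "dependencies".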

DidierLoiseau · 2024-09-28T10:21:33Z

rewrite-xml/src/main/java/org/openrewrite/xml/XPathMatcher.java

-                        }
-                    }
+                    partWithCondition = part;
+                    tagForCondition = path.get(pathIndex);
                } else if (i < path.size() && i > 0 && parts[i - 1].endsWith("]")) {


I don’t understand the purpose of this if. All tests still pass if I remove it, but I’m not sure whether I’m missing an edge case or it is really unneeded.

DidierLoiseau · 2024-09-28T14:24:23Z

rewrite-xml/src/main/java/org/openrewrite/xml/XPathMatcher.java

        } else {
            Collections.reverse(path);

            // Deal with the two forward slashes in the expression; works, but I'm not proud of it.
-            if (expression.contains("//") && !expression.contains("://") && Arrays.stream(parts).anyMatch(StringUtils::isBlank)) {


Removing the !expression.contains("://") check does not seem to break anything, and it fixes XPaths that contain both // and a namespace URL condition.

I don’t think the first condition is still relevant either. However, the isBlank check is inconsistent with the indexOf("") on the next line; I think it should use isEmpty(), as I did to compute tagMatchingParts.

Note that the first condition is insufficient on its own because of URLs again – or actually any value that could contain double slashes in a condition.
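Both points can be checked in isolation with plain JDK calls. The sketch below is an illustration under assumptions, not the matcher's code: `DoubleSlashCheckSketch` and its helpers are hypothetical, `isBlank` is a local stand-in for StringUtils::isBlank, and the XPath with a namespace-uri() condition is only an example value.

```java
import java.util.Arrays;

// Hypothetical sketch of the two checks discussed above.
class DoubleSlashCheckSketch {

    // Local stand-in for a blank check: true for "" and for whitespace-only strings.
    static boolean isBlank(String s) {
        return s.trim().isEmpty();
    }

    static boolean anyBlank(String[] parts) {
        return Arrays.stream(parts).anyMatch(DoubleSlashCheckSketch::isBlank);
    }

    // indexOf("") only finds parts that are exactly empty, not whitespace-only ones.
    static int firstEmptyIndex(String[] parts) {
        return Arrays.asList(parts).indexOf("");
    }

    static boolean containsDoubleSlash(String expression) {
        return expression.contains("//");
    }

    public static void main(String[] args) {
        String[] whitespacePart = {"a", " ", "b"};
        // The blank check passes, but the follow-up lookup finds nothing.
        System.out.println(anyBlank(whitespacePart) + " / " + firstEmptyIndex(whitespacePart));

        String[] fromDoubleSlash = "a//b".split("/");
        // Here both checks agree: the empty part sits between "a" and "b".
        System.out.println(anyBlank(fromDoubleSlash) + " / " + firstEmptyIndex(fromDoubleSlash));

        // No descendant axis, yet the expression contains "//" because of the URL.
        System.out.println(containsDoubleSlash("/a/b[namespace-uri()='http://example.com/ns']"));
    }
}
```

A whitespace-only part satisfies the blank check while indexOf("") returns -1, which is the inconsistency mentioned above; using an emptiness check in both places keeps the two lines in agreement.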

sambsnyd · 2024-10-04T20:09:09Z

Great stuff thanks @DidierLoiseau !

Fix all XPathMatcherTest TODO's + handle nested elements with the sam…

9f796c5

…e name

More XPath matching fixes

d86f21c

- paths with multiple occurrences of //
- // preceding @Attribute

timtebeek self-requested a review September 28, 2024 16:27

DidierLoiseau mentioned this pull request Sep 28, 2024

Expose AddOrUpdateChild as a recipe using XPath + child XML #4533

Merged

3 tasks

Merge branch 'main' into issue/xpath-double-slash

4ae3171

sambsnyd merged commit 21e73e8 into openrewrite:main Oct 4, 2024
2 checks passed

timtebeek assigned DidierLoiseau Oct 6, 2024

DidierLoiseau mentioned this pull request Oct 6, 2024

Xpath 'nodes from anywhere' + attribute selection does not work #4528

Closed

DidierLoiseau deleted the issue/xpath-double-slash branch October 16, 2024 20:42
