New dedupe in 5.0.0 prevents multiple rule violations to be reported from the same source range #920

disposedtrolley · 2020-01-16T03:41:47Z

Describe the bug
We've recently upgraded to Spectral 5.0.0 and have noticed that multiple rule violations for the same range in the code are no longer reported. We have some rules which target a broad block of the OpenAPI JSON (i.e. a map of objects), and we delegate our custom rule function to iterate over the map and return an array of rule violations. All of these violations would point to the same source file, column, and line number.

Looking through the PR history, I see that a fingerprinting function has been added which attempts to eliminate duplicate rule violations being reported. The function only seems to use the code, path, range, and source attributes of the rule to compute a fingerprint, not the error message itself. This means that if we return an array of rule violations from custom functions, only the first element will be rendered to the user.

To Reproduce

Given an OpenAPI file with multiple paths
Write a custom rule which targets $.paths
Attach the rule to a custom function which iterates over paths and returns an array of rule violations
Observe that only the first rule violation in the returned array is reported to the user

Expected behavior
All rule violations should be reported.

Environment (remove any that are not applicable):

Library version: 5.0.0
OS: macOS 10.15.2

Additional context
I know that targeting a broad block of JSON for rules isn't optimal, but there may be some cases where we need more complex logic than what the JSONPath syntax can offer to run our rules. I think it might be beneficial to:

modify the dedupe function to account for the error message when computing the fingerprint.
update the custom rules section of the docs to strongly recommend providing JSONPath expressions which target individual fields only

P.S. thanks for all the hard work that went into 5.0.0! We're seeing much better performance now.

/cc @nogates @davidlopezre @magnetikonline @adoragoh just something to watch out for when writing Spectral rules :)

The text was updated successfully, but these errors were encountered:

nulltoken · 2020-01-19T12:38:25Z

modify the dedupe function to account for the error message when computing the fingerprint.

@P0lip A proposal for this already exists in #913 and is being discussed with @marbemac (cf. https://github.com/stoplightio/spectral/pull/913/files#r368241693)

nulltoken · 2020-01-20T22:32:37Z

@disposedtrolley Could you please provide us with a simple repro case that would mimic your issue?

disposedtrolley · 2020-01-21T22:25:50Z

Here you go.

If you extract the zip and run spectral lint openapi.yaml -r rules.yaml in the extracted directory, you should see that only one of the two rule violations returned by custom-func in the this-rule-does-not-work rule is actually displayed.

spectral_issue.zip

nulltoken · 2020-02-08T16:37:57Z

@disposedtrolley Sorry the for the delay.

Thanks for the repro.

module.exports = targetVal => {
	return [{message: "problem 1"}, {message: "problem 2"}]
}

Although this might actually win the prize for the "minimal repro case" poster child contest 😉 , it doesn't provide a lot to work with. I'm especially missing the context of what you're trying to achieve from a functional standpoint.

You're completely right that the fingerprinting strips out the second problem. It bears the same code, and is located at the same range.

I've hit a similar issue while working on #913 and eventually found out that you can refine the path (thus generating a different range) while reporting from the function (cf. https://github.com/stoplightio/spectral/blob/develop/src/functions/typedEnum.ts#L34-L39).

The function is given a first path pointing at the root of the object to consider and eventually returns more precise locations pointing at inner parts of the considered object. That made sense with regards to what this typedEnum function was processing (examining an enum and identifying offending values of that enum).

Wouldn't that help you solve your issue, could you please provide a bit more data with regards to your own context?

disposedtrolley · 2020-02-08T20:48:08Z

I'm especially missing the context of what you're trying to achieve from a functional standpoint.

Haha sorry! Let me try to elaborate.

One of the rules we have ensures that a specific extension property is either present in the root, or in all path definitions. The custom function we have targets the entire OAS ($) as it needs context around what exists in various parts of the OAS. The function may return more than one error message depending on where the extension property was found and not found.

nulltoken · 2020-02-09T22:31:59Z

@disposedtrolley Thanks for the explanation.

I believe on way to cope with your requirements, considering the current fingerprinting behavior would be to alter your function. One proposal could be:

Scan all paths definitions. If no override exists in those, nor at the root level, raise an error without specifying an inner path
If it's only specified at the root, then all is good.
If it's specified in all path definitions and nothing is set at the root level, then all is good as well.
If it's specified in some path definitions and nothing is set at the root level, then raise an error for each path definition which hasn't been decorated, specifying the inner path of each path definition.

At his stage, it's specified at the root and, at least, in one path definition.

I'd go with reporting the root being specified where it shouldn't be (without specifying an inner path). This approach will require two passes of run and fixes to get the spec cleaned up (the first one to get the root dropped, the second one to clean all the paths without any decoration)

Of course, there are certainly other ways to implement that 😉

Although the current fingerprinting implementation requires a bit more work, in the end it compels us to write more precise error reporting, eventually providing the user with a better experience with proper locations for each issues.

disposedtrolley · 2020-02-11T19:33:17Z

Thanks for the suggestions @nulltoken!

We’re definitely looking at changing how we process rules and realise that it’s an edge case (even for us). I’m leaning towards the two pass solution.

Have the docs been updated to reflect these changes? Maybe a warning that targeting a broad block of the OAS isn’t such a good idea anymore? Otherwise happy to get this one closed out!

nulltoken · 2020-03-25T20:06:41Z

Have the docs been updated to reflect these changes? Maybe a warning that targeting a broad block of the OAS isn’t such a good idea anymore? Otherwise happy to get this one closed out!

@disposedtrolley I've just opened #1035 to try and document this. Feedback woud be welcome 😄

disposedtrolley · 2020-03-27T00:35:27Z

@nulltoken looks great! Thanks for keeping the docs fresh :D

disposedtrolley added the t/bug Something isn't working label Jan 16, 2020

disposedtrolley changed the title ~~New dedupe in 5.0.0 prevents multiple rule violations to be returned for the same range~~ New dedupe in 5.0.0 prevents multiple rule violations to be reported from the same source range Jan 16, 2020

P0lip self-assigned this Jan 17, 2020

nulltoken mentioned this issue Jan 20, 2020

feat: new rule to detect enum value that do not respect specified type #913

Merged

4 tasks

This was referenced Mar 24, 2020

Can't show more than one message from a custom function, even though the response is an array #1030

Closed

chore(doc): how to return multiple results from a function #1035

Merged

nulltoken assigned nulltoken and unassigned P0lip Mar 25, 2020

nulltoken closed this as completed Mar 27, 2020

radicarl mentioned this issue Aug 3, 2020

Returning multiple rule violations on same path #1299

Closed

ifoukarakis mentioned this issue Jun 12, 2021

Only first message displayed for custom function handling external $ref files. #1671

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New dedupe in 5.0.0 prevents multiple rule violations to be reported from the same source range #920

New dedupe in 5.0.0 prevents multiple rule violations to be reported from the same source range #920

disposedtrolley commented Jan 16, 2020

nulltoken commented Jan 19, 2020

nulltoken commented Jan 20, 2020

disposedtrolley commented Jan 21, 2020

nulltoken commented Feb 8, 2020

disposedtrolley commented Feb 8, 2020

nulltoken commented Feb 9, 2020

disposedtrolley commented Feb 11, 2020

nulltoken commented Mar 25, 2020

disposedtrolley commented Mar 27, 2020

New dedupe in 5.0.0 prevents multiple rule violations to be reported from the same source range #920

New dedupe in 5.0.0 prevents multiple rule violations to be reported from the same source range #920

Comments

disposedtrolley commented Jan 16, 2020

nulltoken commented Jan 19, 2020

nulltoken commented Jan 20, 2020

disposedtrolley commented Jan 21, 2020

nulltoken commented Feb 8, 2020

disposedtrolley commented Feb 8, 2020

nulltoken commented Feb 9, 2020

disposedtrolley commented Feb 11, 2020

nulltoken commented Mar 25, 2020

disposedtrolley commented Mar 27, 2020