metadata validation failing for dandiset 000971 #1958

Closed
bendichter opened this issue Jun 19, 2024 · 4 comments

Comments

@bendichter
Member

https://dandiarchive.org/dandiset/000971/draft

We are currently seeing the following errors:

[image: screenshot of the validation errors]

Looking at the metadata, I am not sure what the problem is.

  1. Is this an error in metadata validation?
  2. Is there any way we could improve these error messages to be more informative? In particular, it is not clear which contributor is failing validation here, so it is difficult to identify and fix the error for metadata that has many contributors. Could we add this information? I think I would be able to determine this from the full json-schema traceback. Maybe we could add a "details" button by each validation error which shows this traceback.

cc @pauladkisson

@waxlamp
Member

waxlamp commented Jun 19, 2024

Hey @bendichter, I agree that we need to do a lot better with these error messages.

In the current case, I also cannot find where the errors are. I'm going to look into what's going on so the specific case can be resolved (while retaining general ideas for making error reporting significantly more useful in the future).

@waxlamp
Member

waxlamp commented Jun 19, 2024

@bendichter: so it turns out that Joseph M. Bianco's entry in the contributor list has a stray comma that should be removed; this extra comma was causing the name to not comport with the regular expression. (Side note: it continues to amaze me what escapes the human eye in matters like this--I truly couldn't see that extra comma as I scanned over the list of contributors! If only there were a scientific field that studied such matters...)
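For readers hitting the same problem, here is a minimal sketch of how a stray comma can be located programmatically rather than by eye. The regular expression is an assumed "Last, First" pattern and may not be the exact one in the DANDI schema; the helper `find_bad_names`, the sample entries, and the position of the stray comma are all hypothetical.

```python
import re

# Assumed "Last, First" name pattern; the actual DANDI schema pattern may differ.
NAME_PATTERN = re.compile(r"^([\w\s\-\.']+),\s+([\w\s\-\.']+)$")


def find_bad_names(contributors):
    """Return (index, name) for each Person entry whose name fails the pattern."""
    bad = []
    for index, entry in enumerate(contributors):
        if entry.get("schemaKey") != "Person":
            continue  # only Person names are checked in this sketch
        name = entry.get("name", "")
        if not NAME_PATTERN.match(name):
            bad.append((index, name))
    return bad


# Hypothetical sample data; the trailing comma is illustrative only.
contributors = [
    {"schemaKey": "Person", "name": "Doe, Jane"},
    {"schemaKey": "Person", "name": "Bianco, Joseph M.,"},
]

for index, name in find_bad_names(contributors):
    print(f"contributor[{index}].name does not match the pattern: {name!r}")
```

Reporting the list index alongside the offending value is one way to address the "which contributor is failing" question raised at the top of this issue.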

The second and third errors result from the validator then thinking "hrm, maybe this failed validation because it's not really a Person, but an Organization", and finding errors related to that assumption (i.e., if this is an Organization then indeed the schemaKey field should read Organization, and it should have an ROR ID). This is a bug in the validation logic, since we essentially need the system to trust that the schemaKey field is always correct (and therefore, never generate errors based on other prospective validation passes). @mvandenburgh is already working on a fix for this.
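The difference between the two behaviors can be sketched as follows. This is not the actual dandi-archive validation code: the function names, the error strings, the name pattern, and the use of an `identifier` field for the ROR ID are assumptions made for illustration.

```python
import re

# Assumed "Last, First" name pattern; the actual schema pattern may differ.
NAME_PATTERN = re.compile(r"^([\w\s\-\.']+),\s+([\w\s\-\.']+)$")


def validate_person(entry):
    """Checks that apply only to Person entries (hypothetical)."""
    errors = []
    if not NAME_PATTERN.match(entry.get("name", "")):
        errors.append("name does not match the 'Last, First' pattern")
    return errors


def validate_organization(entry):
    """Checks that apply only to Organization entries (hypothetical)."""
    errors = []
    if not entry.get("identifier"):
        errors.append("Organization requires a ROR identifier")
    return errors


def validate_try_all(entry):
    """Roughly the behavior described above: after the Person check fails,
    the entry is also judged as an Organization, adding unrelated errors."""
    errors = validate_person(entry)
    if errors:
        if entry.get("schemaKey") != "Organization":
            errors.append("schemaKey should be 'Organization'")
        errors.extend(validate_organization(entry))
    return errors


VALIDATORS = {"Person": validate_person, "Organization": validate_organization}


def validate_by_schema_key(entry):
    """Trust schemaKey and run only the matching validator."""
    key = entry.get("schemaKey")
    validator = VALIDATORS.get(key)
    if validator is None:
        return [f"unknown schemaKey: {key!r}"]
    return validator(entry)


# Hypothetical entry with a stray trailing comma in the name.
entry = {"schemaKey": "Person", "name": "Bianco, Joseph M.,"}
print(validate_try_all(entry))        # three errors, two of them misleading
print(validate_by_schema_key(entry))  # only the name error
```

Dispatching on `schemaKey` keeps the report limited to errors for the type the entry claims to be, which is the behavior the fix is meant to achieve.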

If you fix the contributor entry and the validation errors go away, I'll close this issue in favor of one that addresses the need for better error messages in general. Mike and I collected some intel that will help and I will post it there.

@bendichter
Member Author

Thanks for tracking this down, @waxlamp! You are not alone: @pauladkisson and I both missed this too!

@pauladkisson

Thank you @waxlamp! Metadata validation is now passing.
