Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

More incorrect records beyond #1 #10

Closed
BrittanyBunk opened this issue Apr 4, 2020 · 4 comments
Closed

More incorrect records beyond #1 #10

BrittanyBunk opened this issue Apr 4, 2020 · 4 comments

Comments

@BrittanyBunk
Copy link

BrittanyBunk commented Apr 4, 2020

Problem

The publisher date is incorrect on multiple records past the ones with 9999 (that was figured out) were found in the data dump. I put together all the ones I could into one file.

Notes of document

  • this is an estimate of the records that should be changed, so some on here may be legitimate and some that need correction are not on this spreadsheet).
  • the first column is the OL ID, one would just add OL in front and M behind it to make it complete

Solutions

As mentioned in #1, the number of records that need a correction are almost 1/2 million. There are methods to approach it: by hand or by bot. Since I don't really know the answer, I'm opening this up so that it's known and there's a way to work on it. Not every record is incorrect, but should be evaluated. Here's how far I got with the manual process, so feel free to improve it:

@seabelis
Copy link
Collaborator

seabelis commented Apr 4, 2020

Please give more detail about what the issue is and what the expected result is.

@BrittanyBunk
Copy link
Author

@seabelis updated :)

@seabelis
Copy link
Collaborator

seabelis commented Apr 5, 2020

This is outside the scope of this repo's purpose. This is an issue that needs to be solved programatically. It's not reasonable to ask a human to sort through a half-million records manually.

@BrittanyBunk
Copy link
Author

@seabelis Cool! You don't mind putting something like that into the description of the repo? That way, people will know when they come in that bigger librarian issues that require bots would go to the main github page.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants