Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DPLA bulk data comparison dependent on ingestion version #381

Open
ghukill opened this issue Feb 28, 2019 · 2 comments
Open

DPLA bulk data comparison dependent on ingestion version #381

ghukill opened this issue Feb 28, 2019 · 2 comments

Comments

@ghukill
Copy link
Contributor

ghukill commented Feb 28, 2019

As Michigan moved from Ingestion1 --> Ingestion3, looks as though format of bulk data written to S3 has changed (unsurprisingly).

Not acting on for now, but worth noting.

@antmoth
Copy link
Collaborator

antmoth commented Jun 21, 2019

It looks like this should be something we can address, at the point at which it becomes important enough to sink time into.

I assume this is relevant: https://digitalpubliclibraryofamerica.atlassian.net/wiki/spaces/TECH/pages/5931056/Database+export+files

@ghukill ghukill changed the title DPLA bulk data downloader dependent on ingestion version DPLA bulk data comparison dependent on ingestion version Jul 31, 2019
@antmoth antmoth added the Hard label Jul 31, 2019
@antmoth
Copy link
Collaborator

antmoth commented Jul 31, 2019

Because this relies on the record_id being the same, this may need to be rethought. Also, how valuable it is depends on how valuable other hubs would consider it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants