DailyMed NDC->Image File - Initial Work #318
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Resolves #309
Explanation
While #309 isn't 100% solved, I wanted to start here and treat future enhancements as separate issues to sort of start cleaning up long-standing branches.
This PR provides a way to extract 1 part of the 5 parts of zipped file of DailyMed full prescription SPL data. You can change the part number to download all 5. We need to create an enhancement issue for automating 1-5 with sequential tasks. Honestly the reason I haven't prioritized this is because my hard drive space is horribly low and I can only download one at a time anyway.
This will extract all the files and run XSLT against them to do the following:
Rationale
This gets us to around 50k NDC->image mappings. There is still work to be done to understand the true denominator of label images that are out there in order to understand the delta between where we are now and how we would need to get to 100%.
Tests