Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement retrieving more than 10k PMIDs and all metadata #1424

Merged
merged 3 commits into from
Nov 24, 2023

Conversation

bgyori
Copy link
Member

@bgyori bgyori commented Oct 26, 2023

This PR adds a simple wrapper around PubMed's edirect CLI (https://www.ncbi.nlm.nih.gov/books/NBK179288/) to retreive PMIDs such that we can easily get all PMIDs for queries that return more than 10k results (this turns out to be very convoluted to solve with the REST API even if not entirely impossible).

It also adds a wrapper around a function for getting metadata to allow for a single function call above the limit of 200 PMIDs per metadata request call.

@bgyori bgyori changed the title Implement retrieving more than 10k PMIDs Implement retrieving more than 10k PMIDs and all metadata Nov 24, 2023
@bgyori bgyori merged commit a3e6984 into sorgerlab:master Nov 24, 2023
2 checks passed
@bgyori bgyori deleted the pubmed_cli branch December 31, 2023 03:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants