Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Institution/organization info via rentrez #189

Open
trilisser opened this issue May 18, 2023 · 4 comments
Open

Institution/organization info via rentrez #189

trilisser opened this issue May 18, 2023 · 4 comments

Comments

@trilisser
Copy link

trilisser commented May 18, 2023

Good day! A GenBank info page contains data on the organization made sequencing. Can it be extracted via rentrez not downloading an entire gb file with db="nuccore", rettype = "gb"?

image

Best

@allenbaron
Copy link

Not sure I understand your request. Are you just asking if you can get the journal reference for a particular record? I don't think that's possible but you can get the whole record with entrez_fetch().

To get the record you provided, as an example, you would execute x <- rentrez::entrez_fetch(db = "nuccore", id = "MZ413793", rettype = "text"), which returns only ~ 3 KB of data. You can choose whatever return type is most convenient to you and then extract the information of interest. Try ?entrez_fetch for more info.

@trilisser
Copy link
Author

trilisser commented Jun 13, 2023

Sorry for the late response. I want to get the information that I highlighted by red square without downloading an entire record (for example by rentrez_summary which allow me to get collection date, strain name, etc. without downloading a whole record), because this takes time in the case of houndreds of records. Also, as far as I understand, the area highlighted is not a journal reference, it is information on an institute which provide a genome sequence.

@allenbaron
Copy link

If the data you want is not in entrez_summary(), the only option I'm aware of is entrez_fetch() and I do not think a specific field can be specified in the that is similar to what can be done with entrez_link(). Maybe a different approach can be found by searching through some examples in the book: https://www.ncbi.nlm.nih.gov/books/NBK179288/.

@trilisser
Copy link
Author

Thank you very much for your help! I'll explore your link.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants