ncbi_data_analysis

Miscellaneous Python scripts for working with NCBI records and sequences.

draw_chem.py

Draws chemical structures from a file with one compound per row, given as space-separated CID and SMILES values.
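A minimal sketch of reading the input format described above, assuming the CID comes first and the SMILES string second on each row. The names `read_cid_smiles` and `draw_structures` are hypothetical and not taken from draw_chem.py. The drawing step assumes RDKit is installed, which this README does not state.

```python
from pathlib import Path


def read_cid_smiles(text):
    """Parse rows of 'CID SMILES' (whitespace-separated) into (cid, smiles) pairs."""
    records = []
    for line in text.splitlines():
        parts = line.strip().split(None, 1)
        if len(parts) != 2:
            continue  # skip blank or incomplete rows (an assumption about the input)
        cid, smiles = parts
        records.append((cid, smiles.strip()))
    return records


def draw_structures(records, out_dir="."):
    """Write one PNG per compound, named after its CID.

    Requires RDKit (assumed here), imported lazily so the parser above
    works without it.
    """
    from rdkit import Chem
    from rdkit.Chem import Draw

    for cid, smiles in records:
        mol = Chem.MolFromSmiles(smiles)
        if mol is None:
            continue  # RDKit could not parse this SMILES string
        Draw.MolToFile(mol, str(Path(out_dir) / f"{cid}.png"))
```

For example, `read_cid_smiles(Path("compounds.txt").read_text())` would return the pairs to pass to `draw_structures`; `compounds.txt` is a placeholder file name.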

fetch_gb.py ¹

Retrieves GenBank records from NCBI for a list of GI numbers.
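The retrieval step can be sketched as below. This is not the code in fetch_gb.py: it calls the E-utilities `efetch` endpoint directly with the standard library rather than through Biopython, and the function names, the `nuccore` database and the batch size of 200 are assumptions made for illustration.

```python
import time
from urllib.parse import urlencode
from urllib.request import urlopen

EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def batches(ids, size=200):
    """Yield successive slices of the ID list so each request stays small."""
    for start in range(0, len(ids), size):
        yield ids[start:start + size]


def efetch_url(ids, email, db="nuccore"):
    """Build an efetch URL asking for GenBank flat-file text."""
    params = {
        "db": db,
        "id": ",".join(ids),
        "rettype": "gb",
        "retmode": "text",
        "email": email,
    }
    return f"{EFETCH_URL}?{urlencode(params)}"


def fetch_genbank(ids, email, out_path):
    """Download records batch by batch and append them to one file."""
    with open(out_path, "w") as out:
        for batch in batches(ids):
            with urlopen(efetch_url(batch, email)) as response:
                out.write(response.read().decode("utf-8"))
            time.sleep(0.34)  # stay under three requests per second
```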

fetch_ncbi_taxonomy.py

Retrieves taxonomic lineages for a list of scientific names.

fetch_PubChem_compound.py

Retrieves PubChem records for a list of CIDs.

get_fasta_from_gb.py

Retrieves FASTA sequences from GenBank records.
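A rough sketch of this conversion, assuming Biopython is available (the footnote in this README mentions Biopython, but the exact calls used by get_fasta_from_gb.py are not shown here). `format_fasta` and `genbank_to_fasta` are hypothetical names, and the 60-character line width is an arbitrary choice.

```python
def format_fasta(identifier, description, sequence, width=60):
    """Return one FASTA entry with the sequence wrapped at `width` characters."""
    header = f">{identifier} {description}".rstrip()
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return "\n".join([header] + lines) + "\n"


def genbank_to_fasta(gb_path, fasta_path):
    """Write every record of a GenBank file as FASTA.

    Biopython is imported lazily so that format_fasta can be used alone.
    """
    from Bio import SeqIO

    with open(fasta_path, "w") as out:
        for record in SeqIO.parse(gb_path, "genbank"):
            out.write(format_fasta(record.id, record.description, str(record.seq)))
```

Biopython can also do this in one call with `SeqIO.convert(gb_path, "genbank", fasta_path, "fasta")`; the explicit loop is shown because it leaves room to edit the header of each entry.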

get_metadata_from_BioSample.py

Retrieves metadata from BioSample records' summary as a table.

get_metadata_from_gb.py

Retrieves metadata from GenBank records.

get_prot_from_gb.py

Retrieves protein sequence metadata from GenBank records.

parse_taxids.py

Gets taxonomic lineages from a list of taxids.
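One way to obtain a lineage from a taxid is to parse the XML that `efetch` returns for `db=taxonomy`. The sketch below assumes that layout (`TaxaSet/Taxon/LineageEx/Taxon`); this README does not say how parse_taxids.py gets its lineages, so treat this as an illustration only. `SAMPLE_XML` is a hand-abbreviated example, not a full NCBI response, and `parse_lineages` is a hypothetical name.

```python
import xml.etree.ElementTree as ET

# Hand-abbreviated example of a taxonomy record; a real response has more fields
# and many more lineage levels.
SAMPLE_XML = """<TaxaSet>
  <Taxon>
    <TaxId>9606</TaxId>
    <ScientificName>Homo sapiens</ScientificName>
    <LineageEx>
      <Taxon><TaxId>9443</TaxId><ScientificName>Primates</ScientificName><Rank>order</Rank></Taxon>
      <Taxon><TaxId>9604</TaxId><ScientificName>Hominidae</ScientificName><Rank>family</Rank></Taxon>
      <Taxon><TaxId>9605</TaxId><ScientificName>Homo</ScientificName><Rank>genus</Rank></Taxon>
    </LineageEx>
  </Taxon>
</TaxaSet>"""


def parse_lineages(xml_text):
    """Map each taxid to its lineage as a list of (rank, scientific name) pairs."""
    lineages = {}
    root = ET.fromstring(xml_text)
    for taxon in root.findall("Taxon"):
        taxid = taxon.findtext("TaxId")
        lineages[taxid] = [
            (node.findtext("Rank"), node.findtext("ScientificName"))
            for node in taxon.findall("LineageEx/Taxon")
        ]
    return lineages
```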

search_ncbi_by_term.py ¹

Retrieves a GI list from NCBI matching Entrez search terms.
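A search of this kind could look like the sketch below, which queries the E-utilities `esearch` endpoint with the standard library and reads the identifiers from its JSON output. It is not the script's own code: the function names, the `nuccore` database and the `retmax` value are assumptions.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def esearch_url(term, email, db="nuccore", retmax=100):
    """Build an esearch URL that returns matching identifiers as JSON."""
    params = {
        "db": db,
        "term": term,
        "retmax": retmax,
        "retmode": "json",
        "email": email,
    }
    return f"{ESEARCH_URL}?{urlencode(params)}"


def parse_id_list(json_text):
    """Extract the list of identifiers from an esearch JSON response."""
    return json.loads(json_text)["esearchresult"]["idlist"]


def search_ids(term, email):
    """Run the search over the network and return the identifier list."""
    with urlopen(esearch_url(term, email)) as response:
        return parse_id_list(response.read().decode("utf-8"))
```

The resulting list can be saved one identifier per line and passed to a retrieval script such as fetch_gb.py.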

¹ Tell NCBI who you are by stating your e-mail address with -email <your@email>. Also, respect the NCBI guidelines for posting requests (see https://www.ncbi.nlm.nih.gov/books/NBK25497): 'In order not to overload the E-utility servers, NCBI recommends that users post no more than three URL requests per second and limit large jobs to either weekends or between 9:00 PM and 5:00 AM Eastern time during weekdays. Failure to comply with this policy may result in an IP address being blocked from accessing NCBI. If NCBI blocks an IP address, service will not be restored unless the developers of the software accessing the E-utilities register values of the tool and email parameters with NCBI.' Note that Biopython tools already enforce the limit of three requests per second.
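For code that talks to the E-utilities without Biopython, the three-requests-per-second guideline can be enforced with a small throttle. The class below is a sketch under that assumption; `RateLimiter` is a hypothetical helper and not part of these scripts.

```python
import time


class RateLimiter:
    """Space out calls so that no more than `max_per_second` happen per second."""

    def __init__(self, max_per_second=3, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / max_per_second
        self._clock = clock  # injectable for testing
        self._sleep = sleep
        self._last = None    # time of the previous call, if any

    def wait(self):
        """Block until at least min_interval has passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

Calling `limiter.wait()` immediately before each request keeps consecutive requests at least a third of a second apart with the default setting.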


jgmv/ncbi_data_analysis
