Accompanying code to produce and analyze the data from the paper Quantifying the Impact of Biobanks and Cohort Studies.
The associated data can be found in the corresponding Zenodo Repository.
The documentation to run the code is found in the PDF file documentation.pdf
.
Several repositories were used to extract the names of biobanks and associated cohort studies, their mentions across biomedical documents, and their validation.
- BioLINCC
- BBMRI-ERIC
- Birthcohorts
- CEDC (Cancer Epidemiology Descriptive Cohort Database)
- Cohort profiles and updates
- DCEG (Division of Cancer Epidemiology & Genetics)
- dbGaP
- DPUK (Dementias Platform UK)
- EPND (European platform for neurodegenerative diseases)
- IADRP
- JPND (EU Join Program Neurodegenerative disease research)
- Maelstrom
- Molgenis (European Networks Health Data and Cohort Catalogue**)**)
- P3 (Public Population Project in Genomics and Society)
- SciCrunch
- The Pooling Project of Prospective Studies of Diet and Cancer
- UKRI (UK research and innovation cohort directory)
- Wikipedia biobanks
- Wikipedia cohorts
The following databases were stored in Google BigQuery: