OpenCGA Storage Hadoop
imedina edited this page May 16, 2016 · 1 revision
The main aim is to store and index gVCF (genomic Variant Call Format) files as well as VCF files. To provide this functionality, the information is stored in two different tables, each serving a different kind of query.
- Storage: The stored data should be a full representation of the input (gVCF or VCF format), so that exporting it reproduces the full content of the original file. The focus is a compact and comprehensive representation of the data per region.
- Indexing: The genomic differences observed in any individual in a study are stored and annotated in this table.
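
The split between the two tables can be illustrated with a small in-memory sketch. This is not the OpenCGA schema or API: the class `TwoTableStore`, the helper `region_key`, the `CHUNK_SIZE` value and the key layouts are all hypothetical, chosen only to show the idea of one table holding every record grouped per region and a second table holding only the observed differences.

```python
from collections import defaultdict

CHUNK_SIZE = 1000  # hypothetical region size, not an OpenCGA setting


def region_key(chrom, pos, chunk_size=CHUNK_SIZE):
    """Build a hypothetical row key that groups records by genomic region."""
    return f"{chrom}_{pos // chunk_size:08d}"


class TwoTableStore:
    """Toy model of the two tables; plain dicts stand in for real tables."""

    def __init__(self):
        # Storage table: region key -> every record line, kept verbatim.
        self.archive = defaultdict(list)
        # Indexing table: (chrom, pos, ref, alt) -> {sample: genotype}.
        self.variants = {}

    def load(self, sample, lines):
        for line in lines:
            if line.startswith("#"):
                continue  # header lines are skipped in this sketch
            line = line.rstrip("\n")
            fields = line.split("\t")
            chrom, pos, ref, alt = fields[0], int(fields[1]), fields[3], fields[4]
            # Every record, including reference blocks, goes to storage.
            self.archive[region_key(chrom, pos)].append((sample, line))
            genotype = fields[9].split(":")[0] if len(fields) > 9 else "."
            # Only real alternate alleles go to the indexing table.
            for allele in alt.split(","):
                if allele in (".", "<NON_REF>", "<*>"):
                    continue
                key = (chrom, pos, ref, allele)
                self.variants.setdefault(key, {})[sample] = genotype

    def export(self, sample):
        """Return the stored record lines for one sample, region by region."""
        return [
            line
            for key in sorted(self.archive)
            for owner, line in self.archive[key]
            if owner == sample
        ]
```

In this sketch a gVCF reference block (ALT of `<NON_REF>` only) is kept in the storage table so the data records can be exported again, but it never reaches the indexing table, which holds only the differences observed in a sample.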
OpenCGA is an open source project and it is freely available.