Skip to content

amatov/FragmentomicsSubclinicalDisease

Repository files navigation

Utilization of cfDNA fragment size patterns ​for disease detection & classification ​based on low-coverage WGS data

Presentations and clinical applications on this project (Classification of Genome-Wide cfDNA Fragmentation Patterns with Deep Learning) are available here: http://dx.doi.org/10.13140/RG.2.2.34819.89121/1 (5 PDF files)

We consider the relative entropy between cohorts’ cfDNA fragment lengths and test two hypotheses.

  1. We can pinpoint particular lengths for which disease differs from healthy.

  2. We can identify distinct differences for colorectal (CRC) as well as other cancer types (ovarian, pancreatic, gastric, breast, lung cancer and cholangiocarcinoma).

Preliminary Kullback-Leibler divergence (PMC5812299) analysis of the Delfi (PMC6774252) data shows:

  1. Cancer vs healthy:
  • Healthy individuals and cancer patients exhibit differences for particular fragment lengths (classification of new clinical samples and early detection of disease).
  • We measure two to three peaks (see KLD_CRC_FRL.pdf) on the divergence histogram (identify the disease stage).
  1. Cancer vs cancer:
  • CRC patients and other cancers exhibit differences for particular fragment lengths (identify the tissue of origin).
  • At least 8% of the fragments belong to diverging populations (determine the degree of overlap between the regulation of different tumors).

The Delfi 2 / Endo II cohort consisted of samples from clinical trials NCT03637686, NCT03748680, NCT04084249