Utilization of cfDNA fragment size patterns for disease detection & classification based on low-coverage WGS data
We consider the relative entropy between cohorts’ cfDNA fragment lengths and test two hypotheses.
-
We can pinpoint particular lengths for which disease differs from healthy.
-
We can identify distinct differences for colorectal (CRC) as well as other cancer types (ovarian, pancreatic, gastric, breast, lung cancer and cholangiocarcinoma).
Preliminary Kullback-Leibler divergence analysis of the Delfi data shows:
- Cancer vs healthy:
- Healthy individuals and cancer patients exhibit differences for particular fragment lengths (classification of new clinical samples and early detection of disease).
- We measure two to three peaks on the divergence histogram (identify the disease stage).
- Cancer vs cancer:
- CRC patients and other cancers exhibit differences for particular fragment lengths (identify the tissue of origin).
- At least 8% of the fragments belong to diverging populations (determine the degree of overlap between the regulation of different tumors).