|
1 | 1 | --- |
2 | 2 | affilliation: UNIVERSITY OF MICHIGAN AT ANN ARBOR |
3 | | -description: "After the completion of the Human Genome Project, several landmarking\ |
4 | | - \ consortia have accumulated large amounts of genomic data towards understanding\ |
5 | | - \ the functions of human genome. The ENCODE project has annotated genome-wide regulatory\ |
6 | | - \ elements. The Roadmap Epigenomic project has characterized tissue-speci\uFB01\ |
7 | | - c variation in epigenetic state. The NIH Common Fund GTEx project has delineated\ |
8 | | - \ tissue-speci\uFB01c gene expression and transcription regulation. The NIH Common\ |
9 | | - \ Fund 4D Nucleome (4DN) project has revealed dynamic 3D chromatin organization\ |
10 | | - \ in many cell and tissue types. Each of the aforementioned consortia has generated\ |
11 | | - \ thousands or even tens of thousands of datasets, and provided different insights\ |
12 | | - \ regarding human genome at an unprecedent scale and depth. However, the datasets\ |
13 | | - \ generated from these consortia are isolated in terms of cell types and tissue\ |
14 | | - \ types covered, how the data are stored, and the resolution of the genomic data.\ |
15 | | - \ These gaps bring realistic data analysis challenges to biomedical researchers\ |
16 | | - \ when they use these public datasets jointly in their research \u2014 they need\ |
17 | | - \ to go through different data portals with heterogeneous processing pipelines,\ |
18 | | - \ different data formats, and unmatched resolutions. We aim to develop the most\ |
19 | | - \ cutting-edge deep learning approaches to impute high-resolution chromatin contact\ |
20 | | - \ maps, and integrate the high-resolution chromatin contact maps with transcriptional\ |
21 | | - \ data available from GTEx project and epigenomic data from ENCODE/Roadmap. We plan\ |
22 | | - \ to share the integrated data on a public web server with a multi-panel interactive\ |
23 | | - \ visualization genome browser. The integrated data will provide an important resource\ |
24 | | - \ for understanding of tissue-speci\uFB01c genetic variation in the light of the\ |
25 | | - \ spatial organization of these genomic and epigenomic elements and their functional\ |
26 | | - \ implications." |
27 | 3 | end_date: '2022-08-31T12:00:00-04:00' |
28 | 4 | grant_num: R03OD030599 |
29 | 5 | pi: LIU, JIE |
30 | 6 | title: A database for high-resolution chromatin contact maps and human genetic variants |
31 | 7 | website: https://github.com/liu-bioinfo-lab/caesar |
32 | | ---- |
| 8 | +--- |
| 9 | +After the completion of the Human Genome Project, several landmarking consortia have accumulated large amounts of genomic data towards understanding the functions of human genome. The ENCODE project has annotated genome-wide regulatory elements. The Roadmap Epigenomic project has characterized tissue-specific variation in epigenetic state. The NIH Common Fund GTEx project has delineated tissue-specific gene expression and transcription regulation. The NIH Common Fund 4D Nucleome (4DN) project has revealed dynamic 3D chromatin organization in many cell and tissue types. Each of the aforementioned consortia has generated thousands or even tens of thousands of datasets, and provided different insights regarding human genome at an unprecedent scale and depth. However, the datasets generated from these consortia are isolated in terms of cell types and tissue types covered, how the data are stored, and the resolution of the genomic data. These gaps bring realistic data analysis challenges to biomedical researchers when they use these public datasets jointly in their research — they need to go through different data portals with heterogeneous processing pipelines, different data formats, and unmatched resolutions. We aim to develop the most cutting-edge deep learning approaches to impute high-resolution chromatin contact maps, and integrate the high-resolution chromatin contact maps with transcriptional data available from GTEx project and epigenomic data from ENCODE/Roadmap. We plan to share the integrated data on a public web server with a multi-panel interactive visualization genome browser. The integrated data will provide an important resource for understanding of tissue-specific genetic variation in the light of the spatial organization of these genomic and epigenomic elements and their functional implications. |
0 commit comments