Skip to content

Commit 92cb158

Browse files
Merge pull request #273122 from mamtagiri/genomics-mg
update data enters
2 parents 84eaafc + a64fec0 commit 92cb158

File tree

2 files changed

+101
-0
lines changed

2 files changed

+101
-0
lines changed
Lines changed: 53 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,53 @@
1+
---
2+
title: ImmuneCODE database
3+
description: Learn how to use the ImmuneCODE database in Azure Open Datasets.
4+
ms.service: open-datasets
5+
ms.topic: sample
6+
ms.date: 11/09/2023
7+
---
8+
9+
# ImmuneCODE database
10+
11+
The ImmuneCODE™ database, which includes hundreds of millions of T-cell Receptor (TCR) sequences from over 1,400 subjects exposed to or infected with the SARS-CoV-2 virus, and over 160,000 high-confidence SARS-CoV-2-specific TCRs.
12+
The database is accessible at no cost. Its data can be analyzed to aid global initiatives aimed at comprehending the immune response to the SARS-CoV-2 virus and crafting novel interventions. To learn more about the dataset refer the associated [publication.](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418738/)
13+
14+
The latest ImmuneCODE datasets available contains: Release 002.
15+
16+
- The 1,486 subjects exposed to or infected with the SARSCoV-2 virus: ImmuneCODE-Repertoires-002.2.
17+
- The sample metadata: ImmuneCODE-Repertoire-Tags-002.2.tsv (572 KB) Release 002.2.
18+
- The high-confidence SARS-CoV-2-specific (Over 160,000): ImmuneCODE-MIRA-Release 002.1.
19+
- The sample metadata: ImmuneCODE-Repertoire-Tags-002.2.xlsx (352 KB) Release 002.2.
20+
21+
[!INCLUDE [open-datasets-usage-note](./includes/open-datasets-usage-note.md)]
22+
23+
## Data source
24+
25+
This dataset is a mirror of https://clients.adaptivebiotech.com/pub/covid-2020
26+
27+
## Data volumes and update frequency
28+
29+
This dataset contains approximately 228 GB of data and is updated daily.
30+
31+
## Storage location
32+
33+
This dataset is stored in the West US 2 Azure regions. Allocating compute resources in West US 2 is recommended for affinity.
34+
35+
## Data access
36+
37+
West US 2: 'https://dataset1000genomes.blob.core.windows.net/dataset'
38+
39+
West Central US: 'https://dataset1000genomes-secondary.blob.core.windows.net/dataset'
40+
41+
[SAS Token](../storage/common/storage-sas-overview.md): sv=2019-10-10&si=prod&sr=c&sig=9nzcxaQn0NprMPlSh4RhFQHcXedLQIcFgbERiooHEqM%3D
42+
43+
## Use terms
44+
45+
To learn more about the data use terms refer the [publication](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418738/) and [Terms of Use](https://clients.adaptivebiotech.com/terms-of-use).
46+
47+
## Contact
48+
49+
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418738/
50+
51+
## Next steps
52+
53+
View the rest of the datasets in the [Open Datasets catalog](dataset-catalog.md).
Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,48 @@
1+
---
2+
title: Open Targets
3+
description: Learn how to use the Open Targets dataset in Azure Open Datasets.
4+
ms.service: open-datasets
5+
ms.topic: sample
6+
ms.date: 04/16/2021
7+
---
8+
9+
# Open Targets
10+
11+
The Open Targets Platform is a data resource to facilitate the systematic identification and prioritization of potential therapeutic drug targets. This resource integrates publicly available datasets, including those datasets that are generated by the Open Targets consortium, to build and score target-disease associations, aiding in the identification and prioritization of drug targets. Additionally, it incorporates pertinent annotation information about targets, diseases, phenotypes, drugs, and their key relationships.
12+
13+
The Open Targets Genetics highlights variant-centric statistical evidence to allow both prioritization of candidate causal variants at trait-associated loci and identification of potential drug targets. It collects and combines genetic associations gathered from published literature as well as newly derived data from sources like UK Biobank and FinnGen. Additionally, it includes functional genomics information such as chromatin conformation and interactions, along with quantitative trait loci (eQTLs, pQTLs, and sQTLs). Large-scale pipelines apply statistical fine-mapping across thousands of trait-associated loci to resolve association signals and link each variant to its proximal and distal target genes using a 'Locus2Gene' assessment. Integrated cross-trait colocalisation analyses and linking to detailed pharmaceutical compounds extend the capacity of Open Targets Genetics to explore drug repositioning opportunities and shared genetic architecture.
14+
15+
- To read further about Open Targets Platform visit - [Open Targets Platform](https://platform.opentargets.org)
16+
- To read further about Open Targets Genetics visit - [Open Targets Genetics](https://genetics.opentargets.org)
17+
18+
[!INCLUDE [Open Dataset usage notice](./includes/open-datasets-usage-note.md)]
19+
20+
## Data source
21+
22+
This dataset is a mirror of http://ftp.ebi.ac.uk/pub/databases/opentargets/platform/latest and http://ftp.ebi.ac.uk/pub/databases/opentargets/genetics/latest/
23+
24+
## Data volumes and update frequency
25+
26+
This dataset contains approximately 350 GB of data and is updated daily.
27+
28+
## Storage location
29+
30+
This dataset is stored in the West US 2 Azure region. Allocating compute resources in West US 2 is recommended for affinity.
31+
32+
## Data access
33+
34+
West US 2: `https://datasetopentargets.blob.core.windows.net/dataset`
35+
36+
[SAS Token](../storage/common/storage-sas-overview.md): sv=2019-10-10&si=prod&sr=c&sig=9nzcxaQn0NprMPlSh4RhFQHcXedLQIcFgbERiooHEqM%3D
37+
38+
39+
## Use terms
40+
41+
Please refer to the data use terms as described [here](https://platform-docs.opentargets.org/licence).
42+
43+
## Contact
44+
45+
[https://www.internationalgenome.org/contact](https://community.opentargets.org)
46+
47+
48+
View the rest of the datasets in the [Open Datasets catalog](dataset-catalog.md).

0 commit comments

Comments
 (0)