Merge pull request #273122 from mamtagiri/genomics-mg

v-regandowner · web-flow · commit 92cb158c9a71 · 2024-06-12T15:24:29.000-04:00
update data enters
diff --git a/articles/open-datasets/dataset-immunecode.md b/articles/open-datasets/dataset-immunecode.md
@@ -0,0 +1,53 @@
+---
+title: ImmuneCODE database
+description: Learn how to use the ImmuneCODE database in Azure Open Datasets.
+ms.service: open-datasets
+ms.topic: sample
+ms.date: 11/09/2023
+---
+
+# ImmuneCODE database
+
+The ImmuneCODE™ database, which includes hundreds of millions of T-cell Receptor (TCR) sequences from over 1,400 subjects exposed to or infected with the SARS-CoV-2 virus, and over 160,000 high-confidence SARS-CoV-2-specific TCRs. 
+The database is accessible at no cost. Its data can be analyzed to aid global initiatives aimed at comprehending the immune response to the SARS-CoV-2 virus and crafting novel interventions. To learn more about the dataset refer the associated [publication.](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418738/)
+
+The latest ImmuneCODE datasets available contains: Release 002.
+
+- The 1,486 subjects exposed to or infected with the SARSCoV-2 virus: ImmuneCODE-Repertoires-002.2.
+- The sample metadata: ImmuneCODE-Repertoire-Tags-002.2.tsv (572 KB) Release 002.2.
+- The high-confidence SARS-CoV-2-specific (Over 160,000): ImmuneCODE-MIRA-Release 002.1.
+- The sample metadata: ImmuneCODE-Repertoire-Tags-002.2.xlsx (352 KB) Release 002.2.
+
+[!INCLUDE [open-datasets-usage-note](./includes/open-datasets-usage-note.md)]
+
+## Data source
+
+This dataset is a mirror of https://clients.adaptivebiotech.com/pub/covid-2020
+
+## Data volumes and update frequency
+
+This dataset contains approximately 228 GB of data and is updated daily.
+
+## Storage location
+
+This dataset is stored in the West US 2 Azure regions. Allocating compute resources in West US 2 is recommended for affinity.
+
+## Data access
+
+West US 2: 'https://dataset1000genomes.blob.core.windows.net/dataset'
+
+West Central US: 'https://dataset1000genomes-secondary.blob.core.windows.net/dataset'
+
+[SAS Token](../storage/common/storage-sas-overview.md): sv=2019-10-10&si=prod&sr=c&sig=9nzcxaQn0NprMPlSh4RhFQHcXedLQIcFgbERiooHEqM%3D
+
+## Use terms
+
+To learn more about the data use terms refer the [publication](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418738/) and [Terms of Use](https://clients.adaptivebiotech.com/terms-of-use).
+
+## Contact
+
+https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418738/
+
+## Next steps
+
+View the rest of the datasets in the [Open Datasets catalog](dataset-catalog.md).
diff --git a/articles/open-datasets/dataset-open-targets.md b/articles/open-datasets/dataset-open-targets.md
@@ -0,0 +1,48 @@
+---
+title: Open Targets
+description: Learn how to use the Open Targets dataset in Azure Open Datasets.
+ms.service: open-datasets
+ms.topic: sample
+ms.date: 04/16/2021
+---
+
+# Open Targets
+
+The Open Targets Platform is a data resource to facilitate the systematic identification and prioritization of potential therapeutic drug targets. This resource integrates publicly available datasets, including those datasets that are generated by the Open Targets consortium, to build and score target-disease associations, aiding in the identification and prioritization of drug targets. Additionally, it incorporates pertinent annotation information about targets, diseases, phenotypes, drugs, and their key relationships.
+
+The Open Targets Genetics highlights variant-centric statistical evidence to allow both prioritization of candidate causal variants at trait-associated loci and identification of potential drug targets. It collects and combines genetic associations gathered from published literature as well as newly derived data from sources like UK Biobank and FinnGen. Additionally, it includes functional genomics information such as chromatin conformation and interactions, along with quantitative trait loci (eQTLs, pQTLs, and sQTLs). Large-scale pipelines apply statistical fine-mapping across thousands of trait-associated loci to resolve association signals and link each variant to its proximal and distal target genes using a 'Locus2Gene' assessment. Integrated cross-trait colocalisation analyses and linking to detailed pharmaceutical compounds extend the capacity of Open Targets Genetics to explore drug repositioning opportunities and shared genetic architecture.
+
+- To read further about Open Targets Platform visit - [Open Targets Platform](https://platform.opentargets.org)
+- To read further about Open Targets Genetics visit - [Open Targets Genetics](https://genetics.opentargets.org)
+
+[!INCLUDE [Open Dataset usage notice](./includes/open-datasets-usage-note.md)]
+
+## Data source
+
+This dataset is a mirror of http://ftp.ebi.ac.uk/pub/databases/opentargets/platform/latest and http://ftp.ebi.ac.uk/pub/databases/opentargets/genetics/latest/
+
+## Data volumes and update frequency
+
+This dataset contains approximately 350 GB of data and is updated daily.
+
+## Storage location
+
+This dataset is stored in the West US 2 Azure region. Allocating compute resources in West US 2 is recommended for affinity.
+
+## Data access
+
+West US 2: `https://datasetopentargets.blob.core.windows.net/dataset`
+
+[SAS Token](../storage/common/storage-sas-overview.md): sv=2019-10-10&si=prod&sr=c&sig=9nzcxaQn0NprMPlSh4RhFQHcXedLQIcFgbERiooHEqM%3D
+
+
+## Use terms
+
+Please refer to the data use terms as described [here](https://platform-docs.opentargets.org/licence).
+
+## Contact
+
+[https://www.internationalgenome.org/contact](https://community.opentargets.org)
+
+
+View the rest of the datasets in the [Open Datasets catalog](dataset-catalog.md).