Commit 167fe6f

Create dataset-immunecode.md
1 parent a465487 commit 167fe6f

1 file changed

+49
-0
lines changed

@@ -0,0 +1,49 @@
1+
---
2+
title: ImmuneCODE database
3+
description: Learn how to use the ImmuneCODE database in Azure Open Datasets.
4+
ms.service: open-datasets
5+
ms.topic: sample
6+
ms.date: 11/09/2023
7+
---
8+
9+
# ImmuneCODE Database
10+
11+
The ImmuneCODE™ database includes hundreds of millions of T-cell receptor (TCR) sequences from over 1,400 subjects exposed to or infected with the SARS-CoV-2 virus, as well as over 160,000 high-confidence SARS-CoV-2-specific TCRs. The database is freely available, and its data can be analyzed to support global efforts to understand the immune response to the SARS-CoV-2 virus and develop new interventions. To learn more about the dataset, refer to the associated [publication](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418738/).
12+
The latest available ImmuneCODE release, Release 002, contains:
13+
1. 1,486 subjects exposed to or infected with the SARS-CoV-2 virus: ImmuneCODE-Repertoires-002.2
14+
2. Sample metadata: ImmuneCODE-Repertoire-Tags-002.2.tsv (572 KB), Release 002.2
15+
3. Over 160,000 high-confidence SARS-CoV-2-specific TCRs: ImmuneCODE-MIRA-Release002.1
16+
4. Sample metadata: ImmuneCODE-Repertoire-Tags-002.2.xlsx (352 KB), Release 002.2
17+
18+
[!INCLUDE [Open Dataset usage notice](../../includes/open-datasets-usage-note.md)]
19+
20+
## Data source
21+
22+
This dataset is a mirror of https://clients.adaptivebiotech.com/pub/covid-2020
23+
24+
## Data volumes and update frequency
25+
26+
This dataset contains approximately 228 GB of data and is updated daily.
27+
28+
## Storage location
29+
30+
This dataset is stored in the West US 2 Azure region. Allocating compute resources in West US 2 is recommended for affinity.
31+
32+
## Data access
33+
34+
West US 2: 'https://dataset1000genomes.blob.core.windows.net/dataset'
35+
36+
West Central US: 'https://dataset1000genomes-secondary.blob.core.windows.net/dataset'
37+
38+
[SAS Token](../storage/common/storage-sas-overview.md): sv=2019-10-10&si=prod&sr=c&sig=9nzcxaQn0NprMPlSh4RhFQHcXedLQIcFgbERiooHEqM%3D
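
As a minimal sketch of how the container URL and SAS token listed above can be combined, the following Python snippet builds request URLs using only the standard library. The helper names (`blob_url`, `list_url`, `list_blob_names`) are hypothetical and not part of any Azure SDK. The listing call assumes the container accepts the Blob Storage "List Blobs" REST operation (`restype=container&comp=list`) with this SAS token. The blob path layout inside the container is not documented here, so any blob name passed in is a placeholder.

```python
import urllib.request
import xml.etree.ElementTree as ET
from urllib.parse import quote

# Values copied from this section; the West Central US URL can be substituted.
CONTAINER_URL = "https://dataset1000genomes.blob.core.windows.net/dataset"
SAS_TOKEN = "sv=2019-10-10&si=prod&sr=c&sig=9nzcxaQn0NprMPlSh4RhFQHcXedLQIcFgbERiooHEqM%3D"


def blob_url(blob_name, container_url=CONTAINER_URL, sas_token=SAS_TOKEN):
    """Return a download URL for one blob with the SAS token appended.

    The token is already percent-encoded, so it is appended unchanged;
    only the blob name is encoded (slashes are kept as path separators).
    """
    return f"{container_url}/{quote(blob_name)}?{sas_token}"


def list_url(container_url=CONTAINER_URL, sas_token=SAS_TOKEN, max_results=10):
    """Return a URL for the List Blobs REST operation on the container."""
    return (
        f"{container_url}?restype=container&comp=list"
        f"&maxresults={max_results}&{sas_token}"
    )


def list_blob_names(max_results=10):
    """Fetch one page of blob names (performs a network request)."""
    with urllib.request.urlopen(list_url(max_results=max_results)) as resp:
        root = ET.fromstring(resp.read())
    # Each listed blob appears as a <Blob> element with a <Name> child.
    return [name.text for name in root.iter("Name")]
```

Calling `list_blob_names()` first is a reasonable way to discover the actual blob paths before passing one to `blob_url`. Allocating the compute that runs these requests in the same region as the storage account should reduce transfer latency.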
39+
40+
## Use terms
41+
42+
To learn more about the data use terms, refer to the [publication](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418738/) and the [Terms of Use](https://clients.adaptivebiotech.com/terms-of-use).
43+
## Contact
44+
45+
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7418738/
46+
47+
## Next steps
48+
49+
View the rest of the datasets in the [Open Datasets catalog](dataset-catalog.md).
