Skip to content

Commit 890237f

Browse files
committed
Freshness update for dataset-clinvar-annotations.md . . .
1 parent 35b8d31 commit 890237f

File tree

1 file changed

+11
-24
lines changed

1 file changed

+11
-24
lines changed

articles/open-datasets/dataset-clinvar-annotations.md

Lines changed: 11 additions & 24 deletions
Original file line numberDiff line numberDiff line change
@@ -3,43 +3,38 @@ title: ClinVar Annotations
33
description: Learn how to use the ClinVar Annotations dataset in Azure Open Datasets.
44
ms.service: open-datasets
55
ms.topic: sample
6-
ms.date: 04/16/2021
6+
ms.reviewer: franksolomon
7+
ms.date: 06/13/2024
78
---
89

910
# ClinVar Annotations
1011

11-
[ClinVar](https://www.ncbi.nlm.nih.gov/clinvar/) is a freely accessible, public archive of reports of the relationships among human variations and phenotypes, with supporting evidence. It facilitates access to and communication about the relationships asserted between human variation and observed health status, and the history of that interpretation. It provides access to a broader set of clinical interpretations that can be incorporated into genomics workflows and applications.
12+
The [ClinVar](https://www.ncbi.nlm.nih.gov/clinvar/) resource is a freely accessible, public archive of reports - with supporting evidence - about the relationships among human variations and phenotypes. It facilitates access to and communication about the claimed relationships between human variation and observed health status, and about the history of that interpretation. It provides access to a broader set of clinical interpretations that researchers can incorporate into genomics workflows and applications.
1213

13-
For more information on the data, see the [Data Dictionary](https://www.ncbi.nlm.nih.gov/projects/clinvar/ClinVarDataDictionary.pdf) and [FAQ](https://www.ncbi.nlm.nih.gov/clinvar/docs/faq/).
14+
Visit the [Data Dictionary](https://www.ncbi.nlm.nih.gov/projects/clinvar/ClinVarDataDictionary.pdf) and the [FAQ resource](https://www.ncbi.nlm.nih.gov/clinvar/docs/faq/) for more information about the data.
1415

1516
[!INCLUDE [Open Dataset usage notice](./includes/open-datasets-usage-note.md)]
1617

1718
## Data source
1819

19-
This dataset is a mirror of ftp://ftp.ncbi.nlm.nih.gov/pub/clinvar/xml/
20+
This dataset is a mirror of the National Library of Medicine ClinVar [FTP resource](https://ftp.ncbi.nlm.nih.gov/pub/clinvar/xml/).
2021

21-
## Data volumes and update frequency
22+
## Data update frequency
2223

23-
This dataset contains approximately 56 GB of data and is updated daily.
24-
25-
## Storage location
26-
27-
This dataset is stored in the West US 2 and West Central US Azure regions. Allocating compute resources in West US 2 or West Central US is recommended for affinity.
24+
This dataset receives daily updates.
2825

2926
## Data Access
3027

31-
West US 2: 'https://datasetclinvar.blob.core.windows.net/dataset'
28+
[FTP resource](https://ftp.ncbi.nlm.nih.gov/pub/clinvar/)
3229

33-
West Central US: 'https://datasetclinvar-secondary.blob.core.windows.net/dataset'
34-
35-
[SAS Token](../storage/common/storage-sas-overview.md): sv=2019-02-02&se=2050-01-01T08%3A00%3A00Z&si=prod&sr=c&sig=qFPPwPba1RmBvaffkzkLuzabYU5dZstSTgMwxuLNME8%3D
30+
[FTP Overview](https://www.ncbi.nlm.nih.gov/clinvar/docs/ftp_primer/)
3631

3732
## Use Terms
3833
Data is available without restrictions. More information and citation details, see [Accessing and using data in ClinVar](https://www.ncbi.nlm.nih.gov/clinvar/docs/maintenance_use/).
3934

4035
## Contact
4136

42-
For any questions or feedback about this dataset, contact [email protected].
37+
For any questions or feedback about this dataset, contact [[email protected]](mailto:[email protected]).
4338

4439
## Data access
4540

@@ -49,17 +44,9 @@ For any questions or feedback about this dataset, contact [email protected]
4944

5045
<!-- nbstart https://opendatasets-api.azure.com/discoveryapi/OpenDataset/DownloadNotebook?serviceType=AzureNotebooks&package=azure-storage&registryId=genomics-clinvar -->
5146

52-
5347
## Getting the ClinVar data from Azure Open Dataset
5448

55-
Several public genomics data has been uploaded as an Azure Open Dataset [here](https://azure.microsoft.com/services/open-datasets/catalog/). We create a blob service linked to this open dataset. You can find examples of data calling procedure from Azure Open Dataset for `ClinVar` dataset in below:
56-
57-
Users can call and download the following path with this notebook: 'https://datasetclinvar.blob.core.windows.net/dataset/ClinVarFullRelease_00-latest.xml.gz.md5'
58-
59-
> [!NOTE]
60-
> Users needs to log-in their Azure Account via Azure CLI for viewing the data with Azure ML SDK. On the other hand, they do not need do any actions for downloading the data.
61-
62-
For more information on installing the Azure CLI, see [Install the Azure CLI](/cli/azure/install-azure-cli)
49+
Several public genomics data resources were uploaded as Azure Open Dataset at [this](https://azure.microsoft.com/services/open-datasets/catalog/) resource.
6350

6451
### Calling the data from 'ClinVar Data Set'
6552

0 commit comments

Comments
 (0)