Skip to content

Commit b2862d6

Browse files
committed
Updates after review
1 parent 041474c commit b2862d6

File tree

1 file changed

+6
-5
lines changed

1 file changed

+6
-5
lines changed

articles/storage/common/tape-migration-guide.md

Lines changed: 6 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
title: Azure Storage tape migration guide
3-
description: Azure Storage tape migration overview guide provides basic guidance for tape migrations
2+
title: Azure Storage tape migration overview
3+
description: Azure Storage tape migration overview provides basic guidance for tape migrations
44
author: dukicn
55
ms.author: nikoduki
66
ms.topic: conceptual
@@ -9,13 +9,13 @@ ms.service: azure-storage
99
ms.subservice: storage-common-concepts
1010
---
1111

12-
# Azure Storage tape migration guide
12+
# Azure Storage tape migration overview
1313

1414
This article focuses on tape migrations. It aims to simplify, provide guidance, and considerations to run through a successful migration of data stored on various tape media to Azure storage services.
1515

1616
## Overview
1717

18-
Tape is one of the dominant storage media, and it stores a large part of world's data. Technology exists for decades, but hundreds of exabytes of new tapes are still being shipped every year.
18+
Tape stores a large portion of worlds data, and remains one of the dominant types of storage media. Tape media has existed for decades, and is still heavily used with hundreds of exabytes of new tapes shipped every year.
1919

2020
Tapes are a great medium for storing cold data. They're fast in sequential reading, but stages requiring mechanical movements (like loading, and unloading of tapes, tape seeks, etc.) are slower. That makes tapes unusable for traditional, random based access, and is the main reason that even today data stored on tapes is rarely used. In addition, tapes are a magnetic medium that require special handling. They're sensitive to environment, particularly temperature, and humidity. If kept within their operating environmental range, they can achieve high durability, and good restore success rate. However, when kept in unfriendly environment, deterioration happens often, and renders the tape unreadable.
2121

@@ -114,6 +114,7 @@ Information phase is critical for gathering key requirements. Gathered informati
114114
- What is the migration deadline? Are there any critical milestones?
115115
- How much network bandwidth is available for migration?
116116
- Where are tapes physically stored, and can they be shipped?
117+
- Do you already have hash values for all files? If yes, which hashing algorithm is used?
117118
- Are tapes needed after migration?
118119
- How to maintain temperature, and humidity for tapes during migration / transport?
119120
- Who are main stakeholders?
@@ -145,7 +146,7 @@ Once the migration design is final, we start the migration process. Before rampi
145146
![Flowchart that shows migration phase](./media/tape-migration-guide/tape-migration-phase.png)
146147

147148
#### Data validation
148-
For each file we migrate, we need to perform data validation to make sure that data wasn't corrupted during the migration process. In ideal situation, source data already contains hash values that can be easily compared to hash values post-migration. If hashes don't exist, they must be calculated before the file is migrated. If hashes match, file is marked as migrated. If not, file is discarded, and migrated again. Sometimes the data is corrupted on the source tapes. Having the original hash values helps with catching those rare cases. If they happen, we can read the data from secondary copy if it exists. Data validation process is a critical component for a migration design. Process for handling failed validation must be defined. Migration phase is also constantly monitored to make sure we can react to unpredictable situation, and adapt to it. Regular reporting to main stakeholders is important to keep the migration on track.
149+
For each file we migrate, we need to perform data validation to make sure that data wasn't corrupted during the migration process. Data validation is done by comparing hash values before the migration, and after the migration. There are many types of hashing algorithms that can be used. A common approach is to use MD5 since Azure Storage contains a pre-defined metadata field Content-MD5 that can be filled during the migration. This allows referring to that MD5 value when ever we interact with the data to validate the data hasn't been changed, or corrupted.In ideal situation, source data already contains hash values that can be easily compared to hash values post-migration. If hashes don't exist, they must be calculated before the file is migrated. If hashes match, file is marked as migrated. If not, file is discarded, and migrated again. Sometimes the data is corrupted on the source tapes. Having the original hash values helps with catching those rare cases. If they happen, we can read the data from secondary copy if it exists. Data validation process is a critical component for a migration design. Process for handling failed validation must be defined. Migration phase is also constantly monitored to make sure we can react to unpredictable situation, and adapt to it. Regular reporting to main stakeholders is important to keep the migration on track.
149150

150151
### Post-migration phase
151152

0 commit comments

Comments
 (0)