Skip to content

Commit 041474c

Browse files
committed
Added flowchart for migration phase
1 parent deb5ba9 commit 041474c

File tree

2 files changed

+11
-7
lines changed

2 files changed

+11
-7
lines changed
79.8 KB
Loading

articles/storage/common/tape-migration-guide.md

Lines changed: 11 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -39,8 +39,8 @@ Before a tape migration process starts, options must be carefully considered. Fi
3939

4040
|Approach | Pros | Cons |
4141
| ------- | ---- | ---- |
42-
| Customer performed migration | - Data never leaves the site <br> - No logistics for shipping tapes | - Requires hardware resources <br> - Adds more work to personnel <br> - Requires specific knowledge in handling tapes <br> - Possible unknown costs|
43-
| Tape migration partner | - Simple pricing, and known cost upfront (paid per tape) <br> - No impact on production <br> - No impact on personnel | - Requires logistics for shipping tapes <br> - Security considerations required due to shipping tapes <br> - Multiple copies needed for data availability during migration |
42+
| **Customer performed migration** | - Data never leaves the site <br> - No logistics for shipping tapes | - Requires hardware resources <br> - Adds more work to personnel <br> - Requires specific knowledge in handling tapes <br> - Possible unknown costs|
43+
| **Tape migration partner** | - Simple pricing, and known cost upfront (paid per tape) <br> - No impact on production <br> - No impact on personnel | - Requires logistics for shipping tapes <br> - Security considerations required due to shipping tapes <br> - Multiple copies needed for data availability during migration |
4444

4545
Several major considerations can easily guide our decision on who can execute the migration, customer, or partner.
4646

@@ -50,9 +50,9 @@ Resources are the most critical part of the tape migration process, and we divid
5050

5151
| Category | Notes |
5252
| -------- | ------|
53-
| People | - Specific set of skills are required<br> - Process is labor intensive |
54-
| Hardware | - Different tape generations require different type of hardware <br> - Speed of the migration is proportional to available drives |
55-
| Software | - Access to software that created the data is needed <br> - Access to encryption keys is needed |
53+
| **People** | - Specific set of skills are required<br> - Process is labor intensive |
54+
| **Hardware** | - Different tape generations require different type of hardware <br> - Speed of the migration is proportional to available drives |
55+
| **Software** | - Access to software that created the data is needed <br> - Access to encryption keys is needed |
5656

5757
Hardware is usually the most challenging part. If we're migrating existing tape generations, hardware is available, but used as part of the existing production. But for older tape generations, hardware is often end-of-life, and it's harder to acquire. With older tape generation, using a tape migration partner is a preferred, and simpler option.
5858
When production hardware is used for migrations, careful planning is needed to make sure migration doesn't interfere with the production workloads. Here we can apply three different models:
@@ -96,7 +96,7 @@ Main criteria for deciding the format is how do we plan to use the migrated data
9696

9797
## Migration process
9898

99-
Once we made decisions on migration execution, and preferred file format, we can start with the migration. Migration goes through several phases.
99+
Once we made decisions on migration execution, and preferred data format, we can start with the migration. Migration goes through several phases.
100100
![Picture showing tape migration phases](./media/tape-migration-guide/tape-migration-steps.png)
101101

102102
### Information phase
@@ -141,7 +141,11 @@ After we gathered basic information, we can prepare for the migration. Preparati
141141

142142
### Migration phase
143143

144-
Once the migration design is final, we start the migration process. Before ramping up to full migration pace, we always perform a test with a smaller sample. Goal for the test is to make sure that end-to-end process works. It allows us to make tweaks, and improve the process. Once the test is successful, and we're happy with the results, we execute the migration. For each file we migrate, we need to perform data validation to make sure that data wasn't corrupted during the migration process. In ideal situation, source data already contains hash values that can be easily compared to hash values post-migration. If hashes don't exist, they must be calculated before the file is migrated. If hashes match, file is marked as migrated. If not, file is discarded, and migrated again. Sometimes the data is corrupted on the source tapes. Having the original hash values helps with catching those rare cases. If they happen, we can read the data from secondary copy if it exists. Data validation process is a critical component for a migration design. Process for handling failed validation must be defined. Migration phase is also constantly monitored to make sure we can react to unpredictable situation, and adapt to it. Regular reporting to main stakeholders is important to keep the migration on track.
144+
Once the migration design is final, we start the migration process. Before ramping up to full migration pace, we always perform a test with a smaller sample. Goal for the test is to make sure that end-to-end process works. It allows us to make tweaks, and improve the process. Once the test is successful, and we're happy with the results, we execute the migration. Migration phase is slightly different if we're using native files vs. virtual tapes. In both cases, it's a repetitive process that circles through all the tapes, and reads their entire content. This flowchart shows the migration phase when migrating to native files.
145+
![Flowchart that shows migration phase](./media/tape-migration-guide/tape-migration-phase.png)
146+
147+
#### Data validation
148+
For each file we migrate, we need to perform data validation to make sure that data wasn't corrupted during the migration process. In ideal situation, source data already contains hash values that can be easily compared to hash values post-migration. If hashes don't exist, they must be calculated before the file is migrated. If hashes match, file is marked as migrated. If not, file is discarded, and migrated again. Sometimes the data is corrupted on the source tapes. Having the original hash values helps with catching those rare cases. If they happen, we can read the data from secondary copy if it exists. Data validation process is a critical component for a migration design. Process for handling failed validation must be defined. Migration phase is also constantly monitored to make sure we can react to unpredictable situation, and adapt to it. Regular reporting to main stakeholders is important to keep the migration on track.
145149

146150
### Post-migration phase
147151

0 commit comments

Comments
 (0)