Skip to content

Commit 3007993

Browse files
Merge pull request #252457 from normesta/gen2
Adding a table example
2 parents e4222d2 + ad1ab94 commit 3007993

File tree

1 file changed

+51
-0
lines changed

1 file changed

+51
-0
lines changed

articles/storage/blobs/data-lake-storage-migrate-gen1-to-gen2-azure-portal.md

Lines changed: 51 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -230,6 +230,57 @@ The following functionality isn't supported in the compatibility layer.
230230

231231
## Frequently asked questions
232232

233+
#### How long will migration take?
234+
235+
Data and metadata are migrated in parallel. The total time required to complete a migration is equal to whichever of these two processes complete last.
236+
237+
The following table shows the approximate speed of each migration processing task.
238+
239+
> [!NOTE]
240+
> These time estimates are approximate and can vary. For example, copying a large number of small files can slow performance.
241+
242+
| Processing task | Speed |
243+
|----------------------------------------|---------------------------------------|
244+
| Data copy | 9 TB per hour |
245+
| Data validation | 9 million files per hour |
246+
| Metadata copy | 4 million files and folders per hour |
247+
| Metadata processing | 25 million files and folders per hour |
248+
| Additional metadata processing (data copy option)<sup>1</sup> | 50 million files and folders per hour |
249+
250+
<sup>1</sup> The additional metadata processing time applies only if you choose the **Copy data to a new Gen2 account** option. This processing time does not apply if you choose the **Complete migration to a new gen2 account** option.
251+
252+
##### Example: Processing a large amount of data and metadata
253+
254+
This example assumes **300 TB** of data and **200 million** data and metadata items.
255+
256+
| Task | Estimated time |
257+
|--|--|
258+
| Copy data | 300 TB / 9 TB = 33.33 hours |
259+
| Validate data | 200 million / 9 million = 22.22 hours|
260+
| **Total data migration time** | **33.33 + 22.2 = 55.55 hours** |
261+
| Copy metadata | 200 million / 4 million = 50 hours |
262+
| Metadata processing | 200 million / 25 million = 8 hours |
263+
| Additional metadata processing - data copy option only | 200 million / 50 million = 4 hours |
264+
| **Total metadata migration time** | **50 + 8 + 4 = 62 hours** |
265+
| **Total time to perform a data-only migration** | **62 hours** |
266+
| **Total time to perform a complete migration**| **62 - 4 = 58 hours** |
267+
268+
##### Example: Processing a small amount of data and metadata
269+
270+
This example assumes that **2 TB** of data and **56 thousand** data and metadata items.
271+
272+
| Task | Estimated time |
273+
|--|--|
274+
| Copy data | (2 TB / 9 TB) * 60 minutes = 13.3 minutes|
275+
| Validate data | (56,000 / 9 million) * 3,600 seconds = 22.4 seconds |
276+
| **Total data migration time** | **13.3 minutes + 22.4 seconds = approximately 14 minutes** |
277+
| Copy metadata | (56,000 / 4 million) * 3,600 seconds = approximately 51 seconds |
278+
| Metadata processing | 56,000/ 25 million = 8 seconds |
279+
| Additional metadata processing - data copy option only | (56,000 / 50 million) * 3,600 seconds = 4 seconds|
280+
| **Total metadata migration time** | **51 + 8 + 4 = 63 seconds** |
281+
| **Total time to perform a data-only migration** | **14 minutes** |
282+
| **Total time to perform a complete migration** | **14 minutes - 4 seconds = 13 minutes and 56 seconds (approximately 14 minutes)** |
283+
233284
#### How much does the data migration cost?
234285

235286
There's no cost to use the portal-based migration tool, however you'll be billed for usage of Azure Data Lake Gen1 and Gen2 services. During the data migration, you'll be billed for the data storage and transactions of the Gen1 account.

0 commit comments

Comments
 (0)