You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/databox/data-box-disk-deploy-copy-data.md
+53-19Lines changed: 53 additions & 19 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ author: stevenmatthew
7
7
ms.service: databox
8
8
ms.subservice: disk
9
9
ms.topic: tutorial
10
-
ms.date: 11/18/2022
10
+
ms.date: 03/26/2024
11
11
ms.author: shaas
12
12
---
13
13
@@ -17,13 +17,12 @@ ms.author: shaas
17
17
# Doc scores:
18
18
# 11/18/22: 75 (2456/62)
19
19
# 09/01/23: 100 (2159/0)
20
+
-->
20
21
21
22
::: zone target="docs"
22
-
-->
23
23
24
24
# Tutorial: Copy data to Azure Data Box Disk and verify
25
25
26
-
<!--
27
26
::: zone-end
28
27
29
28
::: zone target="chromeless"
@@ -35,13 +34,23 @@ After the disks are connected and unlocked, you can copy data from your source d
35
34
::: zone-end
36
35
37
36
::: zone target="docs"
38
-
-->
37
+
38
+
> [!IMPORTANT]
39
+
> Azure Data Box now supports access tier assignment at the blob level. The steps contained within this tutorial reflect the updated data copy process and are specific to block blobs.
40
+
>
41
+
>For help with determining the appropriate access tier for your block blob data, refer to the [Determine appropriate access tiers for block blobs](#determine-appropriate-access-tiers-for-block-blobs) section. Follow the steps containined within the [Copy data to disks](#copy-data-to-disks) section to copy your data to the appropriate access tier.
42
+
>
43
+
> The information contained within this section applies to orders placed after April 1, 2024.
44
+
45
+
> [!CAUTION]
46
+
> This article references CentOS, a Linux distribution that is nearing End Of Life (EOL) status. Please consider your use and planning accordingly.
39
47
40
48
This tutorial describes how to copy data from your host computer and generate checksums to verify data integrity.
41
49
42
50
In this tutorial, you learn how to:
43
51
44
52
> [!div class="checklist"]
53
+
> * Determine appropriate access tiers for block blobs
45
54
> * Copy data to Data Box Disk
46
55
> * Verify data
47
56
@@ -52,20 +61,48 @@ Before you begin, make sure that:
52
61
- You have completed the [Tutorial: Install and configure your Azure Data Box Disk](data-box-disk-deploy-set-up.md).
53
62
- Your disks are unlocked and connected to a client computer.
54
63
- The client computer used to copy data to the disks is running a [Supported operating system](data-box-disk-system-requirements.md#supported-operating-systems-for-clients).
55
-
- The intended storage type for your data matches [Supported storage types](data-box-disk-system-requirements.md#supported-storage-types-for-upload).
64
+
- The intended storage type for your data matches the [Supported storage types](data-box-disk-system-requirements.md#supported-storage-types-for-upload).
56
65
- You've reviewed [Managed disk limits in Azure object size limits](data-box-disk-limits.md#azure-object-size-limits).
57
66
67
+
## Determine appropriate access tiers for block blobs
68
+
69
+
> [!IMPORTANT]
70
+
> The information contained within this section applies to orders placed after April 1<sup>st</sup>, 2024.
71
+
72
+
Azure Storage allows you to store block blob data in multiple access tiers within the same storage account. This ability allows data to be organized and stored more efficiently based on how often it's accessed. The following table contains information and recommendations about Azure Storage access tiers.
73
+
74
+
| Tier | Recommendation | Best practice |
75
+
|-------------|----------------|---------------|
76
+
|**Hot**| Useful for online data accessed or modified frequently. This tier has the highest storage costs, but the lowest access costs. | Data in this tier should be in regular and active use. |
77
+
|**Cool**| Useful for online data accessed or modified infrequently. This tier has lower storage costs and higher access costs than the hot tier. | Data in this tier should be stored for at least 30 days. |
78
+
|**Cold**| Useful for online data accessed or modified rarely but still requiring fast retrieval. This tier has lower storage costs and higher access costs than the cool tier.| Data in this tier should be stored for a minimum of 90 days. |
79
+
|**Archive**| Useful for offline data rarely accessed and having lower latency requirements. | Data in this tier should be stored for a minimum of 180 days. Data removed from the archive tier within 180 days is subject to an early deletion charge. |
80
+
81
+
For more information about blob access tiers, see [Access tiers for blob data](../storage/blobs/access-tiers-overview.md). For more detailed best practices, see [Best practices for using blob access tiers](../storage/blobs/access-tiers-best-practices.md).
82
+
83
+
You can transfer your block blob data to the appropriate access tier by copying it to the corresponding folder within Data Box Disk. This process is discussed in greater detail within the [Copy data to disks](#copy-data-to-disks) section.
84
+
58
85
## Copy data to disks
59
86
60
87
Review the following considerations before you copy the data to the disks:
61
88
62
-
- It is your responsibility to ensure that you copy your local data to the folders that correspond to the appropriate data format. For instance, copy block blob data to the *BlockBlob* folder. Block blobs being archived should be copied to the *BlockBlob_Archive* folder. If the local data format doesn't match the appropriate folder for the chosen storage type, the data upload to Azure fails in a later step.
89
+
- It is your responsibility to copy local data to the share which corresponds to the appropriate data format. For instance, copy block blob data to the *BlockBlob* share. Copy VHDs to the *PageBlob* share. If the local data format doesn't match the appropriate folder for the chosen storage type, the data upload to Azure fails in a later step.
90
+
- You can't copy data directly to a share's *root* folder. Instead, create a folder within the appropriate share and copy your data into it.
91
+
- Folders located at the *PageBlob* share's *root* correspond to containers within your storage account. A new container will be created for any folder whose name does not match an existing container within your storage account.
92
+
- Folders located at the *AzFile* share's *root* correspond to Azure file shares. A new file share will be created for any folder whose name does not match an existing file share within your storage account.
93
+
- The *BlockBlob* share's *root* level contains one folder corresponding to each access tier. When copying data to the *BlockBlob* share, create a subfolder within the top-level folder corresponding to the desired access tier. As with the *PageBlob* share, a new containers will be created for any folder whose name doesn't match an existing container. Data within the container will be copied to the tier corresponding to the subfolder's top-level parent.
94
+
95
+
A container will also be created for any folder residing at the *BlockBlob* share's *root*, though the data it will be copied to the container's default access tier. To ensure that your data is copied to the desired access tier, don't create folders at the *root* level.
96
+
97
+
> [!IMPORTANT]
98
+
> Data uploaded to the archive tier remains offline and needs to be rehydrated before reading or modifying. Data copied to the archive tier must remain for at least 180 days or be subject to an early deletion charge. Archive tier is not supported for ZRS, GZRS, or RA-GZRS accounts.
99
+
63
100
- While copying data, ensure that the data size conforms to the size limits described within in the [Azure storage and Data Box Disk limits](data-box-disk-limits.md) article.
64
101
- To preserve metadata such as ACLs, timestamps, and file attributes when transferring data to Azure Files, follow the guidance within the [Preserving file ACLs, attributes, and timestamps with Azure Data Box Disk](data-box-disk-file-acls-preservation.md) article.
65
102
- If you use both Data Box Disk and other applications to upload data simultaneously, you may experience upload job failures and data corruption.
66
103
67
104
> [!IMPORTANT]
68
-
> Data uploaded to the archive tier remains offline and needs to be rehydrated before reading or modifying. Data copied to the archive tier must remain for at least 180 days or be subject to an early deletion charge. Archive tier is not supported for ZRS, GZRS, or RA-GZRS accounts.
105
+
> If you specified managed disks as one of the storage destinations during order creation, the following section is applicable.
69
106
70
107
> [!IMPORTANT]
71
108
> If you specified managed disks as one of the storage destinations during order creation, the following section is applicable.
@@ -81,23 +118,23 @@ Perform the following steps to connect and copy data from your computer to the D
81
118
82
119
|Selected storage destination |Storage account type|Staging storage account type |Folders and subfolders |
The following screenshot shows an order where a GPv2 storage account and archive tier were specified:
91
128
92
129
:::image type="content" source="media/data-box-disk-deploy-copy-data/content-sml.png" alt-text="Screenshot of the contents of the disk drive." lightbox="media/data-box-disk-deploy-copy-data/content.png":::
93
130
94
-
1. Copy data to be imported as block blobs into the *BlockBlob* folder. Copy data to be stored as block blobs with the archive tier into the *BlockBlob_Archive* folder. Similarly, copy VHD or VHDX data to the *PageBlob* folder, and file share data into *AzureFile* folder.
131
+
1.Copy VHD or VHDX data to the *PageBlob* folder. All files copied to the *PageBlob* folder are copied into a default `$root` container within the Azure Storage account. A container is created in the Azure storage account for each subfolder within the *PageBlob* folder.
95
132
96
-
A container is created in the Azure storage account for each subfolder within the *BlockBlob* and *PageBlob* folders. All files copied to the *BlockBlob* and *PageBlob* folders are copied into a default `$root`container within the Azure Storage account. Any files in the `$root` container are always uploaded as block blobs.
133
+
Copy data to be placed in Azure file shares to a subfolder within the *AzureFile* folder. All files copied to the *AzureFile* folder are copied as files to a default container of type `databox-format-[GUID]`, for example, `databox-azurefile-7ee19cfb3304122d940461783e97bf7b4290a1d7`.
97
134
98
-
Copy data to be placed in Azure file shares to a subfolder within the *AzureFile* folder. All files copied to the *AzureFile*folder are copied as files to a default container of type `databox-format-[GUID]`, for example, `databox-azurefile-7ee19cfb3304122d940461783e97bf7b4290a1d7`.
135
+
You can't copy files directly to the *BlockBlob*'s *root* folder. Within the root folder, you'll find a sub-folder corresponding to each of the available access tiers. To copy your blob data, you must first select the folder corresponding to one of the access tiers. Next, create a sub-folder within that tier's folder to store your data. Finally, copy your data to the newly created sub-folder. Your new sub-folder represents the container created within the storage account during ingestion. Your data is uploaded to this container as blobs. As with the *AzureFile* share, a new blob storage container will be created for each sub-folder located at the *BlockBlob*'s *root* folder. The data within these folders will be saved according to the storage account's default access tier.
99
136
100
-
Before you begin to copy data, you need to move any files and folders that exist in the root directory to a different folder.
137
+
Before you begin to copy data, you need to move any files and folders that exist in the root directory to a different folder.
101
138
102
139
> [!IMPORTANT]
103
140
> All the containers, blobs, and filenames should conform to [Azure naming conventions](data-box-disk-limits.md#azure-block-blob-page-blob-and-file-naming-conventions). If these rules are not followed, the data upload to Azure will fail.
@@ -310,10 +347,8 @@ Advance to the next tutorial to learn how to return the Data Box Disk and verify
310
347
> [!div class="nextstepaction"]
311
348
> [Ship your Azure Data Box back to Microsoft](./data-box-disk-deploy-picked-up.md)
312
349
313
-
<!--
314
350
::: zone-end
315
-
-->
316
-
<!--
351
+
317
352
::: zone target="chromeless"
318
353
319
354
### Copy data to disks
@@ -348,4 +383,3 @@ Take the following steps to verify your data.
348
383
For more information on data validation, see [Validate data](#validate-data). If you experience errors during validation, see [troubleshoot validation errors](data-box-disk-troubleshoot.md).
| Page blob | 4 TiB <br> Every file uploaded in page blob format must be 512 bytes aligned (an integral multiple), else the upload fails. <br> VHD and VHDX are 512 bytes aligned. |
76
-
| Azure Files |1 TiB |
76
+
| Azure Files |4 TiB |
77
77
| Managed disks | 4 TiB <br> For more information on size and limits, see: <li>[Scalability targets of Standard SSDs](../virtual-machines/disks-types.md#standard-ssds)</li><li>[Scalability targets of Premium SSDs](../virtual-machines/disks-types.md#standard-hdds)</li><li>[Scalability targets of Standard HDDs](../virtual-machines/disks-types.md#premium-ssds)</li><li>[Pricing and billing of managed disks](../virtual-machines/disks-types.md#billing)</li>
78
78
79
79
## Azure block blob, page blob, and file naming conventions
0 commit comments