You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/storage/blobs/data-lake-storage-query-acceleration-how-to.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,7 +17,7 @@ This article shows you how to use query acceleration (preview) to retrieve a sub
17
17
Query acceleration (preview) is a new capability for Azure Data Lake Storage that enables applications and analytics frameworks to dramatically optimize data processing by retrieving only the data that they require to perform a given operation. To learn more, see [Azure Data Lake Storage Query Acceleration (preview)](data-lake-storage-query-acceleration.md).
18
18
19
19
> [!NOTE]
20
-
> The query acceleration feature is in public preview, and is available in the West Central US, Canada Central, and France Central regions. To review limitations, see the [Known issues](data-lake-storage-known-issues.md) article. To enroll in the preview, see [this form](https://aka.ms/adls/qa-preview-signup).
20
+
> The query acceleration feature is in public preview, and is available in the Canada Central and France Central regions. To review limitations, see the [Known issues](data-lake-storage-known-issues.md) article. To enroll in the preview, see [this form](https://aka.ms/adls/qa-preview-signup).
Query acceleration (preview) is a new capability for Azure Data Lake Storage that enables applications and analytics frameworks to dramatically optimize data processing by retrieving only the data that they require to perform a given operation. This reduces the time and processing power that is required to gain critical insights into stored data.
16
16
17
17
> [!NOTE]
18
-
> The query acceleration feature is in public preview, and is available in the West Central US, Canada Central, and France Central regions. To review limitations, see the [Known issues](data-lake-storage-known-issues.md) article. To enroll in the preview, see [this form](https://aka.ms/adls/qa-preview-signup).
18
+
> The query acceleration feature is in public preview, and is available in the Canada Central and France Central regions. To review limitations, see the [Known issues](data-lake-storage-known-issues.md) article. To enroll in the preview, see [this form](https://aka.ms/adls/qa-preview-signup).
Copy file name to clipboardExpand all lines: articles/storage/blobs/query-acceleration-sql-reference.md
+5-19Lines changed: 5 additions & 19 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,7 +17,7 @@ ms.reviewer: ereilebr
17
17
Query acceleration supports an ANSI SQL-like language for expressing queries over blob contents. The query acceleration SQL dialect is a subset of ANSI SQL, with a limited set of supported data types, operators, etc., but it also expands on ANSI SQL to support queries over hierarchical semi-structured data formats such as JSON.
18
18
19
19
> [!NOTE]
20
-
> The query acceleration feature is in public preview, and is available in the West Central US, Canada Central, and France Central regions. To review limitations, see the [Known issues](data-lake-storage-known-issues.md) article. To enroll in the preview, see [this form](https://aka.ms/adls/qa-preview-signup).
20
+
> The query acceleration feature is in public preview, and is available in the Canada Central and France Central regions. To review limitations, see the [Known issues](data-lake-storage-known-issues.md) article. To enroll in the preview, see [this form](https://aka.ms/adls/qa-preview-signup).
21
21
22
22
## SELECT Syntax
23
23
@@ -280,31 +280,17 @@ SELECT weight,warehouses[0].longitude,id,tags[1] FROM BlobStorage[*]
280
280
281
281
## Sys.Split
282
282
283
-
This is a special form of the SELECT statement, which is only available for CSV-formatted data:
283
+
This is a special form of the SELECT statement, which is only available for CSV-formatted data.
284
284
285
285
```sql
286
286
SELECTsys.split(split_size)FROM BlobStorage
287
287
```
288
288
289
-
In this form, the query returns a list of non-negative integers of sizes of record aligned regions.
289
+
Use this statement to get batches of data records from a CSV file. That way you can process records in parallel instead of having to download them all at one time.
290
290
291
-
### Example
291
+
Use the *split_size* parameter to specify the number of bytes that you want the statement to return. For example, if you want to process only 10 MB of data, you're statement would look like this: `SELECT sys.split(10485760)FROM BlobStorage` because 10 MB is equal to 10,485,760 bytes. This statement returns as many records as can fit into those 10 MB.
The following are examples for the above csv blob:
296
-
297
-
|Command|Result|
298
-
|---|---|
299
-
|SELECT SYS.SPLIT(1) FROM BlobStorage<br>SELECT SYS.SPLIT(6) FROM BlobStorage|"10\n16\n8\n6\n5\n"|
300
-
|SELECT SYS.SPLIT(7) FROM BlobStorage<br>SELECT SYS.SPLIT(8) FROM BlobStorage|"10\n16\n8\n11\n"|
301
-
|SELECT SYS.SPLIT(9) FROM BlobStorage|"10\n16\n14\n5\n"|
302
-
|SELECT SYS.SPLIT(10) FROM BlobStorage|"10\n16\n14\n5\n"|
303
-
|SELECT SYS.SPLIT(11) FROM BlobStorage<br>SELECT SYS.SPLIT(14) FROM BlobStorage|"26\n14\n5\n"|
304
-
|SELECT SYS.SPLIT(15) FROM BlobStorage<br>SELECT SYS.SPLIT(26) FROM BlobStorage|"26\n19\n"|
305
-
|SELECT SYS.SPLIT(27) FROM BlobStorage<br>SELECT SYS.SPLIT(34) FROM BlobStorage|"34\n11\n"|
306
-
|SELECT SYS.SPLIT(28) FROM BlobStorage<br>SELECT SYS.SPLIT(40) FROM BlobStorage|"40\n5\n"|
307
-
|SELECT SYS.SPLIT(41) FROM BlobStorage<br>SELECT SYS.SPLIT(…) FROM BlobStorage|"45\n"|
293
+
In most cases, the total bytes returned will be slightly higher than the number that you specify. That's because the **sys.split** function returns complete records only, and a record might start just before the threshold that you specify.
308
294
309
295
>[!NOTE]
310
296
> The split_size must be at least 10 MB (10485760).
0 commit comments