You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/data-explorer/lightingest.md
+47-39Lines changed: 47 additions & 39 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
---
2
-
title: LightIngest
3
-
description: Learn about LightIngest, a command-line utility for ad-hoc data ingestion into Kusto
2
+
title: LightIngest is a command-line utility for ingestion into Azure Data Explorer.
3
+
description: Learn about LightIngest, a command-line utility for ad-hoc data ingestion into Azure Data Explorer.
4
4
author: orspod
5
5
ms.author: orspodek
6
6
ms.reviewer: tzgitlin
@@ -11,7 +11,7 @@ ms.date: 03/17/2020
11
11
12
12
# LightIngest
13
13
14
-
LightIngest is a command-line utility for ad-hoc data ingestion into Kusto.
14
+
LightIngest is a command-line utility for ad-hoc data ingestion into Azure Data Explorer.
15
15
The utility can pull source data from a local folder or from an Azure blob storage container.
16
16
17
17
## Prerequisites
@@ -39,25 +39,24 @@ The utility can pull source data from a local folder or from an Azure blob stora
39
39
>
40
40
>
41
41
42
-
1.(Mandatory) Enter `LightIngest` followed by the connection string to the Kusto cluster that will manage the ingestion.
43
-
The connection string should be enclosed in double quotes and follow the [Kusto connection strings specification](https://docs.microsoft.com/azure/kusto/api/connection-strings/kusto).
42
+
1. Enter `LightIngest` followed by the connection string to the Azure Data Explorer cluster that will manage the ingestion.
43
+
Enclose the connection string in double quotes and follow the [Kusto connection strings specification](https://docs.microsoft.com/azure/kusto/api/connection-strings/kusto).
> * It's recommended to configure `LightIngest` to work with the ingestion endpoint at `https://ingest-{yourClusterNameAndRegion}.kusto.windows.net`. This way the Kusto service can manage the ingestion load, and it provides for recovery in case of transient errors. However, you can also configure `LightIngest` to work directly with the engine endpoint (`https://{yourClusterNameAndRegion}.kusto.windows.net`).
52
-
> * Knowing the raw data size is important for optimal ingestion performance. `LightIngest` will estimate the uncompressed size of local files. However, for compressed blobs, `LightIngest` could have difficulties correctly estimating their raw size without first downloading them. When ingesting compressed blobs, it will be helpful for `LightIngest` performance if you set the `rawSizeBytes` property on the blob metadata to uncompressed data size in bytes.
50
+
* The recommended method is for `LightIngest` to work with the ingestion endpoint at `https://ingest-{yourClusterNameAndRegion}.kusto.windows.net`. This way, the Azure Data Explorer service can manage the ingestion load, and you can easily recover from transient errors. However, you can also configure `LightIngest` to work directly with the engine endpoint (`https://{yourClusterNameAndRegion}.kusto.windows.net`).
51
+
* For optimal ingestion performance, it is important for LightIngest to know the raw data size and so `LightIngest` will estimate the uncompressed size of local files. However, `LightIngest` might not be able to correctly estimate the raw size of compressed blobs without first downloading them. Therefore, when ingesting compressed blobs, set the `rawSizeBytes` property on the blob metadata to uncompressed data size in bytes.
53
52
54
-
## Command line arguments reference
53
+
## General command-line arguments
55
54
56
55
|Argument name |Short name |Type |Mandatory |Description |
| | |string |Mandatory |[Kusto Connection String](https://docs.microsoft.com/azure/kusto/api/connection-strings/kusto) specifying the Kusto endpoint that will handle the ingestion. Should be enclosed in double quotes |
59
-
|-database |-db |string |Optional |Target Kusto database name |
60
-
|-table | |string |Mandatory |Target Kusto table name |
57
+
| | |string |Mandatory |[Azure Data Explorer Connection String](https://docs.microsoft.com/azure/kusto/api/connection-strings/kusto) specifying the Kusto endpoint that will handle the ingestion. Should be enclosed in double quotes |
58
+
|-database |-db |string |Optional |Target Azure Data Explorer database name |
59
+
|-table | |string |Mandatory |Target Azure Data Explorer table name |
61
60
|-sourcePath |-source |string |Mandatory |Path to source files or root URI of the blob container. If the data is in blobs, must contain storage account key or SAS. Recommended to enclose in double quotes |
62
61
|-prefix | |string |Optional |When the source data to ingest resides on blob storage, this URL prefix is shared by all blobs, excluding the container name. For example, if the data is in `MyContainer/Dir1/Dir2`, then the prefix should be `Dir1/Dir2`. Enclosing in double quotes is recommended |
63
62
|-pattern | |string |Optional |Pattern by which source files/blobs are picked. Supports wildcards. For example, `"*.csv"`. Recommended to enclose in double quotes |
@@ -68,11 +67,11 @@ The utility can pull source data from a local folder or from an Azure blob stora
68
67
|-tag | |string |Optional |[Tags](https://docs.microsoft.com/azure/kusto/management/extents-overview#extent-tagging) to associate with the ingested data. Multiple occurrences are permitted |
69
68
|-dontWait | |bool |Optional |If set to 'true', does not wait for ingestion completion. Useful when ingesting large amounts of files/blobs |
70
69
71
-
### Additional arguments for advanced scenarios
70
+
### Command-line arguments for advanced scenarios
72
71
73
72
|Argument name |Short name |Type |Mandatory |Description |
|-compression |-cr |double |Optional |Compression ratio hint. Useful when ingesting compressed files/blobs to help Kusto assess the raw data size. Calculated as original size divided by compressed size |
74
+
|-compression |-cr |double |Optional |Compression ratio hint. Useful when ingesting compressed files/blobs to help Azure Data Explorer assess the raw data size. Calculated as original size divided by compressed size |
76
75
|-limit |-l |integer |Optional |If set, limits the ingestion to first N files |
77
76
|-ingestTimeout | |integer |Optional |Timeout in minutes for all ingest operations completion. Defaults to `60`|
0 commit comments