You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/time-series-insights/time-series-insights-update-storage-ingress.md
+17-13Lines changed: 17 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@ ms.workload: big-data
8
8
ms.service: time-series-insights
9
9
services: time-series-insights
10
10
ms.topic: conceptual
11
-
ms.date: 02/10/2020
11
+
ms.date: 04/27/2020
12
12
ms.custom: seodec18
13
13
---
14
14
@@ -18,15 +18,15 @@ This article describes updates to data storage and ingress for Azure Time Series
18
18
19
19
## Data ingress
20
20
21
-
Your Azure Time Series Insights environment contains an *ingestion engine* to collect, process, and store time-series data.
21
+
Your Azure Time Series Insights environment contains an *ingestion engine* to collect, process, and store time-series data.
22
22
23
23
There are some considerations to be mindful of to ensure all incoming data is processed, to achieve high ingress scale, and minimize *ingestion latency* (the time taken by Time Series Insights to read and process data from the event source) when [planning your environment](time-series-insights-update-plan.md).
24
24
25
25
Time Series Insights Preview data ingress policies determine where data can be sourced from and what format the data should have.
26
26
27
27
### Ingress policies
28
28
29
-
*Data ingress* involves how data is sent to an Azure Time Series Insights Preview environment.
29
+
*Data ingress* involves how data is sent to an Azure Time Series Insights Preview environment.
30
30
31
31
Key configuration, formatting, and best practices are summarized below.
32
32
@@ -37,10 +37,11 @@ Azure Time Series Insights Preview supports the following event sources:
Azure Time Series Insights Preview supports a maximum of two event sources per instance. When you connect an event source, your TSI environment will read all of the events currently stored in your Iot or Event Hub, starting with the oldest event.
40
+
Azure Time Series Insights Preview supports a maximum of two event sources per instance. When you connect an event source, your TSI environment will read all of the events currently stored in your Iot or Event Hub, starting with the oldest event.
41
41
42
-
> [!IMPORTANT]
43
-
> * You may experience high initial latency when attaching an event source to your Preview environment.
42
+
> [!IMPORTANT]
43
+
>
44
+
> * You may experience high initial latency when attaching an event source to your Preview environment.
44
45
> Event source latency depends on the number of events currently in your IoT Hub or Event Hub.
45
46
> * High latency will subside after event source data is first ingested. Submit a support ticket through the Azure portal if you experience ongoing high latency.
46
47
@@ -59,7 +60,7 @@ The supported data types are:
59
60
60
61
#### Objects and arrays
61
62
62
-
You may send complex types such as objects and arrays as part of your event payload, but your data will undergo a flattening process when stored.
63
+
You may send complex types such as objects and arrays as part of your event payload, but your data will undergo a flattening process when stored.
63
64
64
65
Detailed information describing how to shape your JSON events, send complex type, and nested object flattening is available in [How to shape JSON for ingress and query](./time-series-insights-update-how-to-shape-events.md) to assist with planning and optimization.
65
66
@@ -73,7 +74,7 @@ We recommend that you employ the following best practices:
73
74
74
75
* Understand how to optimize and shape your JSON data, as well as the current limitations in preview, by reading [how to shape JSON for ingress and query](./time-series-insights-update-how-to-shape-events.md).
75
76
76
-
### Ingress scale and Preview limitations
77
+
### Ingress scale and Preview limitations
77
78
78
79
Azure Time Series Insights Preview ingress limitations are described below.
79
80
@@ -88,7 +89,8 @@ In general, ingress rates are viewed as the factor of the number of devices that
88
89
89
90
By default, Time Series Insights preview can ingest incoming data at a rate of **up to 1 megabyte per second (MBps) per Time Series Insights environment**. There are additional limitations [per hub partition](https://docs.microsoft.com/azure/time-series-insights/time-series-insights-update-storage-ingress#hub-partitions-and-per-partition-limits).
90
91
91
-
> [!TIP]
92
+
> [!TIP]
93
+
>
92
94
> * Environment support for ingesting speeds up to 16 MBps can be provided by request.
93
95
> * Contact us if you require higher throughput by submitting a support ticket through Azure portal.
94
96
@@ -112,7 +114,7 @@ By default, Time Series Insights preview can ingest incoming data at a rate of *
112
114
113
115
When planning your Time Series Insights environment, it's important to consider the configuration of the event source(s) that you'll be connecting to Time Series Insights. Both Azure IoT Hub and Event Hubs utilize partitions to enable horizontal scale for event processing.
114
116
115
-
A *partition* is an ordered sequence of events held in a hub. The partition count is set during the hub creation phase and cannot be changed.
117
+
A *partition* is an ordered sequence of events held in a hub. The partition count is set during the hub creation phase and cannot be changed.
116
118
117
119
For Event Hubs partitioning best practices, review [How many partitions do I need?](https://docs.microsoft.com/azure/event-hubs/event-hubs-faq#how-many-partitions-do-i-need)
118
120
@@ -127,7 +129,7 @@ Azure Time Series Insights Preview currently has a general **per partition limit
127
129
128
130
When a device is created in IoT Hub, it's permanently assigned to a partition. In doing so, IoT Hub is able to guarantee event ordering (since the assignment never changes).
129
131
130
-
A fixed partition assignment also impacts Time Series Insights instances that are ingesting data sent from IoT Hub downstream. When messages from multiple devices are forwarded to the hub using the same gateway device ID, they may arrive in the same partition at the same time potentially exceeding the per partition scale limits.
132
+
A fixed partition assignment also impacts Time Series Insights instances that are ingesting data sent from IoT Hub downstream. When messages from multiple devices are forwarded to the hub using the same gateway device ID, they may arrive in the same partition at the same time potentially exceeding the per partition scale limits.
131
133
132
134
**Impact**:
133
135
@@ -140,6 +142,7 @@ To mitigate that circumstance, we recommend the following best practices:
140
142
141
143
> [!IMPORTANT]
142
144
> For environments using IoT Hub as an event source, calculate the ingestion rate using the number of hub devices in use to be sure that the rate falls below the 0.5 MBps per partition limitation in preview.
145
+
>
143
146
> * Even if several events arrive simultaneously, the Preview limit will not be exceeded.
@@ -181,15 +184,15 @@ For a thorough description of Azure Blob storage, read the [Storage blobs introd
181
184
182
185
When you create an Azure Time Series Insights Preview PAYG environment, an Azure Storage general-purpose V1 blob account is created as your long-term cold store.
183
186
184
-
Azure Time Series Insights Preview retains up to two copies of each event in your Azure Storage account. One copy stores events ordered by ingestion time, always allowing access to events in a time-ordered sequence. Over time, Time Series Insights Preview also creates a repartitioned copy of the data to optimize for performant Time Series Insights query.
187
+
Azure Time Series Insights Preview retains up to two copies of each event in your Azure Storage account. One copy stores events ordered by ingestion time, always allowing access to events in a time-ordered sequence. Over time, Time Series Insights Preview also creates a repartitioned copy of the data to optimize for performant Time Series Insights query.
185
188
186
189
During public Preview, data is stored indefinitely in your Azure Storage account.
187
190
188
191
#### Writing and editing Time Series Insights blobs
189
192
190
193
To ensure query performance and data availability, don't edit or delete any blobs that Time Series Insights Preview creates.
191
194
192
-
#### Accessing Time Series Insights Preview cold store data
195
+
#### Accessing Time Series Insights Preview cold store data
193
196
194
197
In addition to accessing your data from the [Time Series Insights Preview explorer](./time-series-insights-update-explorer.md) and [Time Series Query](./time-series-insights-update-tsq.md), you may also want to access your data directly from the Parquet files stored in the cold store. For example, you can read, transform, and cleanse data in a Jupyter notebook, then use it to train your Azure Machine Learning model in the same Spark workflow.
195
198
@@ -218,6 +221,7 @@ Time Series Insights Preview stores copies of your data as follows:
218
221
In both cases, the time property of the Parquet file corresponds to blob creation time. Data in the `PT=Time` folder is preserved with no changes once it's written to the file. Data in the `PT=TsId` folder will be optimized for query over time and is not static.
219
222
220
223
> [!NOTE]
224
+
>
221
225
> *`<YYYY>` maps to a four-digit year representation.
222
226
> *`<MM>` maps to a two-digit month representation.
223
227
> *`<YYYYMMDDHHMMSSfff>` maps to a time-stamp representation with four-digit year (`YYYY`), two-digit month (`MM`), two-digit day (`DD`), two-digit hour (`HH`), two-digit minute (`MM`), two-digit second (`SS`), and three-digit millisecond (`fff`).
0 commit comments