Skip to content

Commit 67da64a

Browse files
authored
Merge pull request #112911 from Virginia001/work03
Work03
2 parents 905634f + fc8b2de commit 67da64a

File tree

2 files changed

+18
-14
lines changed

2 files changed

+18
-14
lines changed

articles/time-series-insights/time-series-insights-update-plan.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.workload: big-data
88
ms.service: time-series-insights
99
services: time-series-insights
1010
ms.topic: conceptual
11-
ms.date: 01/15/2020
11+
ms.date: 04/27/2020
1212
ms.custom: seodec18
1313
---
1414

articles/time-series-insights/time-series-insights-update-storage-ingress.md

Lines changed: 17 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.workload: big-data
88
ms.service: time-series-insights
99
services: time-series-insights
1010
ms.topic: conceptual
11-
ms.date: 02/10/2020
11+
ms.date: 04/27/2020
1212
ms.custom: seodec18
1313
---
1414

@@ -18,15 +18,15 @@ This article describes updates to data storage and ingress for Azure Time Series
1818

1919
## Data ingress
2020

21-
Your Azure Time Series Insights environment contains an *ingestion engine* to collect, process, and store time-series data.
21+
Your Azure Time Series Insights environment contains an *ingestion engine* to collect, process, and store time-series data.
2222

2323
There are some considerations to be mindful of to ensure all incoming data is processed, to achieve high ingress scale, and minimize *ingestion latency* (the time taken by Time Series Insights to read and process data from the event source) when [planning your environment](time-series-insights-update-plan.md).
2424

2525
Time Series Insights Preview data ingress policies determine where data can be sourced from and what format the data should have.
2626

2727
### Ingress policies
2828

29-
*Data ingress* involves how data is sent to an Azure Time Series Insights Preview environment.
29+
*Data ingress* involves how data is sent to an Azure Time Series Insights Preview environment.
3030

3131
Key configuration, formatting, and best practices are summarized below.
3232

@@ -37,10 +37,11 @@ Azure Time Series Insights Preview supports the following event sources:
3737
- [Azure IoT Hub](../iot-hub/about-iot-hub.md)
3838
- [Azure Event Hubs](../event-hubs/event-hubs-about.md)
3939

40-
Azure Time Series Insights Preview supports a maximum of two event sources per instance. When you connect an event source, your TSI environment will read all of the events currently stored in your Iot or Event Hub, starting with the oldest event.
40+
Azure Time Series Insights Preview supports a maximum of two event sources per instance. When you connect an event source, your TSI environment will read all of the events currently stored in your Iot or Event Hub, starting with the oldest event.
4141

42-
> [!IMPORTANT]
43-
> * You may experience high initial latency when attaching an event source to your Preview environment.
42+
> [!IMPORTANT]
43+
>
44+
> * You may experience high initial latency when attaching an event source to your Preview environment.
4445
> Event source latency depends on the number of events currently in your IoT Hub or Event Hub.
4546
> * High latency will subside after event source data is first ingested. Submit a support ticket through the Azure portal if you experience ongoing high latency.
4647
@@ -59,7 +60,7 @@ The supported data types are:
5960

6061
#### Objects and arrays
6162

62-
You may send complex types such as objects and arrays as part of your event payload, but your data will undergo a flattening process when stored.
63+
You may send complex types such as objects and arrays as part of your event payload, but your data will undergo a flattening process when stored.
6364

6465
Detailed information describing how to shape your JSON events, send complex type, and nested object flattening is available in [How to shape JSON for ingress and query](./time-series-insights-update-how-to-shape-events.md) to assist with planning and optimization.
6566

@@ -73,7 +74,7 @@ We recommend that you employ the following best practices:
7374

7475
* Understand how to optimize and shape your JSON data, as well as the current limitations in preview, by reading [how to shape JSON for ingress and query](./time-series-insights-update-how-to-shape-events.md).
7576

76-
### Ingress scale and Preview limitations
77+
### Ingress scale and Preview limitations
7778

7879
Azure Time Series Insights Preview ingress limitations are described below.
7980

@@ -88,7 +89,8 @@ In general, ingress rates are viewed as the factor of the number of devices that
8889

8990
By default, Time Series Insights preview can ingest incoming data at a rate of **up to 1 megabyte per second (MBps) per Time Series Insights environment**. There are additional limitations [per hub partition](https://docs.microsoft.com/azure/time-series-insights/time-series-insights-update-storage-ingress#hub-partitions-and-per-partition-limits).
9091

91-
> [!TIP]
92+
> [!TIP]
93+
>
9294
> * Environment support for ingesting speeds up to 16 MBps can be provided by request.
9395
> * Contact us if you require higher throughput by submitting a support ticket through Azure portal.
9496
@@ -112,7 +114,7 @@ By default, Time Series Insights preview can ingest incoming data at a rate of *
112114

113115
When planning your Time Series Insights environment, it's important to consider the configuration of the event source(s) that you'll be connecting to Time Series Insights. Both Azure IoT Hub and Event Hubs utilize partitions to enable horizontal scale for event processing.
114116

115-
A *partition* is an ordered sequence of events held in a hub. The partition count is set during the hub creation phase and cannot be changed.
117+
A *partition* is an ordered sequence of events held in a hub. The partition count is set during the hub creation phase and cannot be changed.
116118

117119
For Event Hubs partitioning best practices, review [How many partitions do I need?](https://docs.microsoft.com/azure/event-hubs/event-hubs-faq#how-many-partitions-do-i-need)
118120

@@ -127,7 +129,7 @@ Azure Time Series Insights Preview currently has a general **per partition limit
127129

128130
When a device is created in IoT Hub, it's permanently assigned to a partition. In doing so, IoT Hub is able to guarantee event ordering (since the assignment never changes).
129131

130-
A fixed partition assignment also impacts Time Series Insights instances that are ingesting data sent from IoT Hub downstream. When messages from multiple devices are forwarded to the hub using the same gateway device ID, they may arrive in the same partition at the same time potentially exceeding the per partition scale limits.
132+
A fixed partition assignment also impacts Time Series Insights instances that are ingesting data sent from IoT Hub downstream. When messages from multiple devices are forwarded to the hub using the same gateway device ID, they may arrive in the same partition at the same time potentially exceeding the per partition scale limits.
131133

132134
**Impact**:
133135

@@ -140,6 +142,7 @@ To mitigate that circumstance, we recommend the following best practices:
140142

141143
> [!IMPORTANT]
142144
> For environments using IoT Hub as an event source, calculate the ingestion rate using the number of hub devices in use to be sure that the rate falls below the 0.5 MBps per partition limitation in preview.
145+
>
143146
> * Even if several events arrive simultaneously, the Preview limit will not be exceeded.
144147
145148
![IoT Hub Partition Diagram](media/concepts-ingress-overview/iot-hub-partiton-diagram.png)
@@ -181,15 +184,15 @@ For a thorough description of Azure Blob storage, read the [Storage blobs introd
181184

182185
When you create an Azure Time Series Insights Preview PAYG environment, an Azure Storage general-purpose V1 blob account is created as your long-term cold store.
183186

184-
Azure Time Series Insights Preview retains up to two copies of each event in your Azure Storage account. One copy stores events ordered by ingestion time, always allowing access to events in a time-ordered sequence. Over time, Time Series Insights Preview also creates a repartitioned copy of the data to optimize for performant Time Series Insights query.
187+
Azure Time Series Insights Preview retains up to two copies of each event in your Azure Storage account. One copy stores events ordered by ingestion time, always allowing access to events in a time-ordered sequence. Over time, Time Series Insights Preview also creates a repartitioned copy of the data to optimize for performant Time Series Insights query.
185188

186189
During public Preview, data is stored indefinitely in your Azure Storage account.
187190

188191
#### Writing and editing Time Series Insights blobs
189192

190193
To ensure query performance and data availability, don't edit or delete any blobs that Time Series Insights Preview creates.
191194

192-
#### Accessing Time Series Insights Preview cold store data
195+
#### Accessing Time Series Insights Preview cold store data
193196

194197
In addition to accessing your data from the [Time Series Insights Preview explorer](./time-series-insights-update-explorer.md) and [Time Series Query](./time-series-insights-update-tsq.md), you may also want to access your data directly from the Parquet files stored in the cold store. For example, you can read, transform, and cleanse data in a Jupyter notebook, then use it to train your Azure Machine Learning model in the same Spark workflow.
195198

@@ -218,6 +221,7 @@ Time Series Insights Preview stores copies of your data as follows:
218221
In both cases, the time property of the Parquet file corresponds to blob creation time. Data in the `PT=Time` folder is preserved with no changes once it's written to the file. Data in the `PT=TsId` folder will be optimized for query over time and is not static.
219222

220223
> [!NOTE]
224+
>
221225
> * `<YYYY>` maps to a four-digit year representation.
222226
> * `<MM>` maps to a two-digit month representation.
223227
> * `<YYYYMMDDHHMMSSfff>` maps to a time-stamp representation with four-digit year (`YYYY`), two-digit month (`MM`), two-digit day (`DD`), two-digit hour (`HH`), two-digit minute (`MM`), two-digit second (`SS`), and three-digit millisecond (`fff`).

0 commit comments

Comments
 (0)