Skip to content

Commit 0418738

Browse files
authored
Merge pull request #177539 from rboucher/patch-2
Make more scannable with tables
2 parents 14567f2 + 3e61478 commit 0418738

File tree

1 file changed

+40
-14
lines changed

1 file changed

+40
-14
lines changed

articles/azure-monitor/logs/data-ingestion-time.md

Lines changed: 40 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -12,64 +12,90 @@ ms.date: 07/18/2019
1212
Azure Monitor is a high scale data service that serves thousands of customers sending terabytes of data each month at a growing pace. There are often questions about the time it takes for log data to become available after it's collected. This article explains the different factors that affect this latency.
1313

1414
## Typical latency
15-
Latency refers to the time that data is created on the monitored system and the time that it comes available for analysis in Azure Monitor. The typical latency to ingest log data is between 20 sec and 3 minutes. However, the specific latency for any particular data will vary depending on a variety of factors explained below.
15+
Latency refers to the time that data is created on the monitored system and the time that it comes available for analysis in Azure Monitor. The typical latency to ingest log data is **between 20 seconds and 3 minutes**. However, the specific latency for any particular data will vary depending on a variety of factors explained below.
1616

1717

1818
## Factors affecting latency
1919
The total ingestion time for a particular set of data can be broken down into the following high-level areas.
2020

21-
- Agent time - The time to discover an event, collect it, and then send it to Azure Monitor Logs ingestion point as a log record. In most cases, this process is handled by an agent. Additional latency may be introduced by network.
22-
- Pipeline time - The time for the ingestion pipeline to process the log record. This includes parsing the properties of the event and potentially adding calculated information.
23-
- Indexing time The time spent to ingest a log record into Azure Monitor big data store.
21+
- **Agent time** - The time to discover an event, collect it, and then send it to Azure Monitor Logs ingestion point as a log record. In most cases, this process is handled by an agent. Additional latency may be introduced by the network.
22+
- **Pipeline time** - The time for the ingestion pipeline to process the log record. This includes parsing the properties of the event and potentially adding calculated information.
23+
- **Indexing time** - The time spent to ingest a log record into Azure Monitor big data store.
2424

2525
Details on the different latency introduced in this process are described below.
2626

2727
### Agent collection latency
28+
29+
**Time varies**
30+
2831
Agents and management solutions use different strategies to collect data from a virtual machine, which may affect the latency. Some specific examples include the following:
2932

30-
- Windows events, syslog events, and performance metrics are collected immediately. Linux performance counters are polled at 30-second intervals.
31-
- IIS logs and custom logs are collected once their timestamp changes. For IIS logs, this is influenced by the [rollover schedule configured on IIS](../agents/data-sources-iis-logs.md).
32-
- Active Directory Replication solution performs its assessment every five days, while the Active Directory Assessment solution performs a weekly assessment of your Active Directory infrastructure. The agent will collect these logs only when assessment is complete.
33+
| Type of data | Collection frequency | Notes |
34+
|:--------------|:----------------------|:------|
35+
| Windows events, syslog events, and performance metrics | collected immediately| |
36+
| Linux performance counters | polled at 30-second intervals| |
37+
| IIS logs and custom logs | collected once their timestamp changes | For IIS logs, this is influenced by the [rollover schedule configured on IIS](../agents/data-sources-iis-logs.md). |
38+
| Active Directory Replication solution | Assessment every five days | The agent collects these logs only when assessment is complete.|
39+
| Active Directory Assessment solution | weekly assessment of your Active Directory infrastructure | The agent collects these logs only when assessment is complete.|
3340

3441
### Agent upload frequency
42+
43+
**Under 1 minute**
44+
3545
To ensure the Log Analytics agent is lightweight, the agent buffers logs and periodically uploads them to Azure Monitor. Upload frequency varies between 30 seconds and 2 minutes depending on the type of data. Most data is uploaded in under 1 minute.
3646

3747
### Network
48+
49+
**Varies**
50+
3851
Network conditions may negatively affect the latency of this data to reach Azure Monitor Logs ingestion point.
3952

40-
### Azure activity logs, resource logs and metrics
53+
### Azure metrics, resource logs, activity log
54+
55+
**30 seconds to 15 minutes**
56+
4157
Azure data adds additional time to become available at Azure Monitor Logs ingestion point for processing:
4258

43-
- Resource logs typically add 30-90 seconds, depending on the Azure service. Some Azure services (specifically, Azure SQL Database and Azure Virtual Network) currently report their logs at 5 min intervals. Work is in progress to improve this further. See the [query below](#checking-ingestion-time) to examine this latency in your environment
44-
- Azure platform metrics take additional 3 minutes to be exported to Azure Monitor Logs ingestion point.
45-
- Activity log data may take additional 10-15 minutes, if legacy integration is used. We recommend to use subscription-level diagnostic settings to ingest Activity Logs into Azure Monitor Logs, which incurs additional latency of about 30 seconds.
59+
- **Azure platform metrics** are available in under a minute in the metrics database, but take additional 3 minutes to be exported to the Azure Monitor Logs ingestion point.
60+
- **Resource logs** typically add 30-90 seconds, depending on the Azure service. Some Azure services (specifically, Azure SQL Database and Azure Virtual Network) currently report their logs at 5 min intervals. Work is in progress to improve this further. See the [query below](#checking-ingestion-time) to examine this latency in your environment
61+
- **Activity log** data is ingested in 30 seconds when you use the recommended subscription-level diagnostic settings to send them into Azure Monitor Logs. However, they may take 10-15 minutes if you instead use the legacy integration.
4662

4763
### Management solutions collection
64+
65+
**Varies**
66+
4867
Some solutions do not collect their data from an agent and may use a collection method that introduces additional latency. Some solutions collect data at regular intervals without attempting near-real time collection. Specific examples include the following:
4968

5069
- Microsoft 365 solution polls activity logs using the Management Activity API, which currently does not provide any near-real time latency guarantees.
5170
- Windows Analytics solutions (Update Compliance for example) data is collected by the solution at a daily frequency.
5271

53-
Refer to the documentation for each solution to determine its collection frequency.
72+
Refer to the [documentation for each solution](/azure/azure-monitor/insights/solutions) to determine its collection frequency.
5473

5574
### Pipeline-process time
5675

76+
**30 to 60 seconds**
77+
5778
Once available at ingestion point, data takes additional 30-60 seconds to be available for querying.
5879

5980
Once log records are ingested into the Azure Monitor pipeline (as identified in the [_TimeReceived](./log-standard-columns.md#_timereceived) property), they're written to temporary storage to ensure tenant isolation and to make sure that data isn't lost. This process typically adds 5-15 seconds. Some management solutions implement heavier algorithms to aggregate data and derive insights as data is streaming in. For example, the Network Performance Monitoring aggregates incoming data over 3-minute intervals, effectively adding 3-minute latency. Another process that adds latency is the process that handles custom logs. In some cases, this process might add few minutes of latency to logs that are collected from files by the agent.
6081

6182
### New custom data types provisioning
83+
6284
When a new type of custom data is created from a [custom log](../agents/data-sources-custom-logs.md) or the [Data Collector API](../logs/data-collector-api.md), the system creates a dedicated storage container. This is a one-time overhead that occurs only on the first appearance of this data type.
6385

6486
### Surge protection
87+
88+
**Typically less than 1 minute but can be more**
89+
6590
The top priority of Azure Monitor is to ensure that no customer data is lost, so the system has built-in protection for data surges. This includes buffers to ensure that even under immense load, the system will keep functioning. Under normal load, these controls add less than a minute, but in extreme conditions and failures they could add significant time while ensuring data is safe.
6691

6792
### Indexing time
68-
There is a built-in balance for every big data platform between providing analytics and advanced search capabilities as opposed to providing immediate access to the data. Azure Monitor allows you to run powerful queries on billions of records and get results within a few seconds. This is made possible because the infrastructure transforms the data dramatically during its ingestion and stores it in unique compact structures. The system buffers the data until enough of it is available to create these structures. This must be completed before the log record appears in search results.
6993

70-
This process currently takes about 5 minutes when there is low volume of data but less time at higher data rates. This seems counterintuitive, but this process allows optimization of latency for high-volume production workloads.
94+
**5 minutes or less**
7195

96+
There is a built-in balance for every big data platform between providing analytics and advanced search capabilities as opposed to providing immediate access to the data. Azure Monitor allows you to run powerful queries on billions of records and get results within a few seconds. This is made possible because the infrastructure transforms the data dramatically during its ingestion and stores it in unique compact structures. The system buffers the data until enough of it is available to create these structures. This must be completed before the log record appears in search results.
7297

98+
This process currently takes about 5 minutes when there is low volume of data but less time at higher data rates. This seems counterintuitive, but this process allows optimization of latency for high-volume production workloads.
7399

74100
## Checking ingestion time
75101
Ingestion time may vary for different resources under different circumstances. You can use log queries to identify specific behavior of your environment. The following table specifies how you can determine the different times for a record as it's created and sent to Azure Monitor.

0 commit comments

Comments
 (0)