Skip to content

Commit a8eae8d

Browse files
authored
Merge pull request #105917 from bwren/am-query-optimize
Azure Monitor updates to query optimization
2 parents bd1663a + e5caaf5 commit a8eae8d

File tree

1 file changed

+8
-3
lines changed

1 file changed

+8
-3
lines changed

articles/azure-monitor/log-query/query-optimization.md

Lines changed: 8 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -254,8 +254,13 @@ by Computer
254254
) on Computer
255255
```
256256

257+
The measurement is always larger than the actual time specified. For example, if the filter on the query is 7 days, the system might scan 7.5 or 8.1 days. This is because the system is partitioning the data into chunks in variable size. To assure that all relevant records are scanned, it scans the entire partition that might cover several hours and even more than a day.
258+
259+
There are several cases where the system cannot provide an accurate measurement of the time range. This happens in most of the cases where the query's span less than a day or in multi-workspace queries.
260+
261+
257262
> [!IMPORTANT]
258-
> This indicator isn't available for cross region queries.
263+
> This indicator presents only data processed in the immediate cluster. In multi-region query, it would represent only one of the regions. In multi-workspace query, it might not include all workspaces.
259264
260265
## Age of processed data
261266
Azure Data Explorer uses several storage tiers: in-memory, local SSD disks and much slower Azure Blobs. The newer the data, the higher is the chance that it is stored in a more performant tier with smaller latency, reducing the query duration and CPU. Other than the data itself, the system also has a cache for metadata. The older the data, the less chance its metadata will be in cache.
@@ -280,7 +285,7 @@ Cross-region query execution requires the system to serialize and transfer in th
280285
If there is no real reason to scan all these regions, you should adjust the scope so it covers fewer regions. If the resource scope is minimized but still many regions are used, it might happen due to misconfiguration. For example, audit logs and diagnostic settings are sent to different workspaces in different regions or there are multiple diagnostic settings configurations.
281286

282287
> [!IMPORTANT]
283-
> This indicator isn't available for cross region queries.
288+
> When a query is run across several regions, the CPU and data measurements will not be accurate and will represent the measurement only on one of the regions.
284289
285290
## Number of workspaces
286291
Workspaces are logical containers that are used to segregate and administer logs data. The backend optimizes workspace placements on physical clusters within the selected region.
@@ -296,7 +301,7 @@ Cross-region and cross-cluster execution of queries requires the system to seria
296301
> In some multi-workspace scenarios, the CPU and data measurements will not be accurate and will represent the measurement only to few of the workspaces.
297302
298303
## Parallelism
299-
Azure Monitor Logs is using large clusters of Azure Data Explorer to run queries, and these clusters vary in scale. The system automatically scales the clusters according to workspace placement logic and capacity.
304+
Azure Monitor Logs is using large clusters of Azure Data Explorer to run queries, and these clusters vary in scale, potentially getting up to dozens of compute nodes. The system automatically scales the clusters according to workspace placement logic and capacity.
300305

301306
To efficiently execute a query, it is partitioned and distributed to compute nodes based on the data that is required for its processing. There are some situations where the system cannot do this efficiently. This can lead to a long duration of the query.
302307

0 commit comments

Comments
 (0)