Commit 14d3781

Merge pull request #271882 from jmgallart77/patch-4
Update monitor-troubleshoot-ingestion-agent.md
2 parents 13efb4e + e70feff commit 14d3781

1 file changed: +12 -12 lines changed

articles/operator-insights/monitor-troubleshoot-ingestion-agent.md

Lines changed: 12 additions & 12 deletions
@@ -28,14 +28,13 @@ Metrics are reported in a simple human-friendly form.
 
 - For most of these troubleshooting techniques, you need an SSH connection to the VM running the agent.
 
-## Collect diagnostics
-
-Microsoft Support might request diagnostic packages when investigating an issue.
+## Ingestion agent diagnostics
 
 To collect a diagnostics package, SSH to the Virtual Machine and run the command `/usr/bin/microsoft/az-aoi-ingestion-gather-diags`. This command generates a date-stamped zip file in the current directory that you can copy from the system.
 
 > [!NOTE]
-> Diagnostics packages don't contain any customer data or the value of any credentials.
+> Microsoft Support might request diagnostics packages when investigating an issue. Diagnostics packages don't contain any customer data or the value of any credentials.
+
 
 ## Problems common to all sources
 
@@ -84,16 +83,16 @@ Symptoms: MCC reports alarms about MSFs being unavailable.
 Symptoms: no data appears in Azure Data Explorer.
 
 - Check that the MCC is healthy and ingestion agents are running.
-- Check the logs from the ingestion agent for errors uploading to Azure. If the logs point to an invalid connection string, or connectivity issues, fix the configuration, connection string, or SAS token, and restart the agent.
+- Check the ingestion agent logs in the diagnostics package for errors uploading to Azure. If the logs point to an invalid connection string, or connectivity issues, fix the configuration, connection string, or SAS token, and restart the agent.
 - Check the network connectivity and firewall configuration on the storage account.
 
 ### Data missing or incomplete
 
 Symptoms: Azure Monitor shows a lower incoming EDR rate in ADX than expected.
 
-- Check that the agent is running on all VMs and isn't reporting errors in logs.
+- Check that the agent is running on all VMs and isn't reporting errors in the diagnostics package logs.
 - Verify that the agent VMs aren't being sent more than the rated load.
-- Check agent metrics for dropped bytes/dropped EDRs. If the metrics don't show any dropped data, then MCC isn't sending the data to the agent. Check the "received bytes" metrics to see how much data is being received from MCC.
+- Check agent metrics in the diagnostics package for dropped bytes/dropped EDRs. If the metrics don't show any dropped data, then MCC isn't sending the data to the agent. Check the "received bytes" metrics to see how much data is being received from MCC.
 - Check that the agent VM isn't overloaded – monitor CPU and memory usage. In particular, ensure no other process is taking resources from the VM.
 
 ## Problems with the SFTP pull source
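
Reviewer note on the hunk above: several of the changed bullets now point readers at the logs inside the diagnostics package, which the article describes as a date-stamped zip file. One way to search such a package without unpacking it might look like the sketch below. It is only an illustration: the function name `search_diagnostics` is hypothetical, and because the article does not describe the package's internal layout or log format, the sketch simply scans every member as text.

```python
import io
import zipfile


def search_diagnostics(package, needle):
    """Return (member, line_number, line) for each line containing `needle`.

    `package` is a path or file-like object for the diagnostics zip.
    Assumption: the package layout is unknown, so every member is scanned
    as UTF-8 text and undecodable bytes are replaced rather than raising.
    The match is a case-insensitive substring test.
    """
    wanted = needle.lower()
    hits = []
    with zipfile.ZipFile(package) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            with archive.open(info) as raw:
                text = io.TextIOWrapper(raw, encoding="utf-8", errors="replace")
                for number, line in enumerate(text, start=1):
                    if wanted in line.lower():
                        hits.append((info.filename, number, line.rstrip("\n")))
    return hits
```

Calling `search_diagnostics(path_to_zip, "error")` would list candidate error lines, and passing the name of a missing file instead would narrow the results to lines that mention that file.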
@@ -114,31 +113,32 @@ Symptoms: No files are uploaded to AOI. The agent log file, */var/log/az-aoi-ing
 
 ### No files are uploaded to Azure Operator Insights
 
-Symptoms: No data appears in Azure Data Explorer. The AOI *Data Ingested* metric for the relevant data type is zero.
+Symptoms: No data appears in Azure Data Explorer. Logs of category `Ingestion` don't appear in [Azure Operator Insights monitoring data](monitor-operator-insights-data-reference.md#resource-logs) or they contain errors. The [Number of ingested rows](concept-data-quality-monitoring.md#metrics) data quality metric for the relevant data type is zero.
 
 - Check that the agent is running on all VMs and isn't reporting errors in logs.
 - Check that files exist in the correct location on the SFTP server, and that they aren't being excluded due to file source config (see [Files are missing](#files-are-missing)).
 - Check the network connectivity and firewall configuration between the ingestion agent VM and the Data Product's input storage account.
 
 ### Files are missing
 
-Symptoms: Data is missing from Azure Data Explorer. The AOI *Data Ingested* and *Processed File Count* metrics for the relevant data type are lower than expected.
+Symptoms: Data is missing from Azure Data Explorer. Logs of category `Ingestion` in [Azure Operator Insights monitoring data](monitor-operator-insights-data-reference.md#resource-logs) are lower than expected or they contain errors. The [Number of ingested rows](concept-data-quality-monitoring.md#metrics) data quality metric for the relevant data type is lower than expected.
+
 
-- Check that the agent is running on all VMs and isn't reporting errors in logs. Search the logs for the name of the missing file to find errors related to that file.
+- Check that the agent is running on all VMs and isn't reporting errors in logs. Search in the diagnostics package logs for the name of the missing file to find errors related to that file.
 - Check that the files exist on the SFTP server and that they aren't being excluded due to file source config. Check the file source config and confirm that:
     - The files exist on the SFTP server under the path defined in `base_path`. Ensure that there are no symbolic links in the file paths of the files to upload: the ingestion agent ignores symbolic links.
     - The "last modified" time of the files is at least `settling_time` seconds earlier than the time of the most recent upload run for this file source.
     - The "last modified" time of the files is later than `exclude_before_time` (if specified).
     - The file path relative to `base_path` matches the regular expression given by `include_pattern` (if specified).
     - The file path relative to `base_path` *doesn't* match the regular expression given by `exclude_pattern` (if specified).
-- If recent files are missing, check the agent logs to confirm that the ingestion agent performed an upload run for the source at the expected time. The `cron` parameter in the source config gives the expected schedule.
+- If recent files are missing, check the agent logs in the diagnostics package to confirm that the ingestion agent performed an upload run for the source at the expected time. The `cron` parameter in the source config gives the expected schedule.
 - Check that the agent VM isn't overloaded – monitor CPU and memory usage. In particular, ensure no other process is taking resources from the VM.
 
 ### Files are uploaded more than once
 
 Symptoms: Duplicate data appears in Azure Operator Insights.
 
-- Check whether the ingestion agent encountered a retryable error on a previous upload and then retried that upload more than 24 hours after the last successful upload. In that case, the agent might upload duplicate data during the retry attempt. The duplication of data should affect only the retry attempt.
+- Check whether the ingestion agent encountered a retryable error in the diagnostics package log on a previous upload and then retried that upload more than 24 hours after the last successful upload. In that case, the agent might upload duplicate data during the retry attempt. The duplication of data should affect only the retry attempt.
 - Check that the file sources defined in the config file refer to nonoverlapping sets of files. If multiple file sources are configured to pull files from the same location on the SFTP server, use the `include_pattern` and `exclude_pattern` config fields to specify distinct sets of files that each file source should consider.
 - If you're running multiple instances of the SFTP ingestion agent, check that the file sources configured for each agent don't overlap with file sources on any other agent. In particular, look out for file source config that was accidentally copied from another agent's config.
 - If you recently changed the pipeline `id` for a configured file source, use the `exclude_before_time` field to avoid files being reuploaded with the new pipeline `id`. For instructions, see [Change configuration for ingestion agents for Azure Operator Insights](change-ingestion-agent-configuration.md).
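
Reviewer note on the SFTP pull hunk: the file source checks listed under "Files are missing" (`base_path`, `settling_time`, `exclude_before_time`, `include_pattern`, `exclude_pattern`) can be reasoned about offline with a small model like the one below. This is a sketch under stated assumptions and is not the agent's implementation: `FileSource`, `skip_reason`, and `claimed_by` are hypothetical names, the patterns are applied with search semantics (`re.search`) because the article does not say whether a full match is required, and the caller supplies the symbolic-link flag.

```python
import re
from dataclasses import dataclass
from datetime import datetime, timedelta
from pathlib import PurePosixPath
from typing import Optional


@dataclass
class FileSource:
    """Hypothetical model of the file source fields named in the article."""
    base_path: str
    settling_time: int = 0                          # seconds
    exclude_before_time: Optional[datetime] = None
    include_pattern: Optional[str] = None
    exclude_pattern: Optional[str] = None


def skip_reason(source, path, modified, run_time, is_symlink=False):
    """Return why `path` would be skipped, or None if it passes every check."""
    if is_symlink:
        return "symbolic link"
    try:
        relative = str(PurePosixPath(path).relative_to(source.base_path))
    except ValueError:
        return "outside base_path"
    # Must be at least settling_time seconds older than the upload run.
    if modified > run_time - timedelta(seconds=source.settling_time):
        return "modified within settling_time"
    # Must be strictly later than exclude_before_time, when it is set.
    if source.exclude_before_time is not None and modified <= source.exclude_before_time:
        return "not later than exclude_before_time"
    # Assumption: search semantics; the agent may anchor patterns differently.
    if source.include_pattern is not None and not re.search(source.include_pattern, relative):
        return "does not match include_pattern"
    if source.exclude_pattern is not None and re.search(source.exclude_pattern, relative):
        return "matches exclude_pattern"
    return None


def claimed_by(sources, path, modified, run_time):
    """Names of the file sources (a dict of name -> FileSource) that accept `path`."""
    return [name for name, src in sources.items()
            if skip_reason(src, path, modified, run_time) is None]
```

If `claimed_by` returns more than one name for the same path, those file sources overlap, which is the condition the "Files are uploaded more than once" bullets warn about.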
