Skip to content

Commit d744963

Browse files
Document audit telemetry (#4790)
* Document audit telemetry * Update docs/telemetry.md Co-authored-by: Ramon Smits <[email protected]> * Add otel collection example --------- Co-authored-by: Ramon Smits <[email protected]>
1 parent b8c6fbf commit d744963

File tree

1 file changed

+42
-0
lines changed

1 file changed

+42
-0
lines changed

docs/telemetry.md

Lines changed: 42 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,42 @@
1+
# Telemetry
2+
3+
Instances can be configured to emit telemetry to aid in performance testing or troubleshooting performance-related issues.
4+
5+
## Error
6+
7+
Setting `ServiceControl/PrintMetrics` to `true` will print metrics to the logs at `INFO` level.
8+
9+
## Audit
10+
11+
Set `ServiceControl.Audit/OtlpEndpointUrl` to a valid [OTLP endpoint url](https://opentelemetry.io/docs/specs/otel/protocol/exporter/#configuration-options). Only GRPC endpoints are supported at this stage.
12+
13+
It's recommended to use a local [OTEL Collector](https://opentelemetry.io/docs/collector/) to collect, batch and export the metrics to the relevant observability backend being used.
14+
15+
Example configuration: https://github.com/andreasohlund/Docker/tree/main/otel-monitoring
16+
17+
The following metrics are available:
18+
19+
### Ingestion
20+
21+
- `sc.audit.ingestion.count` - Successful ingested audit message count
22+
- `sc.audit.ingestion.retry` - Retried audit message count
23+
- `sc.audit.ingestion.failed` - Failed audit message count
24+
- `sc.audit.ingestion.duration` - Audit message processing duration (in milliseconds)
25+
- `sc.audit.ingestion.message_size` - Audit message body size (in kilobytes)
26+
- `sc.audit.ingestion.forwarded_count` - Forwarded audit messages count
27+
28+
### Batching
29+
30+
- `sc.audit.ingestion.batch_duration` - Batch processing duration (in milliseconds)
31+
- `sc.audit.ingestion.batch_size` - Batch size (number of messages)
32+
- `sc.audit.ingestion.consecutive_batch_failures` - Consecutive batch failures
33+
34+
### Storage
35+
36+
- `sc.audit.ingestion.audits_count` - Stored audit message count
37+
- `sc.audit.ingestion.sagas_count` - Stored sagas message count
38+
- `sc.audit.ingestion.commit_duration` - Storage unit of work commit duration (in milliseconds)
39+
40+
## Monitoring
41+
42+
No telemetry is currently available.

0 commit comments

Comments
 (0)