You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Jun 7, 2023. It is now read-only.
Copy file name to clipboardExpand all lines: README.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,5 +1,5 @@
1
1
# Anomaly Detection in Prometheus Metrics
2
-
This repository contains the prototype for a Prometheus Anomaly Detector (PAD) which can be deployed on OpenShift. The PAD is a framework to deploy a metric prediction model to detect anomalies in prometheus metrics.
2
+
This repository contains the prototype for a Prometheus Anomaly Detector (PAD) which can be deployed on OpenShift. The PAD is a framework to deploy a metric prediction model to detect anomalies in prometheus metrics.
3
3
4
4
Prometheus is the chosen application to do monitoring across multiple products and platforms. Prometheus metrics are time series data identified by metric name and key/value pairs. With the increased amount of metrics flowing in it is getting harder to see the signals within the noise. The current state of the art is to graph out metrics on dashboards and alert on thresholds. This application leverages machine learning algorithms such as Fourier and Prophet models to perform time series forecasting and predict anomalous behavior in the metrics. The predicted values are compared with the actual values and if they differ from the default threshold values, it is flagged as an anomaly.
5
5
@@ -36,7 +36,7 @@ make run_app
36
36
## Using the pre-built Container Image
37
37
* We have a pre-built container image available that you can use to deploy the Prometheus Anomaly Detector.
38
38
* The image is hosted at: `quay.io/aicoe/prometheus-anomaly-detector:latest`
39
-
* Example command: *(make sure port 8080 is available)*
39
+
* Example command: *(make sure port 8080 is exposed - this is where /metrics will be available.)*
40
40
```
41
41
docker run --name pad -p 8080:8080 --network host \
@@ -59,14 +59,14 @@ The current setup is as follows:
59
59
- Prophet[https://facebook.github.io/prophet/] - Procedure developed by Facebook for forecasting time series data based on an additive model where non-linear trends are fit with yearly, weekly, and daily seasonality, plus holiday effects. It works best with time series that have strong seasonal effects and several seasons of historical data. The following are the forecasted values:
60
60
- `yhat` - Predicted time series value
61
61
- `yhat_lower` - Lower bound of uncertainity interval
62
-
- `yhat_upper` - Upper bound of uncertainity interval
62
+
- `yhat_upper` - Upper bound of uncertainity interval
63
63
- **Visualization** - Grafana dashboards are created to visualize the predicted metrics
64
64
- **Alerts** - Prometheus alerts are configured based on predicted metric values
65
65
66
66

67
67
68
68
# Model Testing
69
-
For a given timeframe of a metric, with known anomalies, the PAD can be run in `test-mode` to check whether the models reported back these anomalies. The accuracy and performance of the models can then be logged as metrics to MLFlow for comparing the results.
69
+
For a given timeframe of a metric, with known anomalies, the PAD can be run in `test-mode` to check whether the models reported back these anomalies. The accuracy and performance of the models can then be logged as metrics to MLFlow for comparing the results.
70
70
71
71
MLflow is an open source platform to manage the ML lifecycle, including experimentation, reproducibility and deployment. It currently offers three components:
72
72

0 commit comments