Streaming Metric Anomaly Detection

What is it?

This project serves to conduct various forms of outlier detection on streaming metrics, using Apache Kafka and Spark as the predominant methods of metric storage and analysis.

Quick Start

Dependencies

pip3 install kafka-python
pip3 install pyspark
pip3 install matplotlib
pip3 install pandas
pip3 install scikit-learn
pip3 install numpy
pip3 install psutil

Clone the repo

git clone https://github.com/SiddharthRajaraman/streamingMetricsAnomolyDetection.git

Deploy Zookeeper and Kafka to Kubernetes Cluster

kubectl apply -f kafkaConfig/zookeeper.yaml
kubectl apply -f kafkaConfig/kafkaBroker.yaml

Expose port for Kafka

kubectl port-forward <NAME OF KAFKA-BROKER POD> 9092

Run producer

Kafka Producer, by default, sends local CPU metrics every .5 seconds

python3 producerFiles/producer.py

Run Consumer

DBSCAN

python3 consumerFiles/consumerDBSCAN.py

Kmeans

python3 consumerFiles/consumerKmeans.py

Quartile

python3 consumerFiles/consumerQuartile.py

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
consumerFiles		consumerFiles
kafkaConfig		kafkaConfig
producerFiles		producerFiles
README.md		README.md
admin.py		admin.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Streaming Metric Anomaly Detection

What is it?

Quick Start

Dependencies

Clone the repo

Deploy Zookeeper and Kafka to Kubernetes Cluster

Expose port for Kafka

Run producer

Run Consumer

DBSCAN

Kmeans

Quartile

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Streaming Metric Anomaly Detection

What is it?

Quick Start

Dependencies

Clone the repo

Deploy Zookeeper and Kafka to Kubernetes Cluster

Expose port for Kafka

Run producer

Run Consumer

DBSCAN

Kmeans

Quartile

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages