From 91f35f02fba501a609c7712a329820f121227244 Mon Sep 17 00:00:00 2001 From: Haider Ali <42317018+AliHaider20@users.noreply.github.com> Date: Sun, 23 May 2021 11:31:25 +0530 Subject: [PATCH] Semi and supervised machine learning Added some more information on Supervised, Unsupervised and Semi-Supervised learning. --- README.md | 18 ++++++++++++++---- 1 file changed, 14 insertions(+), 4 deletions(-) diff --git a/README.md b/README.md index 001b511..817891f 100644 --- a/README.md +++ b/README.md @@ -17,9 +17,9 @@ In supervised learning we have a set of training data as an input and a set of l ### Regression -In regression problems we do real value predictions. Basically we try to draw a line/plane/n-dimensional plane along the training examples. +In regression problems we predict real/continuos values which are countably infinite. Basically we try to draw a line/plane/n-dimensional plane along the training examples. -_Usage examples: stock price forecast, sales analysis, dependency of any number, etc._ +_Usage examples: stock price forecast, sales analysis, dependency of any number, House price, Market Sales, Heart rate, etc._ #### 🤖 Linear Regression @@ -31,9 +31,14 @@ _Usage examples: stock price forecast, sales analysis, dependency of any number, ### Classification -In classification problems we split input examples by certain characteristic. +In classification problem the output variable (feature) is **categorical**, For example: Covid or No Covid, Spam or Not Spam. +There are many types of classification problems such as: +- Binary Classification - Binary classification refers to those classification tasks that have two class labels. +- Multi-Class Classification - Multi-class classification refers to those classification tasks that have more than two class labels. +- Multi-Label Classification - Multi-label classification refers to those classification tasks that have two or more class labels, where one or more class labels may be predicted for each example. +- Imbalanced Classification - Imbalanced Classification refers to classification tasks where one variable has more examples over other variables. -_Usage examples: spam-filters, language detection, finding similar documents, handwritten letters recognition, etc._ +_Usage examples: spam-filters, language detection, finding similar documents, handwritten letters recognition, Face (Male/ Female) Binary Classification, Emotion detection (Negative/ Positive), etc._ #### 🤖 Logistic Regression @@ -72,6 +77,11 @@ _Usage examples: intrusion detection, fraud detection, system health monitoring, - ⚙️ [Code | Anomaly Detection using Gaussian Distribution](homemade/anomaly_detection/gaussian_anomaly_detection.py) - implementation example - ▶️ [Demo | Anomaly Detection](https://nbviewer.jupyter.org/github/trekhleb/homemade-machine-learning/blob/master/notebooks/anomaly_detection/anomaly_detection_gaussian_demo.ipynb) - find anomalies in server operational parameters like `latency` and `threshold` +## Semi-Supervised Learning + +Semi-Supervised learning the dataset consists of target variable which has 50% values and 50% is empty (NaN). The percentage might differ according to datasets. + + ## Neural Network (NN) The neural network itself isn't an algorithm, but rather a framework for many different machine learning algorithms to work together and process complex data inputs.