Welcome to the Machine Learning Basics and Terminologies repository!
This repository is designed for beginners and enthusiasts who want to understand the fundamental concepts of Machine Learning, key differences between related fields, and essential terminologies.

- AI vs ML vs DL
- Traditional Programming vs Machine Learning
- Machine Learning Lifecycle
- Types of Machine Learning - Supervised, Unsupervised, Semi-Supervised, and Reinforcement Learning
- [Common Machine Learning Terminologies]
- Model Performance Metrics
- [Popular Machine Learning Algorithms]
- Artificial Intelligence (AI): A broad field aiming to create machines that can simulate human intelligence.
- Machine Learning (ML): A subset of AI that allows systems to learn and improve from experience.
- Deep Learning (DL): A subset of ML focused on neural networks with many layers (deep architectures).
🔁 AI ⊃ ML ⊃ DL
Machine Learning is the science of getting computers to learn patterns from data and make decisions or predictions. It is used in various domains such as healthcare, finance, marketing, and more.
| Traditional Programming | Machine Learning |
|---|---|
| Rules + Data → Output | Data + Output → Rules |
| Explicit logic written | Model learns patterns |
| Deterministic | Probabilistic |
- Problem Definition
- Data Collection
- Data Preprocessing
- Exploratory Data Analysis
- Model Selection
- Training
- Evaluation
- Deployment
- Monitoring and Maintenance
- Supervised Learning
- Unsupervised Learning
- Semi-Supervised Learning
- Reinforcement Learning
- Supervised Learning: Labeled data used (e.g., classification, regression)
- Unsupervised Learning: No labels (e.g., clustering, dimensionality reduction)
- Semi-Supervised Learning: Few labeled and many unlabeled data points
- Reinforcement Learning: Agent learns by interacting with an environment via rewards/punishments
- Regression: Predicts continuous values (e.g., house price)
- Classification: Predicts discrete labels (e.g., spam or not)
- Feature: Input variable
- Label: Output variable
- Model: Learned algorithm
- Training: Learning from data
- Testing: Evaluating performance
- Overfitting: Too complex, memorizes training data
- Underfitting: Too simple, misses patterns
- Bias & Variance: Errors due to assumptions vs fluctuations
- Accuracy
- Precision
- Recall
- F1-Score
- ROC-AUC
- Mean Absolute Error (MAE)
- Mean Squared Error (MSE)
- Root Mean Squared Error (RMSE)
- R² Score
- Linear Regression
- Logistic Regression
- Decision Trees
- Random Forest
- Support Vector Machines (SVM)
- K-Nearest Neighbors (KNN)
- Naïve Bayes
- Gradient Boosting (e.g., XGBoost, LightGBM)
- K-Means Clustering (Unsupervised)
- Principal Component Analysis (PCA)
📁 machine-learning-basics/
│
├── ai-vs-ml-vs-dl.md
├── introduction-to-ml.md
├── traditional-vs-ml.md
├── ml-lifecycle.md
├── types-of-ml.md
├── supervised-vs-unsupervised.md
├── regression-vs-classification.md
├── ml-terminology.md
├── performance-metrics.md
├── ml-algorithms.md
└── README.md
Pull requests are welcome! Feel free to improve the content or suggest better examples.
Feel free to connect via GitHub Issues for any questions or suggestions.
