Machine Learning Studies

This repository organizes my machine learning studies, with a focus on practical applications. The goal is to learn how to solve real-world problems efficiently, prioritizing the techniques that have the most impact.

Study Structure

📌 ML + DL Study Roadmap

1. Data Understanding & Preparation ✅

Data exploration using pandas, matplotlib, and seaborn X
Cleaning: handling missing values, duplicates, and outliers X
Normalization and standardization X
Encoding categorical variables (one-hot, label encoding)
Train/validation/test split X
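The split and standardization steps above can be sketched in plain Python. This is a minimal illustration, not the code used in the notebooks: the `ages` column, the 70/15/15 split, and the helper names are all assumptions made for the example. The point it shows is that scaling statistics are learned from the training rows only and then reused for validation and test.

```python
import random
import statistics

def split_indices(n, val_frac=0.15, test_frac=0.15, seed=42):
    """Shuffle row indices once and cut them into train/validation/test."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    n_test = round(n * test_frac)
    n_val = round(n * val_frac)
    test = idx[:n_test]
    val = idx[n_test:n_test + n_val]
    train = idx[n_test + n_val:]
    return train, val, test

def fit_standardizer(values):
    """Learn mean and standard deviation from the training values only."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values) or 1.0  # fall back to 1.0 for constant columns
    return mean, std

def standardize(values, mean, std):
    """Apply z-score scaling with previously learned statistics."""
    return [(v - mean) / std for v in values]

# Hypothetical single feature column (illustrative values, not real data).
ages = [22, 38, 26, 35, 54, 2, 27, 14, 58, 20,
        39, 31, 45, 19, 28, 40, 66, 8, 33, 24]

train_idx, val_idx, test_idx = split_indices(len(ages))
mean, std = fit_standardizer([ages[i] for i in train_idx])

train_scaled = standardize([ages[i] for i in train_idx], mean, std)
val_scaled = standardize([ages[i] for i in val_idx], mean, std)    # reuse train statistics
test_scaled = standardize([ages[i] for i in test_idx], mean, std)  # reuse train statistics
```

In practice scikit-learn's `train_test_split` and `StandardScaler` cover the same ground; the hand-written version only makes the "fit on train, apply everywhere" order explicit.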

2. Core ML Models ✅

Logistic Regression — simple classification X
Random Forest — robust, good for tabular data X
XGBoost / LightGBM — strong performance on structured data X
👉 Compare baselines vs. ensembles using cross-validation
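To make the comparison idea concrete, here is a small sketch of k-fold cross-validation written without any ML library. The two "models" are deliberately simple stand-ins (a majority-class baseline and a one-feature threshold rule), not the Logistic Regression or tree ensembles listed above, and the data is synthetic. What matters is the pattern: every candidate is scored on the same folds, and the baseline sets the bar.

```python
import random

def k_fold_indices(n, k=5, seed=0):
    """Yield (train_idx, test_idx) pairs; each row lands in exactly one test fold."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i, test in enumerate(folds):
        train = [j for m, fold in enumerate(folds) if m != i for j in fold]
        yield train, test

class MajorityBaseline:
    """Always predicts the most frequent training label."""
    def fit(self, X, y):
        self.label = max(set(y), key=y.count)
        return self

    def predict(self, X):
        return [self.label] * len(X)

class ThresholdRule:
    """Stand-in for a real model: predicts 1 when x exceeds a learned cut-off."""
    def fit(self, X, y):
        self.threshold, best = None, -1.0
        for t in sorted(set(X)):
            acc = sum(int(x > t) == label for x, label in zip(X, y)) / len(y)
            if acc > best:
                self.threshold, best = t, acc
        return self

    def predict(self, X):
        return [int(x > self.threshold) for x in X]

def cross_val_accuracy(model_cls, X, y, k=5):
    """Mean accuracy over k folds, fitting a fresh model on each training part."""
    scores = []
    for train, test in k_fold_indices(len(X), k):
        model = model_cls().fit([X[i] for i in train], [y[i] for i in train])
        preds = model.predict([X[i] for i in test])
        scores.append(sum(p == y[i] for p, i in zip(preds, test)) / len(test))
    return sum(scores) / len(scores)

# Synthetic one-feature data: the label is 1 whenever x is above 6.
rng = random.Random(1)
X = [rng.uniform(0, 10) for _ in range(100)]
y = [int(x > 6) for x in X]

baseline_acc = cross_val_accuracy(MajorityBaseline, X, y)
rule_acc = cross_val_accuracy(ThresholdRule, X, y)
print(f"baseline: {baseline_acc:.2f}  threshold rule: {rule_acc:.2f}")
```

With scikit-learn, `cross_val_score` plays the role of `cross_val_accuracy`, and the stand-in classes would be replaced by the actual estimators.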

3. Model Evaluation ✅

Metrics: accuracy, precision, recall, F1-score, ROC-AUC X
Choose metrics according to the problem type X
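A short sketch of why the metric has to match the problem. The labels below are made up: 2 positives among 10 samples. A classifier that always predicts the negative class still reaches 80% accuracy while finding none of the positives, which only recall and F1 reveal. ROC-AUC is left out because it needs predicted scores rather than hard labels.

```python
def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (0 or 1)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    return tp, fp, fn, tn

def classification_metrics(y_true, y_pred):
    """Accuracy, precision, recall and F1; undefined ratios are reported as 0.0."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }

# Hypothetical imbalanced labels: only the last two samples are positive.
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
always_negative = [0] * 10                       # never predicts the positive class
one_false_alarm = [0, 0, 0, 0, 0, 0, 0, 1, 1, 1]  # finds both positives, one false positive

lazy = classification_metrics(y_true, always_negative)
useful = classification_metrics(y_true, one_false_alarm)
print(lazy)
print(useful)
```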

4. Avoiding Overfitting ✅

Techniques: regularization, cross-validation, early stopping
Focus: fit the training data well while still generalizing to unseen data
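Early stopping is the easiest of these techniques to show in isolation. The sketch below assumes a list of per-epoch validation losses is already available (the numbers are invented) and applies a patience rule: stop once the loss has failed to improve for `patience` consecutive epochs, and keep the weights from the best epoch. The function name and the `min_delta` parameter are illustrative choices.

```python
def early_stopping_epoch(val_losses, patience=3, min_delta=0.0):
    """Return (stop_epoch, best_epoch) as 0-based indices.

    Training stops at the first epoch that is `patience` epochs past the
    best one without an improvement larger than `min_delta`.
    """
    best_loss = float("inf")
    best_epoch = -1
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss - min_delta:
            best_loss = loss
            best_epoch = epoch
        elif epoch - best_epoch >= patience:
            return epoch, best_epoch
    # Never triggered: training ran through every available epoch.
    return len(val_losses) - 1, best_epoch

# Hypothetical validation curve: improves until epoch 3, then drifts upward.
val_losses = [0.90, 0.70, 0.55, 0.50, 0.52, 0.51, 0.53, 0.60, 0.65]

stop_epoch, best_epoch = early_stopping_epoch(val_losses, patience=3)
print(f"stop at epoch {stop_epoch}, restore weights from epoch {best_epoch}")
```

Here the rule stops at epoch 6 and points back to epoch 3, so the last two (worse) epochs are never trained.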

5. Hyperparameter Tuning

Tools: GridSearchCV, RandomizedSearchCV, Optuna
Apply to RandomForest / XGBoost / LightGBM ✅
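The difference between exhaustive and randomized search can be sketched without any of the tools above. In this example `validation_score` is a hypothetical stand-in for "train a model with these settings and return its validation score"; its formula and the grid values are invented so the example runs instantly.

```python
import itertools
import random

def validation_score(n_estimators, max_depth):
    """Hypothetical objective that peaks at n_estimators=200, max_depth=6."""
    return 1.0 - abs(n_estimators - 200) / 1000 - abs(max_depth - 6) / 20

param_grid = {
    "n_estimators": [50, 100, 200, 400],
    "max_depth": [2, 4, 6, 8],
}

def grid_search(grid, score_fn):
    """Evaluate every combination in the grid (16 evaluations here)."""
    names = list(grid)
    best_params, best_score = None, float("-inf")
    for values in itertools.product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        score = score_fn(**params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score

def random_search(grid, score_fn, n_iter=6, seed=0):
    """Evaluate only n_iter randomly sampled combinations."""
    rng = random.Random(seed)
    best_params, best_score = None, float("-inf")
    for _ in range(n_iter):
        params = {name: rng.choice(values) for name, values in grid.items()}
        score = score_fn(**params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score

grid_params, grid_score = grid_search(param_grid, validation_score)
rand_params, rand_score = random_search(param_grid, validation_score)
print("grid:  ", grid_params, grid_score)
print("random:", rand_params, rand_score)
```

Grid search is guaranteed to find the best point in the grid but its cost grows multiplicatively with each added hyperparameter; random search trades that guarantee for a fixed evaluation budget.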

6. Intro to Deep Learning (Concepts First)

Neurons, layers, activations, loss functions, optimizers ✅
Backpropagation (high-level understanding) ✅
Overfitting/underfitting in neural nets ✅
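The concepts in this section fit into a single neuron. The sketch below is a one-input sigmoid neuron trained with binary cross-entropy and plain gradient descent on six made-up points; the learning rate, epoch count, and data are assumptions picked for the illustration. The backward pass uses the fact that, for a sigmoid output with cross-entropy loss, the gradient with respect to the pre-activation is simply `p - y`.

```python
import math

def sigmoid(z):
    """Activation: squashes any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def bce_loss(y, p, eps=1e-12):
    """Binary cross-entropy for one sample; eps guards against log(0)."""
    return -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))

def train_neuron(data, lr=0.5, epochs=200):
    """Full-batch gradient descent on a single neuron with weight w and bias b."""
    w, b = 0.0, 0.0
    history = []
    n = len(data)
    for _ in range(epochs):
        grad_w = grad_b = total_loss = 0.0
        for x, y in data:
            p = sigmoid(w * x + b)   # forward pass
            total_loss += bce_loss(y, p)
            dz = p - y               # backward pass: dLoss/dz
            grad_w += dz * x         # chain rule: dz/dw = x
            grad_b += dz             # chain rule: dz/db = 1
        w -= lr * grad_w / n         # optimizer step (gradient descent)
        b -= lr * grad_b / n
        history.append(total_loss / n)
    return w, b, history

# Hypothetical data: negative inputs are class 0, positive inputs are class 1.
data = [(-2.0, 0), (-1.0, 0), (-0.5, 0), (0.5, 1), (1.0, 1), (2.0, 1)]

w, b, history = train_neuron(data)
predictions = [int(sigmoid(w * x + b) > 0.5) for x, _ in data]
print(f"w={w:.2f} b={b:.2f} first loss={history[0]:.3f} last loss={history[-1]:.3f}")
```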

7. TensorFlow & PyTorch 🚀

TensorFlow (Keras API)
- Building dense feedforward networks ✅
- Using callbacks (early stopping, checkpointing) ✅
- Training on structured/tabular data ✅
PyTorch
- Manual training loops vs. high-level APIs (PyTorch Lightning / fastai)
- Understanding tensors & autograd
- Implementing a basic neural net from scratch
👉 Project: train the same simple NN in both frameworks and compare coding styles

8. Text & Basic NLP

Vectorization: TF-IDF, word embeddings, BERT embeddings
Classification tasks (reviews, spam detection)
Try both approaches: TF-IDF with scikit-learn, and learned embeddings with PyTorch/TensorFlow
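TF-IDF is simple enough to compute by hand, which helps when reading what a vectorizer produces. The sketch below uses the textbook definition (term frequency times `log(N / df)`) on four invented messages. Note that scikit-learn's `TfidfVectorizer` applies a smoothed IDF and L2 normalization by default, so its numbers will not match these exactly.

```python
import math
from collections import Counter

def tfidf(docs):
    """Return (vocabulary, vectors) using tf * log(N / df) on whitespace tokens."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears at least once.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vocab = sorted(df)
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        length = len(tokens) or 1  # avoid dividing by zero for empty documents
        vectors.append([
            (counts[term] / length) * math.log(n_docs / df[term])
            for term in vocab
        ])
    return vocab, vectors

# Hypothetical mini-corpus: two spam-like and two ordinary messages.
docs = [
    "free prize click now",
    "meeting at noon",
    "click to claim free prize",
    "lunch meeting moved to noon",
]

vocab, vectors = tfidf(docs)
weight = {term: value for term, value in zip(vocab, vectors[0])}
print({term: round(value, 3) for term, value in weight.items() if value > 0})
```

In the first message, "now" gets a higher weight than "free" because "now" appears in only one document while "free" appears in two.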

9. Practical ML/DL Cycle

Collect and prepare data
Split into train/validation/test sets
Train ML or DL models
Evaluate and adjust
Save for deployment (pickle/joblib for ML, SavedModel/TorchScript for DL)
Serve via FastAPI/Flask
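The save-and-reload step of the cycle can be sketched with the standard library alone. The "model" below is a hypothetical dictionary of parameters for a one-feature logistic model (the values and the `hours_studied` name are invented); a real scikit-learn estimator would be pickled the same way. A serving endpoint would load the object once at startup and call the prediction function per request.

```python
import math
import pickle

# Hypothetical trained parameters; in a real project these come from training.
model = {"weight": 1.8, "bias": -0.4, "feature": "hours_studied"}

def predict_proba(model, x):
    """Probability of the positive class for a single feature value."""
    z = model["weight"] * x + model["bias"]
    return 1.0 / (1.0 + math.exp(-z))

# Serialize to bytes; writing to disk would use pickle.dump(model, file) instead.
blob = pickle.dumps(model)

# Later, e.g. when the API process starts: restore the model.
# Only unpickle data from a trusted source, since loading can execute code.
restored = pickle.loads(blob)

print(predict_proba(restored, 2.0))
```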

10. What to Avoid Initially

Very deep neural nets (CNNs/RNNs) before mastering the basics on tabular data
Reinforcement learning, GANs, diffusion models (too advanced for now)
Heavy MLOps tools (Kubeflow, Airflow)

📂 Next Repo Section:
/deep_learning_basics → notebooks comparing TensorFlow and PyTorch on the same problems.

Getting Started

Clone the repository and follow the notebooks and scripts in the /notebooks folder for exercises and mini-projects.

