Skip to content

Commit 7bbdbc4

Browse files
authored
updated readme (#652)
1 parent 8b0cc68 commit 7bbdbc4

File tree

2 files changed

+217
-95
lines changed

2 files changed

+217
-95
lines changed

README.md

Lines changed: 217 additions & 95 deletions
Original file line numberDiff line numberDiff line change
@@ -1,13 +1,13 @@
11
<p align="center">
2-
<img width="60%" src="images/zoomcamp.jpg" alt="Machine Learning Zoomcamp">
2+
<img width="100%" src="images/ml-zoomcamp.png" alt="Machine Learning Zoomcamp">
33
</p>
44

55
<h1 align="center">
66
<strong>Machine Learning Zoomcamp: A Free 4-Month Course on ML Engineering</strong>
77
</h1>
88

99
<p align="center">
10-
Master the fundamentals of machine learning, from regression and classification to deployment and deep learning.
10+
Learn machine learning engineering from regression and classification to deployment and deep learning.
1111
</p>
1212

1313
<p align="center">
@@ -22,116 +22,238 @@ Master the fundamentals of machine learning, from regression and classification
2222
<a href="https://ctt.ac/XZ6b9">Tweet about the Course</a>
2323
</p>
2424

25-
## How to Enroll
26-
27-
### 2025 Cohort
28-
29-
- **Start Date**: September 15, 2025
30-
- **Register Here**: [Sign up](https://airtable.com/shryxwLd0COOEaqXo)
31-
- **Stay Updated**: Subscribe to our [Google Calendar](https://calendar.google.com/calendar/?cid=cGtjZ2tkbGc1OG9yb2lxa2Vwc2g4YXMzMmNAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ)
32-
33-
### Self-Paced Learning
34-
All course materials are freely available for independent study. Follow these steps:
35-
1. Watch the course videos and work through the code.
36-
2. Join the [Slack community](https://DataTalks.Club/slack.html) (`#course-ml-zoomcamp`).
37-
3. Ask questions in Slack or refer to the FAQ.
38-
4. Complete the homework assignments (solutions provided but attempt first).
39-
5. Work on at least one project for deeper learning.
40-
41-
## Syllabus Overview
42-
The course consists of structured modules covering the full ML pipeline, from fundamentals to advanced techniques.
43-
44-
### Prerequisites
45-
- Prior programming experience (at least 1+ year)
46-
- Comfort with command line basics
47-
- No prior ML knowledge required
48-
49-
### Modules
50-
51-
#### [Module 1: Introduction to Machine Learning](01-intro/)
52-
- ML vs Rule-Based Systems
53-
- Supervised Learning
54-
- CRISP-DM Framework
55-
- Model Selection Process
56-
- Environment Setup
57-
- Homework
58-
59-
#### [Module 2: Machine Learning for Regression](02-regression/)
60-
- Car Price Prediction Project
61-
- Exploratory Data Analysis
62-
- Linear Regression Basics
63-
- Feature Engineering & Regularization
64-
- Homework
65-
66-
#### [Module 3: Machine Learning for Classification](03-classification/)
67-
- Churn Prediction Project
68-
- Feature Selection & Encoding
69-
- Logistic Regression
70-
- Model Interpretation
71-
- Homework
72-
73-
#### [Module 4: Evaluation Metrics](04-evaluation/)
74-
- Accuracy, Precision, Recall
75-
- ROC Curves & AUC
76-
- Cross-Validation
77-
- Homework
78-
79-
#### [Module 5: Deploying ML Models](05-deployment/)
80-
- Saving & Loading Models
81-
- Flask API Deployment
82-
- Docker & Virtual Environments
83-
- Cloud Deployment (AWS)
84-
- Homework
85-
86-
#### [Module 6: Decision Trees & Ensemble Learning](06-trees/)
87-
- Decision Trees
88-
- Random Forest & Gradient Boosting
89-
- Model Selection & Hyperparameter Tuning
90-
- Homework
91-
92-
#### [Module 7: Neural Networks & Deep Learning](08-deep-learning/)
25+
26+
## Table of Contents
27+
- [How to Join](#how-to-join)
28+
- [What This Course Is About](#what-this-course-is-about)
29+
- [Prerequisites](#prerequisites)
30+
- [Syllabus](#syllabus)
31+
- [Community & Getting Help](#community--getting-help)
32+
- [Certificates](#certificates)
33+
- [Sponsors](#sponsors)
34+
- [About DataTalks.Club](#about-datatalksclub)
35+
36+
## How to Join
37+
38+
### 2025 Cohort (Live Format)
39+
40+
- **Starts**: September 15, 2025
41+
- **Duration**: 4 months
42+
- **Time commitment**: ~10 hours per week for coursework and projects
43+
- **What's included**:
44+
- Regular live office hours with instructors
45+
- Structured learning path with deadlines
46+
- Peer interaction and community support
47+
- Opportunity to earn a certificate
48+
- Access to all recorded sessions and office hours
49+
- **Register**: [Sign up here](https://airtable.com/shryxwLd0COOEaqXo)
50+
- **Calendar**: [Subscribe to updates](https://calendar.google.com/calendar/?cid=cGtjZ2tkbGc1OG9yb2lxa2Vwc2g4YXMzMmNAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ)
51+
52+
### Self-Paced Learning (Available Anytime)
53+
54+
All materials are freely available on GitHub. You can:
55+
- Watch videos on [YouTube](https://www.youtube.com/playlist?list=PL3MmuxUbc_hIhxl5Ji8t4O6lPAOpHaCLR)
56+
- Follow along with the syllabus below
57+
- Join our [Slack community](https://DataTalks.Club/slack.html) for help and discussion
58+
- Complete homework at your own pace (solutions included)
59+
- Work on projects to practice what you learn
60+
61+
**Note**: Self-paced learning gives you access to all course materials and recordings, but you need to join a live cohort to earn a certificate.
62+
63+
## What This Course Is About
64+
65+
This is a practical course where you'll learn to build and deploy machine learning systems. We focus on the engineering side from training models to getting them to work in production.
66+
67+
**You'll learn:**
68+
- Core ML algorithms and when to use them
69+
- How to prepare data and engineer features
70+
- Model evaluation and selection
71+
- Deploying models with Flask, Docker, and cloud platforms
72+
- Using Kubernetes for ML model serving
73+
- MLOps practices
74+
75+
**What makes this course different:**
76+
- **Hands-on approach**: Build real projects, not just follow tutorials
77+
- **Practical focus**: Heavily focused on implementation over mathematical theory
78+
- **End-to-end focus**: From data to deployment
79+
- **Community-driven**: Learn alongside others, get help when stuck
80+
- **Open source**: All materials on GitHub, contribute improvements
81+
- **Free**: No paywalls, no premium tiers
82+
83+
**Technical setup**: For machine learning modules, you only need a laptop with internet connection. For deep learning sections, we'll use cloud resources (like Saturn Cloud) for more intensive computations.
84+
85+
## Prerequisites
86+
87+
**You'll need:**
88+
- Some programming experience (1+ years, preferably Python)
89+
- Basic Python knowledge: variables, libraries, and Jupyter notebooks
90+
- Basic command line comfort
91+
- High school math
92+
93+
**Helpful but not required:**
94+
- Statistics background
95+
- Git/GitHub familiarity
96+
97+
No machine learning experience needed, we'll start from the basics.
98+
99+
## Syllabus
100+
101+
### [Module 1: Introduction to Machine Learning](01-intro/)
102+
103+
Learn the fundamentals: what ML is, when to use it, and how to approach ML problems using the CRISP-DM framework.
104+
105+
**Topics:**
106+
- ML vs rule-based systems
107+
- Supervised learning basics
108+
- CRISP-DM methodology
109+
- Model selection concepts
110+
- Environment setup
111+
112+
### [Module 2: Machine Learning for Regression](02-regression/)
113+
114+
Build a car price prediction model while learning linear regression, feature engineering, and regularization.
115+
116+
**Topics:**
117+
- Linear regression (from scratch and with scikit-learn)
118+
- Exploratory data analysis
119+
- Feature engineering
120+
- Regularization techniques
121+
- Model validation
122+
123+
### [Module 3: Machine Learning for Classification](03-classification/)
124+
125+
Create a customer churn prediction system using logistic regression and learn about feature selection.
126+
127+
**Topics:**
128+
- Logistic regression
129+
- Feature importance and selection
130+
- Categorical variable encoding
131+
- Model interpretation
132+
133+
### [Module 4: Evaluation Metrics for Classification](04-evaluation/)
134+
135+
Learn how to properly evaluate classification models and handle imbalanced datasets.
136+
137+
**Topics:**
138+
- Accuracy, precision, recall, F1-score
139+
- ROC curves and AUC
140+
- Cross-validation
141+
- Confusion matrices
142+
- Class imbalance handling
143+
144+
### [Module 5: Deploying Machine Learning Models](05-deployment/)
145+
146+
Turn your models into web services and deploy them with Docker and cloud platforms.
147+
148+
**Topics:**
149+
- Model serialization with Pickle
150+
- Flask web services
151+
- Docker containerization
152+
- Cloud deployment (AWS)
153+
154+
### [Module 6: Decision Trees & Ensemble Learning](06-trees/)
155+
156+
Learn tree-based models and ensemble methods for better predictions.
157+
158+
**Topics:**
159+
- Decision trees
160+
- Random Forest
161+
- Gradient boosting (XGBoost)
162+
- Hyperparameter tuning
163+
- Feature importance
164+
165+
### Midterm Project
166+
167+
Apply everything you've learned in a complete project: find a dataset, train models, and deploy a web service.
168+
169+
### [Module 7: Neural Networks & Deep Learning](08-deep-learning/)
170+
171+
Introduction to neural networks using TensorFlow and Keras, including CNNs and transfer learning.
172+
173+
**Topics:**
174+
- Neural network fundamentals
93175
- TensorFlow & Keras
94176
- Convolutional Neural Networks
95-
- Transfer Learning
96-
- Model Optimization & Regularization
97-
- Homework
177+
- Transfer learning
178+
- Model optimization
179+
180+
### [Module 8: Serverless Deep Learning](09-serverless/)
181+
182+
Deploy deep learning models using serverless technologies like AWS Lambda.
98183

99-
#### [Module 8: Serverless Deep Learning](09-serverless/)
100-
- Introduction to Serverless
101-
- AWS Lambda & TensorFlow Lite
184+
**Topics:**
185+
- Serverless concepts
186+
- AWS Lambda for ML
187+
- TensorFlow Lite
102188
- API Gateway
103-
- Homework
104189

105-
#### [Module 9: Kubernetes & TensorFlow Serving](10-kubernetes/)
106-
- TensorFlow Model Serving
107-
- Kubernetes Basics
108-
- Deploying ML Models to Kubernetes
109-
- Homework
190+
### [Module 9: Kubernetes & TensorFlow Serving](10-kubernetes/)
191+
192+
Learn to serve ML models at scale using Kubernetes and TensorFlow Serving.
193+
194+
**Topics:**
195+
- Kubernetes basics
196+
- TensorFlow Serving
197+
- Model deployment and scaling
198+
- Load balancing
199+
200+
### [Module 10: KServe (Optional)](11-kserve/)
110201

111-
#### [Capstone Projects](projects/)
112-
- Midterm & Final Projects integrating all learned concepts
202+
Advanced model serving with KServe for production ML systems.
113203

114-
## Community & Support
115204

116-
### **Getting Help on Slack**
117-
Join the [`#course-ml-zoomcamp`](https://app.slack.com/client/T01ATQK62F8/C0288NJ5XSA) channel on [DataTalks.Club Slack](https://DataTalks.Club/slack.html) for discussions, troubleshooting, and networking.
205+
### [Capstone Project](projects/)
118206

119-
To keep discussions organized:
120-
- Follow [our guidelines](asking-questions.md) when posting questions.
121-
- Review the [community guidelines](https://datatalks.club/slack/guidelines.html).
207+
Choose a problem that interests you, find a suitable dataset, and develop your model. Deploy your model into a web service (local deployment or cloud deployment for bonus points).
122208

123-
> We encourage [Learning in Public](learning-in-public.md)
209+
## Community & Getting Help
124210

125-
## Sponsors & Supporters
126-
A special thanks to our course sponsors for making this initiative possible!
211+
### Where to Get Help
212+
- **Slack**: [`#course-ml-zoomcamp`](https://app.slack.com/client/T01ATQK62F8/C0288NJ5XSA) channel
213+
- **FAQ**: [Common questions and answers](https://docs.google.com/document/d/1LpPanc33QJJ6BSsyxVg-pWNMplal84TdZtq10naIhD8)
214+
- **Office Hours**: Regular Q&A sessions
215+
- **Study Groups**: Connect with other learners
216+
217+
### Community Guidelines
218+
- Check the [FAQ](https://docs.google.com/document/d/1LpPanc33QJJ6BSsyxVg-pWNMplal84TdZtq10naIhD8) first
219+
- Follow our [question guidelines](asking-questions.md)
220+
- Be helpful and respectful
221+
- Share your learning journey
222+
223+
### Learning in Public
224+
We encourage sharing your progress! Write blog posts, create videos, post on social media with #mlzoomcamp. It helps you learn better and builds your professional network.
225+
226+
**Bonus**: You can earn extra points for sharing your learning experience publicly.
227+
228+
Learn more: [Learning in Public](learning-in-public.md)
229+
230+
## Certificates
231+
232+
To receive a certificate, you'll need to:
233+
234+
1. **Join a live cohort** (self-paced learners cannot earn certificates)
235+
2. **Complete 2 out of 3 projects**:
236+
- **Midterm Project**: Choose a problem that interests you, find a suitable dataset, and develop your model
237+
- **Capstone Project**: Complete either Capstone Project 1 or Capstone Project 2 (includes deploying a model as a web service)
238+
3. **Review 3 peers' projects** by the deadline
239+
240+
**Important**: Projects must be completed individually, and you can join after the course has started if you miss some homework deadlines.
241+
242+
<p align="center">
243+
<strong>Ready to start? <a href="https://airtable.com/shryxwLd0COOEaqXo">Join the 2025 cohort</a> or <a href="01-intro/">start with Module 1</a></strong>
244+
</p>
245+
246+
## Sponsors
247+
248+
Thanks to our sponsors who make this course possible:
127249

128250
<p align="center">
129251
<a href="https://saturncloud.io/">
130252
<img height="120" src="https://github.com/DataTalksClub/llm-zoomcamp/raw/main/images/saturn-cloud.png">
131253
</a>
132254
</p>
133255

134-
Interested in supporting our community? Reach out to [[email protected]](mailto:[email protected]).
256+
Interested in sponsoring? Contact [[email protected]](mailto:[email protected]).
135257

136258
## About DataTalks.Club
137259

@@ -157,5 +279,5 @@ Interested in supporting our community? Reach out to [[email protected]](mai
157279

158280
All the activity at DataTalks.Club mainly happens on [Slack](https://datatalks.club/slack.html). We post updates there and discuss different aspects of data, career questions, and more.
159281

160-
At DataTalksClub, we organize online events, community activities, and free courses. You can learn more about what we do at [DataTalksClub Community Navigation](https://www.notion.so/DataTalksClub-Community-Navigation-bf070ad27ba44bf6bbc9222082f0e5a8?pvs=21).
282+
161283

images/ml-zoomcamp.png

1.12 MB
Loading

0 commit comments

Comments
 (0)