DataTalks.Club FAQ Jekyll Site

This is a Jekyll site that contains frequently asked questions and answers from DataTalks.Club courses.

Structure

_questions/ - Individual question markdown files with Jekyll frontmatter
_layouts/ - Jekyll layout templates
images/ - Extracted images from the original FAQ documents
assets/css/ - Custom CSS styles
_config.yml - Jekyll configuration
index.md - Main landing page
[course].md - Course index pages

Quick Start

Using the Automation Scripts

Choose your preferred method:

PowerShell (Recommended for Windows):

# Initial setup
.\faq.ps1 setup

# Process FAQ documents and serve site
.\faq.ps1 dev

Batch file (Windows Command Prompt):

# Initial setup
faq setup

# Process FAQ documents and serve site
faq dev

Shell script (Linux/Mac/WSL):

# Initial setup
./faq.sh setup

# Process FAQ documents and serve site
./faq.sh dev

Makefile (Linux/Mac - has Windows compatibility issues):

# Initial setup
make setup

# Process FAQ documents and serve site
make dev

Available Commands

Command	Description
`setup`	Initial setup (install Python deps + Jekyll)
`process`	Process FAQ documents and generate Jekyll site
`serve`	Serve Jekyll site locally (http://localhost:4000)
`build`	Build Jekyll site for production
`dev`	Process + serve (development workflow)
`clean`	Clean generated files
`install`	Install Jekyll dependencies only
`stats`	Show site statistics

Manual Setup

If you prefer to run commands manually:

Install Jekyll and dependencies:

gem install jekyll bundler
bundle install

Process FAQ documents (with automatic cleanup):

# Clean questions directory first
python clean_questions.py

# Extract FAQ data
uv run python faq_processor.py

# Validate compatibility
python validate_questions.py

# Generate static site
python generate.py

Serve the site:
```
bundle exec jekyll serve
```
Open your browser to http://localhost:4000

Content

The site contains FAQ content from the following courses:

Data Engineering Zoomcamp
Machine Learning Zoomcamp
MLOps Zoomcamp

Each question is stored as an individual markdown file with Jekyll frontmatter containing:

question - The question text
section - The section/category the question belongs to
course - The course the question is from

Processing

The content was processed from the original Google Docs FAQ documents using the faq_processor.py script, which:

Downloads and caches DOCX files
Extracts embedded images
Converts content to individual Jekyll question files
Generates course index pages
Creates the Jekyll site structure

FAQ Processing Workflow

To ensure compatibility between the FAQ processor and static site generator:

Available Makefile Targets

make help              # Show available commands
make clean_questions   # Remove all files in _questions/ directory
make extract          # Clean _questions/ and extract FAQ data from Google Docs
make validate         # Validate all question files are compatible with generate.py
make website          # Generate static website from markdown files

Recommended Workflow

Clean and Extract: Always start by cleaning the questions directory to prevent leftover files:
```
make extract  # This automatically runs clean_questions first
```
Validate: Check that all generated files are compatible:
```
make validate
```
Generate Site: Create the static HTML site:
```
make website
```

Manual Workflow

If you prefer to run commands individually:

# 1. Clean questions directory
python clean_questions.py

# 2. Extract FAQ data
python faq_processor.py

# 3. Validate compatibility  
python validate_questions.py

# 4. Generate static site
python generate.py

File Compatibility

The FAQ processor now generates markdown files with properly formatted YAML frontmatter that includes:

id - Unique identifier for the question
question - The question text (properly quoted for YAML)
section - The section/category (properly quoted for YAML)
course - The course name
sort_order - Numerical sort order

All string values containing special characters (like colons) are automatically quoted to ensure YAML compatibility.

Images

Images are stored in /images/[course]/ directories and referenced using Jekyll's absolute path syntax (/images/...) for proper display.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github/workflows		.github/workflows
_layouts		_layouts
_questions		_questions
cache		cache
images		images
notebooks		notebooks
.gitignore		.gitignore
.python-version		.python-version
Makefile		Makefile
README.md		README.md
agents.md		agents.md
generate_website.py		generate_website.py
index.md		index.md
main.py		main.py
process_faq.py		process_faq.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DataTalks.Club FAQ Jekyll Site

Structure

Quick Start

Using the Automation Scripts

Available Commands

Manual Setup

Content

Processing

FAQ Processing Workflow

Available Makefile Targets

Recommended Workflow

Manual Workflow

File Compatibility

Images

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

DataTalksClub/faq

Folders and files

Latest commit

History

Repository files navigation

DataTalks.Club FAQ Jekyll Site

Structure

Quick Start

Using the Automation Scripts

Available Commands

Manual Setup

Content

Processing

FAQ Processing Workflow

Available Makefile Targets

Recommended Workflow

Manual Workflow

File Compatibility

Images

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages