Skip to content

Commit e08a725

Browse files
authored
Merge pull request #76 from DataTalksClub/podcast-improvements-seo
## All podcasts that had transcripts now have This update improves SEO by making each podcast page more informative and relevant to search engines. Updated 190+ podcast markdown files: * **Updated `titles` and timestamps** make content clearer and more structured, helping both users and crawlers understand what each episode covers. * **`Descriptions`** enhance how pages appear in search results, increasing the chance of clicks. * **`Intro` text** adds context and keyword-rich content, improving topical relevance and readability. Added 190+ timestamp files in `podcast-timestamps/` Together, these changes strengthen on-page SEO and help podcast episodes rank better for specific topics and guest names. Addresses many features described here: #61 ## Added schema markup for podcast pages Schema markup helps search engines understand the content of each podcast page more precisely. By providing structured data about videos, audio, and episode details, we make it easier for Google to display rich snippets, video previews, and podcast listings in search results. This improves visibility, click-through rates, and discoverability across platforms like Google Search and Google Podcasts. Elements include: - VideoObject - BreadcrumbList - AudioObject - PodcastEpisode Includes author/guest info, platform links (YouTube, Apple, Spotify, Anchor), and action metadata. These comply with: - https://schema.org/ - https://developers.google.com/search/docs/appearance/structured-data/search-gallery Each individual page can be checked using these validators by either copying and pasting the URL of the page or its source code: - https://validator.schema.org/ - https://developers.google.com/search/docs/appearance/structured-data ## New front matter fields for podcast pages required for schema markup These new front matter fields improve both **structured data accuracy** and **content discoverability** in search results. * **`dateadded`** helps search engines identify when the episode was first published, supporting features like “new podcast” highlights and improving freshness signals for SEO. * **`duration`** provides the exact length of each episode, making podcast previews more informative and eligible for enhanced display formats. * **`quotableClips`** adds structured snippets linking to key moments in the episode, increasing engagement and enabling deep linking to specific topics or quotes. By generating and maintaining these fields automatically through scripts, we ensure all podcast pages are complete, consistent, and optimized for rich results. ### `dateadded` - date when the podcast was uploaded to the website How to add it: Created a script to add the `dateadded` field to each podcast page front matter based on the Git log ### `duration` - a calculated duration of the podcast recording based on the transcript How to add it: Created a script to calculate the duration of the podcast and add it to the frontmatter of each podcast. ### `quotableClips` - a list of structured snippets linking to key moments in the episode, increasing engagement and enabling deep linking to specific topics or quotes How to add: Created a script to add episode clips of the podcast and add it to the frontmatter of each podcast. Each clip in the quotableClips array contains: - name: The timestamp title/description (e.g., "Podcast Introduction: guest Antonis and episode themes") - startOffset: Start time in seconds (e.g., 0, 148) - url: YouTube URL with time parameter (e.g., https://www.youtube.com/watch?v=-Gj7SaI-QW4&t=148) - endOffset: Start time in seconds (e.g., 0, 148)
2 parents 1b913f3 + fc3a5f2 commit e08a725

File tree

363 files changed

+62737
-33703
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

363 files changed

+62737
-33703
lines changed

_layouts/podcast.html

Lines changed: 360 additions & 37 deletions
Large diffs are not rendered by default.

_podcast/s01e01-roles.md

Lines changed: 9 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,24 +1,24 @@
11
---
2-
title: "Data Team Roles Explained"
3-
short: "Roles in a Data Team"
4-
guests: [alexeygrigorev]
5-
2+
title: 'Data Team Roles Explained: Skills, Responsibilities, and How Teams Ship ML
3+
Products'
4+
short: Roles in a Data Team
5+
guests:
6+
- alexeygrigorev
67
image: images/podcast/s01e01-roles.jpg
7-
8-
keywords: "data team roles, data scientist, data engineer, machine learning engineer, data analyst, MLOps engineer, product manager, data team structure, data science roles, ML engineer vs data engineer, data team responsibilities, data science career"
9-
8+
keywords: data team roles, data scientist, data engineer, machine learning engineer,
9+
data analyst, MLOps engineer, product manager, data team structure, data science
10+
roles, ML engineer vs data engineer, data team responsibilities, data science career
1011
season: 1
1112
episode: 1
12-
1313
ids:
1414
youtube: UukjwSIAnpw
1515
anchor: Roles-in-a-data-team---Alexey-Grigorev-emqcft
16-
1716
links:
1817
youtube: https://www.youtube.com/watch?v=UukjwSIAnpw
1918
anchor: https://anchor.fm/datatalksclub/episodes/Roles-in-a-data-team---Alexey-Grigorev-emqcft
2019
spotify: TODO
2120
apple: TODO
21+
dateadded: '2021-02-23'
2222
---
2323

2424
The topic today is the roles in data teams. We want to understand what kind of people work in the data team, what responsibilities they have, what they do, and what they need to know.

_podcast/s01e02-processes.md

Lines changed: 13 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -1,26 +1,28 @@
11
---
2-
title: "CRISP-DM Methodology: Complete Guide to Data Science Project Process"
3-
short: "Processes in a Data Science Project"
4-
guests: [alexeygrigorev]
5-
2+
title: 'CRISP-DM Methodology for Data Science Projects: Business Understanding, Data
3+
Preparation, Modeling, Evaluation & Deployment'
4+
short: Processes in a Data Science Project
5+
guests:
6+
- alexeygrigorev
67
image: images/podcast/s01e02-processes.jpg
7-
8-
description: "Learn the CRISP-DM methodology for managing data science projects. Step-by-step guide covering business understanding, data preparation, modeling, evaluation, and deployment."
9-
10-
keywords: "CRISP-DM, data science process, machine learning methodology, data science project management, ML project lifecycle, data science workflow, A/B testing, model deployment, data science best practices, ML model evaluation, cross-functional data teams"
11-
8+
description: Learn the CRISP-DM methodology for managing data science projects. Step-by-step
9+
guide covering business understanding, data preparation, modeling, evaluation, and
10+
deployment.
11+
keywords: CRISP-DM, data science process, machine learning methodology, data science
12+
project management, ML project lifecycle, data science workflow, A/B testing, model
13+
deployment, data science best practices, ML model evaluation, cross-functional data
14+
teams
1215
season: 1
1316
episode: 2
14-
1517
ids:
1618
youtube: SesVTDklFYQ
1719
anchor: Processes-in-a-Data-Science-Project---Alexey-Grigorev-encdlg
18-
1920
links:
2021
youtube: https://www.youtube.com/watch?v=SesVTDklFYQ
2122
anchor: https://anchor.fm/datatalksclub/episodes/Processes-in-a-Data-Science-Project---Alexey-Grigorev-encdlg
2223
spotify: TODO
2324
apple: TODO
25+
dateadded: '2021-02-23'
2426
---
2527

2628
The topic today is the processes in a data science project. We want to understand how cross-functional teams work together to ship real value. We'll use a concrete example (auto-categorizing marketplace listings) and walk through CRISP-DM step by step.

_podcast/s01e03-building-ds-team.md

Lines changed: 632 additions & 505 deletions
Large diffs are not rendered by default.

_podcast/s01e04-standing-out-as-a-data-scientist.md

Lines changed: 777 additions & 665 deletions
Large diffs are not rendered by default.

_podcast/s01e05-mentoring.md

Lines changed: 115 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -1,26 +1,130 @@
11
---
2-
title: "How to Find a Mentor and Advance Your Tech Career"
3-
short: "Mentoring"
4-
guests: [rahuljain]
5-
2+
title: 'How to Find a Mentor and Become One: Mentoring Strategies for Tech Careers'
3+
short: Mentoring
4+
guests:
5+
- rahuljain
66
image: images/podcast/s01e05-mentoring.jpg
7-
8-
description: "Learn how to find the right mentor, become an effective mentor yourself, and advance your tech career. Expert advice on overcoming imposter syndrome, cold outreach strategies, and balancing technical work with leadership roles."
9-
10-
keywords: "mentoring, career development, tech mentorship, finding a mentor, becoming a mentor, imposter syndrome, tech leadership, career advice, professional development, data engineering"
11-
7+
description: 'Discover practical mentoring strategies for tech careers: find mentors,
8+
master cold outreach, run effective sessions, start paid mentorship & boost leadership.'
9+
keywords: mentoring, career development, tech mentorship, finding a mentor, becoming
10+
a mentor, imposter syndrome, tech leadership, career advice, professional development,
11+
data engineering
1212
season: 1
1313
episode: 5
14-
1514
ids:
1615
youtube: LQvwTNQbPg4
1716
anchor: Mentoring---Rahul-Jain-eo7cmu
18-
1917
links:
2018
youtube: https://www.youtube.com/watch?v=LQvwTNQbPg4
2119
anchor: https://anchor.fm/datatalksclub/episodes/Mentoring---Rahul-Jain-eo7cmu
2220
spotify: TODO
2321
apple: TODO
22+
intro: 'Struggling to find a mentor — or wondering how to become one — in a fast-moving
23+
tech career? In this episode, Rahul Jain, a senior solutions engineer and data/AI
24+
leader with 15+ years driving enterprise data transformations and a career arc from
25+
mining engineering to data engineering and leadership, walks through practical mentoring
26+
strategies for tech professionals. We define mentoring (purpose, scope, types),
27+
explore early models like Thoughtworks’ sponsorship, and show how to find mentors
28+
through networks, platforms, and cold outreach — with concrete outreach best practices:
29+
specificity, background, and follow‑up. Rahul covers preparing mentoring sessions
30+
(goals, agendas), mentoring formats (one‑off advice vs long‑term relationships),
31+
and how to start as a mentor using simple first steps and platforms. Topics include
32+
benefits of mentoring, transferable workplace guidance, developing people skills
33+
(empathy, listening), balancing technical work and leadership, tackling imposter
34+
syndrome, coaching vs managing, setting boundaries and paid mentorship, and maintaining
35+
development plans. Listen to gain actionable steps, templates, and mindset shifts
36+
to both secure meaningful mentorship and build a sustainable mentoring practice
37+
in your tech career.'
38+
dateadded: '2021-02-23'
39+
quotableClips:
40+
- name: Episode Introduction
41+
startOffset: 0
42+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=0
43+
endOffset: 80
44+
- name: 'Career Journey: From Mining Engineering to Data Engineering & Leadership'
45+
startOffset: 80
46+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=80
47+
endOffset: 370
48+
- name: 'Defining Mentoring: Purpose, Scope, and Types'
49+
startOffset: 370
50+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=370
51+
endOffset: 570
52+
- name: 'Early Mentoring Experience: Thoughtworks Model and Sponsorship'
53+
startOffset: 570
54+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=570
55+
endOffset: 770
56+
- name: 'Finding a Mentor: Networks, Cold Outreach & Platforms'
57+
startOffset: 770
58+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=770
59+
endOffset: 990
60+
- name: 'Cold Outreach Best Practices: Specificity, Background, and Follow‑up'
61+
startOffset: 990
62+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=990
63+
endOffset: 1180
64+
- name: 'Preparing for Mentoring Sessions: Goals, Expectations, and Agenda'
65+
startOffset: 1180
66+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=1180
67+
endOffset: 1350
68+
- name: 'Mentoring Formats: One‑Off Advice vs. Long‑Term Relationships'
69+
startOffset: 1350
70+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=1350
71+
endOffset: 1510
72+
- name: 'Benefits of Being a Mentor: Listening, Pattern Recognition, and Growth'
73+
startOffset: 1510
74+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=1510
75+
endOffset: 1680
76+
- name: 'Mentoring Across Domains: Transferable Workplace Guidance'
77+
startOffset: 1680
78+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=1680
79+
endOffset: 1840
80+
- name: 'Developing People Skills: Empathy, Listening, and Avoiding the "Advice Monster"'
81+
startOffset: 1840
82+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=1840
83+
endOffset: 2010
84+
- name: 'Balancing Technical Work and Leadership: Blended Career Paths'
85+
startOffset: 2010
86+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=2010
87+
endOffset: 2200
88+
- name: 'Common Mentee Challenges: Imposter Syndrome & Tech‑vs‑Management Choices'
89+
startOffset: 2200
90+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=2200
91+
endOffset: 2390
92+
- name: 'Coaching vs. Managing: When to Use External Mentors'
93+
startOffset: 2390
94+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=2390
95+
endOffset: 2550
96+
- name: 'Setting Boundaries: Expectations, Cadence, and When to Charge'
97+
startOffset: 2550
98+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=2550
99+
endOffset: 2710
100+
- name: 'Paid Mentorship: Accountability, Pricing, and Professional Coaching'
101+
startOffset: 2710
102+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=2710
103+
endOffset: 2870
104+
- name: 'Building Reciprocal Mentoring Relationships: Authenticity & Retrospectives'
105+
startOffset: 2870
106+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=2870
107+
endOffset: 3020
108+
- name: 'Starting as a Mentor: Practical First Steps and Platforms'
109+
startOffset: 3020
110+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=3020
111+
endOffset: 3160
112+
- name: 'Maintaining Development Plans: Review, Visibility, and Follow‑through'
113+
startOffset: 3160
114+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=3160
115+
endOffset: 3290
116+
- name: 'Managing Overwhelm: Diagnosing Causes and Practical Steps'
117+
startOffset: 3290
118+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=3290
119+
endOffset: 3390
120+
- name: Audience Q&A Highlights
121+
startOffset: 3390
122+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=3390
123+
endOffset: 3480
124+
- name: Closing Thoughts and Resources
125+
startOffset: 3480
126+
url: https://www.youtube.com/watch?v=LQvwTNQbPg4&t=3480
127+
endOffset: 3480
24128
---
25129

26130
Today we're discussing mentoring with [Rahul Jain](/people/rahuljain.html), a technical leader with about 20 years of experience building and running software products. He currently leads the Business Intelligence and Data Engineering units at Omio, a ticket-booking company, and mentors engineers and managers through The Mentoring Club.

_podcast/s02e01-writing.md

Lines changed: 108 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -1,26 +1,123 @@
11
---
2-
title: "The Importance of Writing in a Tech Career"
3-
short: "The Importance of Writing in a Tech Career"
4-
guests: [eugeneyan]
5-
2+
title: 'Master Technical Writing: 7-Day Workflow to Accelerate Your Data Science Career'
3+
short: 'Master Technical Writing: 7-Day Workflow to Accelerate Your Data Science Career'
4+
guests:
5+
- eugeneyan
66
image: images/podcast/s02e01-writing.jpg
7-
8-
description: "Learn why technical writing is crucial for career growth in data science and ML. Eugene Yan shares his 7-day writing process, tips for documentation, and how writing helped him transition from psychology to Amazon Applied Scientist."
9-
10-
keywords: "technical writing, data science career, ML engineer writing, documentation skills, technical communication, data science blog, career growth, writing process, Amazon data scientist, Eugene Yan, technical documentation, data science portfolio, ML career advice, technical writing tips, data science writing"
11-
7+
description: 'Master technical writing for data science with a practical 7-day workflow:
8+
outline-first cadence, portfolio tips, docs & distribution to accelerate your career.'
9+
keywords: technical writing, data science career, ML engineer writing, documentation
10+
skills, technical communication, data science blog, career growth, writing process,
11+
Amazon data scientist, Eugene Yan, technical documentation, data science portfolio,
12+
ML career advice, technical writing tips, data science writing
1213
season: 2
1314
episode: 1
14-
1515
ids:
1616
youtube: vXWGd7olv3c
1717
anchor: The-Importance-of-Writing-in-a-Tech-Career---Eugene-Yan-ep17du
18-
1918
links:
2019
youtube: https://www.youtube.com/watch?v=vXWGd7olv3c
2120
anchor: https://anchor.fm/datatalksclub/episodes/The-Importance-of-Writing-in-a-Tech-Career---Eugene-Yan-ep17du
2221
spotify: TODO
2322
apple: TODO
23+
intro: How can technical writing accelerate your data science career in just one week?
24+
In this episode, Eugene Yan — an Applied Scientist at Amazon who previously led data
25+
science teams at Lazada and uCare.ai and writes about ML in production and career
26+
growth — walks through a practical, repeatable 7-day workflow for technical writing
27+
tailored to data scientists. <br><br> We cover Eugene’s career transition and first
28+
public writing, motivations for sharing work, and how to target readers (peers,
29+
future teammates, and hiring managers). He frames writing as a product with a weekly
30+
shipping cadence, explains the outline-first method for filtering ideas, and outlines
31+
a realistic time budget and editing limits. You’ll get concrete guidance on idea
32+
sourcing, title crafting, article length, blogging tools (Medium, Substack, WordPress,
33+
Jekyll), writing habits, distribution via Twitter and LinkedIn, and writing at work
34+
(press releases, design docs, decision logs). Practical portfolio advice — clear
35+
README, quick start, repo tour — and tips to iterate outlines and ship weekly round
36+
out the episode. <br><br> Listen to learn a concrete 7-day workflow, documentation
37+
and portfolio best practices, and distribution tactics to boost your technical writing
38+
and advance your data science career.
39+
dateadded: '2021-02-23'
40+
quotableClips:
41+
- name: Podcast Introduction
42+
startOffset: 0
43+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=0
44+
endOffset: 100
45+
- name: 'Career Transition: Psychology to Applied Scientist'
46+
startOffset: 100
47+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=100
48+
endOffset: 360
49+
- name: 'First Public Writing: Early Blog Posts and Meetups'
50+
startOffset: 360
51+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=360
52+
endOffset: 570
53+
- name: 'Writing Motivations: Share, Learn, Be a Beacon'
54+
startOffset: 570
55+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=570
56+
endOffset: 840
57+
- name: 'Audience Targeting: Readers, Peers, and Future Teammates'
58+
startOffset: 840
59+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=840
60+
endOffset: 990
61+
- name: 'Writing as Product: Weekly Shipping and UX Mindset'
62+
startOffset: 990
63+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=990
64+
endOffset: 1200
65+
- name: 'Weekly Writing Cadence: 7-Day Workflow and Schedule'
66+
startOffset: 1200
67+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=1200
68+
endOffset: 1500
69+
- name: 'Outline-First Method: Memory Rewriting and Idea Filtering'
70+
startOffset: 1500
71+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=1500
72+
endOffset: 1740
73+
- name: 'Time Budget & Editing: 25 Hours/Week and Avoiding Over-Editing'
74+
startOffset: 1740
75+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=1740
76+
endOffset: 1980
77+
- name: Idea Sources and Topic Prioritization for Technical Writing
78+
startOffset: 1980
79+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=1980
80+
endOffset: 2220
81+
- name: Title Crafting and Article Length Decisions
82+
startOffset: 2220
83+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=2220
84+
endOffset: 2460
85+
- name: 'Getting Started: Start Writing, Overcome Friction'
86+
startOffset: 2460
87+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=2460
88+
endOffset: 2610
89+
- name: 'Blogging Tools: Medium, Substack, WordPress, Jekyll (GitHub Pages)'
90+
startOffset: 2610
91+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=2610
92+
endOffset: 2760
93+
- name: 'Writing Habits: Morning Reps and Weekend Deep Work'
94+
startOffset: 2760
95+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=2760
96+
endOffset: 2910
97+
- name: 'Audience Growth: Distribution via Twitter, LinkedIn, Consistency'
98+
startOffset: 2910
99+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=2910
100+
endOffset: 3060
101+
- name: 'Writing at Work: Press Release, Working Backwards, and Design Docs'
102+
startOffset: 3060
103+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=3060
104+
endOffset: 3240
105+
- name: 'Technical Documentation: Decision Logs, Rationales, and Team Memory'
106+
startOffset: 3240
107+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=3240
108+
endOffset: 3390
109+
- name: 'Portfolio Best Practices: Clear README, Quick Start, Repo Tour'
110+
startOffset: 3390
111+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=3390
112+
endOffset: 3510
113+
- name: 'Practical Tips: Iterate Outlines, Ship Weekly, Learn by Teaching'
114+
startOffset: 3510
115+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=3510
116+
endOffset: 3630
117+
- name: Podcast Wrap-Up and Resources
118+
startOffset: 3630
119+
url: https://www.youtube.com/watch?v=vXWGd7olv3c&t=3630
120+
endOffset: 3630
24121
---
25122

26123
Today we're discussing technical writing, logging, documentation, and more. Our special guest is [Eugene Yan](/people/eugeneyan). Eugene works at the intersection of machine learning and product, building pragmatic ML systems while writing and speaking about effective data science, ML in production, and career growth.

0 commit comments

Comments
 (0)