Commit a922514

Merge pull request #101 from DataTalksClub/data-eng-courses
Data eng courses listicle and structured content fixes
2 parents 035ed64 + 84d68b1 commit a922514

File tree

164 files changed: +4371 −2381 lines


_layouts/author.html

Lines changed: 35 additions & 0 deletions
@@ -81,4 +81,39 @@ <h3>Books</h3>
 </div>

 {% include footer.html %}
+
+{%- assign person_description = page.bio_short | default: page.description | default: content | strip_html | strip_newlines | truncate: 200 -%}
+{%- assign same_as_links = "" -%}
+{%- if page.linkedin -%}
+{%- assign same_as_links = same_as_links | append: '"https://www.linkedin.com/in/' | append: page.linkedin | append: '/"' -%}
+{%- endif -%}
+{%- if page.twitter -%}
+{%- if same_as_links != "" -%}
+{%- assign same_as_links = same_as_links | append: ', ' -%}
+{%- endif -%}
+{%- assign same_as_links = same_as_links | append: '"https://twitter.com/' | append: page.twitter | append: '"' -%}
+{%- endif -%}
+{%- if page.github -%}
+{%- if same_as_links != "" -%}
+{%- assign same_as_links = same_as_links | append: ', ' -%}
+{%- endif -%}
+{%- assign same_as_links = same_as_links | append: '"https://github.com/' | append: page.github | append: '"' -%}
+{%- endif -%}
+{%- if page.web -%}
+{%- if same_as_links != "" -%}
+{%- assign same_as_links = same_as_links | append: ', ' -%}
+{%- endif -%}
+{%- assign same_as_links = same_as_links | append: '"' | append: page.web | append: '"' -%}
+{%- endif -%}
+<script type="application/ld+json">
+{
+"@context": "https://schema.org",
+"@type": "Person",
+"name": {{ page.title | jsonify }}{% if page.picture %},
+"image": "{{ site.url }}/{{ page.picture }}"{% endif %},
+"url": "{{ site.url }}{{ page.url }}",
+"description": {{ person_description | jsonify }}{% if same_as_links != "" %},
+"sameAs": [{{ same_as_links }}]{% endif %}
+}
+</script>
 </body>
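The template builds `same_as_links` as a comma-separated string of pre-quoted URLs and splices it into the JSON-LD `sameAs` array. A Python sketch of that join-and-splice step (the handles and the example.com URL are invented for illustration, not values from this commit) confirms the result parses as valid JSON:

```python
import json

# Hypothetical front-matter values standing in for page.linkedin,
# page.twitter, page.github and page.web; not taken from the repo.
page = {"linkedin": "jane-doe", "twitter": None,
        "github": "jane-doe", "web": "https://example.com"}

# Mirror the Liquid logic: append each double-quoted URL,
# inserting ', ' between entries.
parts = []
if page.get("linkedin"):
    parts.append('"https://www.linkedin.com/in/%s/"' % page["linkedin"])
if page.get("twitter"):
    parts.append('"https://twitter.com/%s"' % page["twitter"])
if page.get("github"):
    parts.append('"https://github.com/%s"' % page["github"])
if page.get("web"):
    parts.append('"%s"' % page["web"])
same_as_links = ", ".join(parts)

# Splice the string into the JSON-LD skeleton, as the <script> block does.
doc = '{"@type": "Person", "sameAs": [%s]}' % same_as_links
person = json.loads(doc)  # parses because every entry carries its own quotes
print(person["sameAs"])
```

This also shows the implicit assumption the Liquid code makes: it stays valid JSON only as long as the handles themselves contain no double quotes, since the values are spliced in rather than JSON-encoded.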

_podcast/ab-testing-and-product-experimentation.md

Lines changed: 21 additions & 12 deletions
@@ -1,6 +1,6 @@
 ---
-title: "Product Analytics & A/B Testing: Causality, Metrics, Power Analysis, A/A Tests"
-short: "A/B Testing"
+title: 'Product Analytics & A/B Testing: Causality, Metrics, Power Analysis, A/A Tests'
+short: A/B Testing
 season: 7
 episode: 6
 guests:
@@ -14,16 +14,30 @@ links:
 apple: https://podcasts.apple.com/us/podcast/a-b-testing-jakob-graff/id1541710331?i=1000552243668
 spotify: https://open.spotify.com/episode/3LhBOO1UANCGbOwkntZt4j
 youtube: https://www.youtube.com/watch?v=0Gqx1LtqRZU
-
-description: "Master product analytics, A/B testing & power analysis: design stable metrics, validate randomization with A/A tests, plan sample size to de-risk features."
-intro: "How do you design product experiments that truly establish causality and avoid costly false conclusions? In this episode, Jakob Graff — Director of Data Science and Data Analytics at diconium, with prior analytics leadership at Inkitt, Babbel, King and a background in econometrics — walks through practical product analytics and A/B testing strategies focused on causality and reliable metrics. <br><br> We cover why randomized experiments mirror clinical trials, how experimentation de-risks features and builds organizational learning, and a concrete case study on subscription vs. points revenue metric design. Jakob explains experimentation platform trade-offs (third-party vs. in-house), traffic splitters, assignment tracking, and why A/A tests validate system trust. You’ll hear best practices for first tests (two-group simplicity), metric selection considering noise and seasonality, and how to plan duration with power analysis and sample-size calculations. The discussion also compares z/t/nonparametric tests, p-value intuition from A/A comparisons, frequentist vs Bayesian perspectives, and multi-armed test considerations. <br><br> Listen to learn practical steps for designing randomized experiments, selecting stable metrics, planning sample sizes, and interpreting results so your product analytics and A/B testing produce actionable, causal insights"
+description: 'Master product analytics, A/B testing & power analysis: design stable
+  metrics, validate randomization with A/A tests, plan sample size to de-risk features.'
+intro: How do you design product experiments that truly establish causality and avoid
+  costly false conclusions? In this episode, Jakob Graff — Director of Data Science
+  and Data Analytics at diconium, with prior analytics leadership at Inkitt, Babbel,
+  King and a background in econometrics — walks through practical product analytics
+  and A/B testing strategies focused on causality and reliable metrics. <br><br> We
+  cover why randomized experiments mirror clinical trials, how experimentation de-risks
+  features and builds organizational learning, and a concrete case study on subscription
+  vs. points revenue metric design. Jakob explains experimentation platform trade-offs
+  (third-party vs. in-house), traffic splitters, assignment tracking, and why A/A
+  tests validate system trust. You’ll hear best practices for first tests (two-group
+  simplicity), metric selection considering noise and seasonality, and how to plan
+  duration with power analysis and sample-size calculations. The discussion also compares
+  z/t/nonparametric tests, p-value intuition from A/A comparisons, frequentist vs
+  Bayesian perspectives, and multi-armed test considerations. <br><br> Listen to learn
+  practical steps for designing randomized experiments, selecting stable metrics,
+  planning sample sizes, and interpreting results so your product analytics and A/B
+  testing produce actionable, causal insights
 topics:
 - data science
 - practices
 dateadded: 2022-02-27
-
 duration: PT01H03M37S
-
 quotableClips:
 - name: Podcast Introduction
   startOffset: 0
@@ -105,11 +119,6 @@ quotableClips:
   startOffset: 3839
   url: https://www.youtube.com/watch?v=0Gqx1LtqRZU&t=3839
   endOffset: 3880
-- name: Episode Wrap-up and Key Takeaways
-  startOffset: 3880
-  url: https://www.youtube.com/watch?v=0Gqx1LtqRZU&t=3880
-  endOffset: 3817
-
 transcript:
 - header: Podcast Introduction
 - header: Guest Background & Career Transition to Data Science
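The front-matter rewrites above replace long double-quoted one-liners with wrapped plain or single-quoted scalars. That is safe because of YAML's line-folding rule for flow scalars: a line break inside the value is read back as a single space, so the parsed string is unchanged. A hand-rolled Python sketch of that folding rule (illustration only, not a real YAML parser):

```python
# YAML folds a line break inside a plain or single-quoted scalar into a
# single space, so a wrapped value parses to the same one-line string.
def fold_scalar(lines):
    """Join wrapped scalar lines the way a YAML reader would."""
    return " ".join(line.strip() for line in lines)

# The wrapped single-quoted value from the diff above, minus the key.
wrapped = [
    "'Master product analytics, A/B testing & power analysis: design stable",
    "  metrics, validate randomization with A/A tests, plan sample size to de-risk features.'",
]
value = fold_scalar(wrapped).strip("'")
print(value)
```

A real parser (e.g. PyYAML's `yaml.safe_load`) applies the same folding, which is why a formatter can rewrap these descriptions freely.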

_podcast/ai-for-ecology-biodiversity-and-conservation.md

Lines changed: 22 additions & 8 deletions
@@ -1,6 +1,7 @@
 ---
-title: "AI for Ecology, Biodiversity, and Conservation: Computer Vision, Remote Sensing and Citizen Science"
-short: "AI for Ecology, Biodiversity, and Conservation"
+title: 'AI for Ecology, Biodiversity, and Conservation: Computer Vision, Remote Sensing
+  and Citizen Science'
+short: AI for Ecology, Biodiversity, and Conservation
 season: 18
 episode: 3
 guests:
@@ -14,8 +15,25 @@ links:
 apple: https://podcasts.apple.com/us/podcast/ai-for-ecology-biodiversity-and-conservation-tanya/id1541710331?i=1000653709956
 spotify: https://open.spotify.com/episode/3Hhz5N8ZDvsOPlPP3wxQxq?si=Oz7y_pBrTfeypfYZXubu-g
 youtube: https://www.youtube.com/watch?v=30tTrozbAkg
-description: "Discover AI-driven computer vision and remote sensing strategies to scale biodiversity monitoring, improve species ID, and inform conservation policy."
-intro: "How can AI help close critical data gaps in biodiversity monitoring and turn images and sensor data into actionable conservation decisions? In this episode Tanya Berger-Wolf, a computational ecologist, director of TDAI@OSU, and co-founder of the Wildbook project (Wild Me), walks through practical applications of AI for ecology, biodiversity monitoring, and conservation. <br><br> We cover core techniques—computer vision, machine learning, and remote sensing—and their use in image-based monitoring with camera traps, drones, and species identification. Tanya explains individual identification and longitudinal tracking, habitat mapping and change detection, and the data challenges of labeling, class imbalance, and sparse observations. The conversation addresses integration of heterogeneous datasets, model robustness (domain shift and transfer learning), and ethical considerations including Indigenous knowledge and equity. You’ll also hear about scalable platforms like Wildbook, citizen science workflows for crowdsourcing and quality control, policy relevance, open data and FAIR principles, edge deployment in the field, and building sustainable monitoring programs. <br><br> Listen to gain concrete insights on tools, pitfalls, and next steps for applying AI to conservation—what works now, what remains hard, and resources to explore further."
+description: Discover AI-driven computer vision and remote sensing strategies to scale
+  biodiversity monitoring, improve species ID, and inform conservation policy.
+intro: How can AI help close critical data gaps in biodiversity monitoring and turn
+  images and sensor data into actionable conservation decisions? In this episode Tanya
+  Berger-Wolf, a computational ecologist, director of TDAI@OSU, and co-founder of
+  the Wildbook project (Wild Me), walks through practical applications of AI for ecology,
+  biodiversity monitoring, and conservation. <br><br> We cover core techniques—computer
+  vision, machine learning, and remote sensing—and their use in image-based monitoring
+  with camera traps, drones, and species identification. Tanya explains individual
+  identification and longitudinal tracking, habitat mapping and change detection,
+  and the data challenges of labeling, class imbalance, and sparse observations. The
+  conversation addresses integration of heterogeneous datasets, model robustness (domain
+  shift and transfer learning), and ethical considerations including Indigenous knowledge
+  and equity. You’ll also hear about scalable platforms like Wildbook, citizen science
+  workflows for crowdsourcing and quality control, policy relevance, open data and
+  FAIR principles, edge deployment in the field, and building sustainable monitoring
+  programs. <br><br> Listen to gain concrete insights on tools, pitfalls, and next
+  steps for applying AI to conservation—what works now, what remains hard, and resources
+  to explore further.
 topics:
 - AI
 - computer vision
@@ -116,10 +134,6 @@ quotableClips:
   startOffset: 3630
   url: https://www.youtube.com/watch?v=30tTrozbAkg&t=3630
   endOffset: 3720
-- name: 'Episode Closing: Key Takeaways and Next Steps'
-  startOffset: 3720
-  url: https://www.youtube.com/watch?v=30tTrozbAkg&t=3720
-  endOffset: 3720
 context: 'Context: The episode frames a biodiversity crisis made harder by fragmented,
   sparse data and limited monitoring capacity, then surveys AI tools (computer vision,
   remote sensing, platforms, citizen science), technical challenges, ethical concerns,

_podcast/ai-ml-product-design-and-experimentation.md

Lines changed: 21 additions & 14 deletions
@@ -1,6 +1,6 @@
 ---
-title: "AI Product Design: Algorithm-Ready UX, Rapid Experiments & Data-Driven Roadmaps"
-short: "Innovation and Design for Machine Learning"
+title: 'AI Product Design: Algorithm-Ready UX, Rapid Experiments & Data-Driven Roadmaps'
+short: Innovation and Design for Machine Learning
 season: 8
 episode: 3
 guests:
@@ -14,19 +14,32 @@ links:
 apple: https://podcasts.apple.com/us/podcast/innovation-and-design-for-machine-learning-liesbeth/id1541710331?i=1000556693861
 spotify: https://open.spotify.com/episode/4vhTQJ6Aj9z5VHm9UsHspv
 youtube: https://www.youtube.com/watch?v=tcqBfZw41FM
-
-description: "Master AI product design: build algorithm-ready UX, run rapid experiments and craft data-driven roadmaps to prioritize innovation and ship measurable results."
-intro: "How do you design products that are “algorithm-ready” while running rapid experiments and building data-driven roadmaps? In this episode, Liesbeth Dingemans—strategy and AI leader, founder of Dingemans Consulting, former VP of Revenue at Source.ag and Head of AI Strategy at Prosus—walks through pragmatic approaches to AI product design that bridge vision and execution. <br><br> We cover algorithm-friendly UX and signal collection, a concrete interaction-design case study comparing TikTok and Instagram signals, and the Double Diamond framework for moving from problem framing to solution exploration. Liesbeth explains scoping and prioritization, parallel experiments and proofs of concept, one-week design sprints, appropriate timeframes for research-to-scale, and the role of designers, data scientists, engineers and product managers in shaping AI roadmaps. <br><br> Listeners will learn how to avoid rework by involving data science early, use scoping documents to challenge assumptions, create measurable experiments (the Task Force/“Jet Ski” model), and build data-driven pitches for long-term bets versus quarterly OKRs. Tune in for concrete frameworks and practices to make AI product design, rapid experiments, and data-driven roadmaps work in your organization"
+description: 'Master AI product design: build algorithm-ready UX, run rapid experiments
+  and craft data-driven roadmaps to prioritize innovation and ship measurable results.'
+intro: How do you design products that are “algorithm-ready” while running rapid experiments
+  and building data-driven roadmaps? In this episode, Liesbeth Dingemans—strategy
+  and AI leader, founder of Dingemans Consulting, former VP of Revenue at Source.ag
+  and Head of AI Strategy at Prosus—walks through pragmatic approaches to AI product
+  design that bridge vision and execution. <br><br> We cover algorithm-friendly UX
+  and signal collection, a concrete interaction-design case study comparing TikTok
+  and Instagram signals, and the Double Diamond framework for moving from problem
+  framing to solution exploration. Liesbeth explains scoping and prioritization, parallel
+  experiments and proofs of concept, one-week design sprints, appropriate timeframes
+  for research-to-scale, and the role of designers, data scientists, engineers and
+  product managers in shaping AI roadmaps. <br><br> Listeners will learn how to avoid
+  rework by involving data science early, use scoping documents to challenge assumptions,
+  create measurable experiments (the Task Force/“Jet Ski” model), and build data-driven
+  pitches for long-term bets versus quarterly OKRs. Tune in for concrete frameworks
+  and practices to make AI product design, rapid experiments, and data-driven roadmaps
+  work in your organization
 topics:
 - machine learning
 - design thinking
 - strategy
 - ai
 - practices
 dateadded: 2022-04-10
-
 duration: PT00H59M14S
-
 quotableClips:
 - name: Episode Introduction & Guest Overview
   startOffset: 0
@@ -132,11 +145,6 @@ quotableClips:
   startOffset: 3500
   url: https://www.youtube.com/watch?v=tcqBfZw41FM&t=3500
   endOffset: 3605
-- name: Closing Notes, Resources and Contact Links
-  startOffset: 3605
-  url: https://www.youtube.com/watch?v=tcqBfZw41FM&t=3605
-  endOffset: 3554
-
 transcript:
 - header: Episode Introduction & Guest Overview
 - header: 'Guest Background: Strategy, Product and AI Trajectory'
@@ -688,8 +696,7 @@ transcript:
   sec: 1817
   time: '30:17'
   who: Liesbeth
-- header: 'Scoping Documents: Challenging Assumptions with "Why"
-  '
+- header: 'Scoping Documents: Challenging Assumptions with "Why" '
 - line: 'Let''s imagine we have this situation: a manager comes to me, or to the team,
   or to the product manager and says, “Hey, this is the problem we think we have.
   Let''s solve it with a neural network.” So how do we challenge that person? How

_podcast/algorithmic-trading-with-python-and-machine-learning.md

Lines changed: 20 additions & 8 deletions
@@ -1,6 +1,6 @@
 ---
-title: "Algorithmic Trading with Python: Backtesting, Risk Management and Deployment"
-short: "Stock Market Analysis with Python and Machine Learning"
+title: 'Algorithmic Trading with Python: Backtesting, Risk Management and Deployment'
+short: Stock Market Analysis with Python and Machine Learning
 season: 17
 episode: 3
 guests:
@@ -14,14 +14,30 @@ links:
 apple: https://podcasts.apple.com/us/podcast/stock-market-analysis-with-python-and-machine/id1541710331?i=1000641465239
 spotify: https://open.spotify.com/episode/1ZXAeGr4Kx7F6oLQUip8Cc?si=KJwpYL-3SvuX8nPdc2cyOg
 youtube: https://www.youtube.com/watch?v=NThHAEIazFk
-description: "Master algorithmic trading: backtesting and risk management—learn practical data sources, features, models & execution to build robust strategies."
+description: 'Master algorithmic trading: backtesting and risk management—learn practical
+  data sources, features, models & execution to build robust strategies.'
 topics:
 - machine learning
 - data science
 - MLOps
 - algorithmic trading
 - tools
-intro: "How do you turn a trading idea into a robust, risk-managed algorithm in Python? In this episode Ivan Brigida — analytics lead behind PythonInvest with 10+ years in statistical modeling, forecasting, econometrics and finance — walks through practical steps for algorithmic trading with Python, from data sourcing to deployment (and a clear reminder this is educational, not investment advice). <br><br> We cover where retail traders get market data (Yahoo, Quandl, Polygon), OHLCV and adjusted-close nuances, and a concrete mean-reversion example. Ivan explains backtesting methodology, common pitfalls like time-series data leakage, and walk-forward simulation for realistic validation. He breaks down risk management (stop-loss thresholds, position sizing), execution and trading fees, plus evaluation metrics (ROI, precision) and defining prediction targets (binary growth thresholds such as 5%). <br><br> On the modeling side you’ll hear practical feature engineering (time-window stats, handcrafted indicators), model choices (logistic regression, XGBoost, neural nets), explainability via feature importance, and deployment options (cron, Airflow, APIs, partial automation). Listen to gain actionable guidance for building, validating, and deploying algorithmic trading systems in Python."
+intro: How do you turn a trading idea into a robust, risk-managed algorithm in Python?
+  In this episode Ivan Brigida — analytics lead behind PythonInvest with 10+ years
+  in statistical modeling, forecasting, econometrics and finance — walks through practical
+  steps for algorithmic trading with Python, from data sourcing to deployment (and
+  a clear reminder this is educational, not investment advice). <br><br> We cover
+  where retail traders get market data (Yahoo, Quandl, Polygon), OHLCV and adjusted-close
+  nuances, and a concrete mean-reversion example. Ivan explains backtesting methodology,
+  common pitfalls like time-series data leakage, and walk-forward simulation for realistic
+  validation. He breaks down risk management (stop-loss thresholds, position sizing),
+  execution and trading fees, plus evaluation metrics (ROI, precision) and defining
+  prediction targets (binary growth thresholds such as 5%). <br><br> On the modeling
+  side you’ll hear practical feature engineering (time-window stats, handcrafted indicators),
+  model choices (logistic regression, XGBoost, neural nets), explainability via feature
+  importance, and deployment options (cron, Airflow, APIs, partial automation). Listen
+  to gain actionable guidance for building, validating, and deploying algorithmic
+  trading systems in Python.
 dateadded: 2024-01-24
 duration: PT01H40S
 quotableClips:
@@ -129,10 +145,6 @@ quotableClips:
   startOffset: 3666
   url: https://www.youtube.com/watch?v=NThHAEIazFk&t=3666
   endOffset: 3696
-- name: Episode Wrap-up and final reminder (not financial advice)
-  startOffset: 3696
-  url: https://www.youtube.com/watch?v=NThHAEIazFk&t=3696
-  endOffset: 3640
 transcript:
 - header: Podcast Introduction
 - header: 'Guest Introduction: Ivan Brigida — Analytics Lead & PythonInvest'
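Each podcast file above drops a trailing quotable clip whose offsets are inconsistent, for example a wrap-up clip that starts at 3880 but ends at 3817. A small sanity check of the kind that would flag such entries (a hypothetical helper, not code from this repo; clip names and numbers echo the A/B-testing diff):

```python
# Hypothetical validator for quotableClips entries: a clip's endOffset
# must come strictly after its startOffset, or the range is inconsistent.
clips = [
    {"name": "Preceding Clip", "startOffset": 3839, "endOffset": 3880},
    {"name": "Episode Wrap-up and Key Takeaways",
     "startOffset": 3880, "endOffset": 3817},
]

def invalid_clips(clips):
    """Return the names of clips whose end does not follow their start."""
    return [c["name"] for c in clips if c["endOffset"] <= c["startOffset"]]

print(invalid_clips(clips))  # flags only the wrap-up clip
```

Running such a check in CI would catch malformed clip ranges before they reach the published front matter.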
