Skip to content

Conversation

Copy link
Contributor

Copilot AI commented Dec 30, 2025

The 2026 challenge needed modernization: missing GenAI integration, high environment friction, lack of modern stack specifics (dbt, Polars), and no feedback loops.

Changes

New AI-Augmented Analyst Track

  • Weekly AI challenges teaching responsible GenAI use (prompt engineering for SQL, synthetic data generation, AI-assisted refactoring)
  • Key principles: AI as co-pilot, not autopilot

Week-by-Week Innovations

Week Addition
1 EXPLAIN ANALYZE for query cost optimization
2 Visual SQL with DuckDB + Python charting
3 Peeking simulation (Python code showing false positive inflation)
4 Multi-Armed Bandits with Thompson Sampling example
5 Polars as high-performance Pandas alternative
6 Metric Trees for hierarchical metric visualization
7 dbt Core tutorial with project structure
8 3-minute Loom video requirement

Zero-Friction Environment Setup

# One-click badges for Colab, Codespaces, DuckDB WASM
# Or local:
pip install -r supplementary/challenge-2026/requirements.txt

Sample Data

Added challenge-2026/data/ with ready-to-use CSVs: customers, orders, products, user_activity, daily_metrics

Marketing/Community

  • Official cohort start dates (Jan/Apr/Sep 2026)
  • Digital certificates for LinkedIn
  • Spotlight rewards for best submissions

Project READMEs

Updated all Analytical-HandsOn-Projects/ READMEs with learning objectives, 2026 challenge alignment, and code examples

Original prompt

Based on the content of supplementary/2026-new-year-challenge.md, here is an in-depth analysis, improvement plan, and marketing strategy to elevate the challenge.

implement these cahnges and revamp and improve the content, as well as the content of the tracks in the project ththe challange reffers to

1. Syllabus Analysis: Strengths & Gaps

The current syllabus is exceptional in its pedagogical approach. It successfully blends technical skills (SQL, Python) with product sense and communication—a mix often missing in standard bootcamps.

Strengths

  • "Product Sense" Integration: Week 3 (Communication Exercise) and Week 6 (Metrics Framework) are standout sections. They teach analysts how to think and speak, not just code.
  • Reality-Based Scenarios: The "Messy Dataset Gauntlet" (Week 5) and "Guardrail Paradox" (Week 4) mimic actual job frustrations rather than perfect academic scenarios.
  • Clear Artifacts: Every week ends with a tangible output (mini-project), culminating in a Capstone.
  • Gamification: The badge system (Halfway Hero, Champion) leverages GitHub's native ecosystem well.

Gaps & Areas for Improvement

  • The "AI Gap" (Critical for 2026): The syllabus mentions "2026" but lacks specific modules on Generative AI. By 2026, an analyst who cannot use LLMs for code generation, synthetic data creation, or semantic search will be behind.
  • Environment Friction: Weeks 5 and 7 require local Python/SQL setups. This creates a high drop-off risk for beginners struggling with pip install or Docker.
  • Feedback Loop Vacuum: The challenge relies on self-assessment. Without feedback loops, learners may reinforce bad habits.
  • Lack of "Modern Stack" Specifics: Week 7 touches on Analytical Engineering but remains abstract. Exposure to tools like dbt (even the core version) or cloud warehouses (BigQuery/Snowflake free tiers) is missing.

2. Innovative Upskilling Additions

To make this the definitive challenge for 2026, integrate these modern learning methods:

A. The "AI-Augmented Analyst" Track (New Vertical)

Integrate GenAI into every week, not as a cheat, but as a tool.

  • Week 1 (SQL): "Prompt Engineering for SQL." Task: Write a complex query, then ask Copilot/ChatGPT to optimize it. Compare the execution plans.
  • Week 3 (Stats): "Synthetic Data Generation." Task: Use Python faker or an LLM to generate a dataset with a specific skew to test your hypothesis scripts.
  • Week 5 (Python): "Refactoring with AI." Task: Take your "spaghetti code" from Day 30 and use an AI agent to modularize it into functions.

B. Interactive Compute Environments

Reduce friction by providing "One-Click" environments.

  • Google Colab / GitHub Codespaces: Instead of asking users to install Pandas locally, provide a Launch in Colab badge for Week 5 & 8.
  • SQL Fiddle / DuckDB: For Weeks 1-2, embed a link to a browser-based SQL runner (using DuckDB WASM) so they can run queries on the sample data immediately without setting up Postgres.

C. "Simulated Stakeholder" Bot

Leverage GitHub Actions or a simple linked GPT wrapper.

  • The Concept: A "Virtual PM" bot.
  • Implementation: In Week 6 (Case Studies), the user submits their recommendation. A prompt instructions the bot to challenge them: "But what about the impact on latency?" This simulates the pushback they will face in real interviews.

3. Detailed Improvement Plan (Week-by-Week Tweaks)

Week Current Focus Proposed Innovation
W1 SQL Basics Add "Query Cost Optimization": Introduce EXPLAIN ANALYZE early. It distinguishes seniors from juniors.
W2 Window Functions Add "Visual SQL": Use a tool like Metabase (local) or Streamlit to visualize the output of window functions immediately.
W3 Stats Add "The Peeking Simulation": A simple Python script that visually simulates p-value hacking over time to prove why peeking is bad.
W4 A/B Testing Add "Bandit Algorithms": Briefly introduce Multi-Armed Bandits as a modern alternative to A/B testing for specific use cases.
W5 Pandas Add "Polars": Introduce Polars as a high-performance alternative to Pandas (gaining massive traction).
W6 Product Metrics Add "Metric Trees": Teach how to visualize metrics hierarchically (e.g., using a Miro template).
W7 AE Basics Add "dbt Core": Include a specific tutorial on setting up a dbt project structure rather than just raw SQL views.
W8 Capstone Add "Loom/Video Requirement": Require a 3-minute video walkthrough. Communication is 50% of the job.

4. Marketing & Growth Strategy

To expand reach and engagement, treat this challenge like a product launch.

Phase 1: The "Build in Public" Campaign (Pre-Launch)

  • The "Commitment Contract": Create a generic social media image/card that says *"I am committed to ...

✨ Let Copilot coding agent set things up for you — coding agent works faster and does higher quality work when set up for your repo.

Copilot AI changed the title [WIP] Revamp and improve content for New Year challenge Revamp 2026 Analytics Challenge with AI-Augmented Track and Modern Tools Dec 30, 2025
Copilot AI requested a review from moshesham December 30, 2025 10:46
@moshesham moshesham marked this pull request as ready for review December 30, 2025 10:52
Copilot AI review requested due to automatic review settings December 30, 2025 10:52
@moshesham moshesham merged commit f9f7d71 into main Dec 30, 2025
2 of 5 checks passed
Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR modernizes the 2026 Analytics Challenge with comprehensive updates focused on AI-augmented analysis skills, zero-friction environment setup, and enhanced learning resources. The challenge now equips participants to leverage GenAI tools responsibly while maintaining strong technical fundamentals.

Key Changes:

  • Introduced AI-Augmented Analyst Track with weekly challenges teaching responsible GenAI use as a co-pilot, not autopilot
  • Added zero-friction environment setup with one-click badges for Colab, Codespaces, and DuckDB WASM
  • Created ready-to-use sample datasets (customers, orders, products, user_activity, daily_metrics) with comprehensive documentation
  • Enhanced each week with modern tools and techniques (EXPLAIN ANALYZE, Polars, dbt Core, Multi-Armed Bandits, etc.)
  • Established official cohort structure with start dates, digital certificates, and community features

Reviewed changes

Copilot reviewed 19 out of 19 changed files in this pull request and generated no comments.

Show a summary per file
File Description
supplementary/challenge-2026/requirements.txt Python dependencies for all 8 weeks including Polars, DuckDB, and optional dbt
supplementary/challenge-2026/environment.yml Conda environment specification mirroring requirements.txt
supplementary/challenge-2026/data/*.csv 8 sample CSV files (customers, orders, products, users, user_activity, order_items, monthly_revenue, daily_metrics)
supplementary/challenge-2026/data/README.md Comprehensive data dictionary with schema details, quick start examples, and practice problems
supplementary/2026-new-year-challenge.md Major revamp with AI-augmented track, environment setup guide, week-by-week innovations, community features, and marketing elements
Analytical-HandsOn-Projects/overview.md Updated with 2026 enhancements including project portfolio table and consistent structure documentation
Analytical-HandsOn-Projects/*/README.md All 6 project READMEs revamped with learning objectives, skills practiced, dataset suggestions, code examples, and 2026 enhancements

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants