Skip to content

MahfoudBouad/portfolio

Repository files navigation


Data Analyst Portfolio

Professional Summary

Aspiring data analyst with a Master’s in Mathematical Sciences and a background in statistical modeling, data visualization, and public-sector analytics. Project-based experience with AWS and Snowflake, as well as Python, SQL, and R, involving the development of cloud-based data pipelines and the application of statistical methods to complex datasets. Developing the ability to translate technical findings into clear insights to support institutional policy and decision-making.


Featured Projects

U.S. Food Access and Health Equity: Cloud Data Pipeline

Savvy Coders | February 2025 – May 2025 Implemented a full-stack cloud data engineering solution to analyze socioeconomic drivers of food insecurity across 3,000+ U.S. counties.

  • Data Engineering: Developed a serverless ETL pipeline using AWS Glue (PySpark) to transform raw USDA datasets into optimized Apache Parquet format.
  • Data Warehousing: Engineered a Snowflake environment featuring external stages, automated data loading (COPY INTO), and optimized SQL views for health-risk priority classification.
  • Hybrid Architecture: Established a dual-engine query layer utilizing AWS Athena for ad-hoc exploration and Snowflake for high-concurrency business intelligence.
  • Visualization: Designed an interactive Amazon QuickSight dashboard to quantify the "Food Access Health Gap," identifying a 2% increase in obesity rates within low-access regions.
  • Infrastructure Management: Managed resource lifecycles across multiple AWS regions, ensuring cost-efficient operations and adherence to security best practices. Skills: AWS (Glue, S3, Athena), Snowflake, PySpark, Amazon QuickSight, SQL, Data Engineering. GitHub Repo · Tableau Dashboard

Bioaccumulation in Pueblo Reservoir

Jan 2024 – Jul 2024 | University of Minnesota Duluth
Analyzed trace element contamination across trophic levels in aquatic ecosystems downstream of historic mining sites.

  • Cleaned and normalized environmental data using Box-Cox transformation
  • Applied PCA, Factor Analysis, and K-means clustering in R
  • Found evidence of biomagnification with policy implications for ecological risk
  • Created ggplot2 visualizations for non-specialist stakeholders
    Skills: R · PCA · Cluster Analysis · Environmental Analytics · Data Cleaning · Statistical Modeling

Diabetes Prediction Model

Jan 2024 – May 2024 | University of Minnesota Duluth
Developed machine learning models to predict diabetes risk using clinical data.

  • Preprocessed Pima Indians dataset using feature scaling and PCA
  • Built logistic regression and neural network models in Python (Scikit-learn)
  • Achieved 85% accuracy and strong ROC AUC performance
  • Compared model interpretability and performance across techniques
    Skills: Python · Scikit-learn · Logistic Regression · PCA · Neural Networks · Predictive Modeling

Green Fertilizer Facility Optimization (MUDAC 2024 Hackathon)

Apr 2024 | University of Minnesota Duluth
Collaborated with team to identify optimal locations for eco-friendly fertilizer facilities in Minnesota.

  • Used R for regression modeling and KNN imputation
  • Integrated geospatial and economic data to inform site selection
  • Presented findings to industry panel with actionable recommendations
    Skills: R · Regression · KNN · Predictive Analytics · Team Collaboration · Data Integrity

Postpartum Hemorrhage Outcomes Analysis

Aug 2023 – Dec 2023 | University of Minnesota Duluth
Evaluated clinical outcomes based on oxytocin administration routes using SAS and Power BI.

  • Conducted t-tests, chi-square tests, and multiple regression modeling
  • Built Power BI dashboard to visualize Shock Index and hemorrhage risk
  • Identified key predictors of postpartum hemoglobin levels
    Skills: SAS · Power BI · Regression · Biostatistics · Data Visualization · Clinical Analytics

Maternal Leave Policy Impact (Hackathon)

Jan 2024 – Mar 2024 | University of Minnesota Duluth
Analyzed 6M+ employment records to assess impact of paid maternal leave on hiring rates of female IT workers in India.

  • Used SAS and R for statistical modeling and trend analysis
  • Found significant post-policy increases in female hiring in urban tech hubs
  • Delivered policy recommendations based on rigorous pre/post comparisons
    Skills: SAS · R · Policy Analysis · Gender Equity · Statistical Inference · Data Visualization

Suicide Rate Analysis by Socioeconomic Factors

Jan 2023 – May 2023 | University of Minnesota Duluth
Conducted cross-country regression analysis using STATA to identify links between life expectancy, GDP, and suicide rates.

  • Built predictive models and validated statistical outputs
  • Found strong associations between healthcare quality and suicide risk
  • Presented findings for public health policy consideration
    Skills: STATA · Regression · Econometrics · Mental Health Analytics · Statistical Modeling

Professional Experience

Adjunct Instructor – Mathematics & Statistics

Saint Louis University | Aug 2025 – Present

  • Teach Intermediate Algebra and Introductory Statistics
  • Use Excel and LMS tools to track student performance and identify trends
  • Support curriculum development and provide individualized feedback

Adjunct Instructor – Statistics

St. Charles Community College | Aug 2025 – Present

  • Teach statistical methods including regression, ANOVA, and hypothesis testing
  • Guide students in Excel-based analysis and data interpretation
  • Emphasize data literacy and real-world applications

Graduate Teaching Assistant

University of Minnesota Duluth | Sep 2021 – Aug 2024

  • Supported instruction in Calculus, Linear Algebra, and Differential Equations
  • Led computational labs using Mathematica and analyzed student performance data
  • Maintained academic records and collaborated on curriculum design

Education

  • MS in Mathematical Sciences (Statistics Focus) – University of Minnesota Duluth – Jul 2024
  • BA in Mathematical Sciences & IT – Westminster College, MO – May 2018

Certifications

  • Savvy Coders Data Analytics + Python Bootcamp – May 2025
  • ICAgile Certified Professional (ICP) – May 2025

Technical Skills

  • Python · SQL · Tableau · R · SAS · STATA · Excel · Power BI
  • Regression · ANOVA · PCA · Clustering · Hypothesis Testing
  • Machine Learning · Neural Networks · Logistic Regression
  • Git/GitHub · Agile · Jira · Data Cleaning · Data Visualization

Resources



About

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors