Aspiring data analyst with a Master’s in Mathematical Sciences and a background in statistical modeling, data visualization, and public-sector analytics. Project-based experience with AWS, Snowflake, Python, SQL, and R, including building cloud-based data pipelines and applying statistical methods to complex datasets. Developing the ability to translate technical findings into clear insights to support institutional policy and decision-making.
Savvy Coders | February 2025 – May 2025
Implemented a full-stack cloud data engineering solution to analyze socioeconomic drivers of food insecurity across 3,000+ U.S. counties.
- Data Engineering: Developed a serverless ETL pipeline using AWS Glue (PySpark) to transform raw USDA datasets into optimized Apache Parquet format.
- Data Warehousing: Engineered a Snowflake environment featuring external stages, automated data loading (COPY INTO), and optimized SQL views for health-risk priority classification.
- Hybrid Architecture: Established a dual-engine query layer utilizing AWS Athena for ad-hoc exploration and Snowflake for high-concurrency business intelligence.
- Visualization: Designed an interactive Amazon QuickSight dashboard to quantify the "Food Access Health Gap," identifying a 2% increase in obesity rates within low-access regions.
- Infrastructure Management: Managed resource lifecycles across multiple AWS regions, ensuring cost-efficient operations and adherence to security best practices.
Skills: AWS (Glue, S3, Athena) · Snowflake · PySpark · Amazon QuickSight · SQL · Data Engineering
GitHub Repo · Tableau Dashboard
Jan 2024 – Jul 2024 | University of Minnesota Duluth
Analyzed trace element contamination across trophic levels in aquatic ecosystems downstream of historic mining sites.
- Cleaned and normalized environmental data using Box-Cox transformation
- Applied PCA, Factor Analysis, and K-means clustering in R
- Found evidence of biomagnification with policy implications for ecological risk
- Created ggplot2 visualizations for non-specialist stakeholders
Skills: R · PCA · Cluster Analysis · Environmental Analytics · Data Cleaning · Statistical Modeling
Jan 2024 – May 2024 | University of Minnesota Duluth
Developed machine learning models to predict diabetes risk using clinical data.
- Preprocessed the Pima Indians Diabetes dataset using feature scaling and PCA
- Built logistic regression and neural network models in Python (scikit-learn)
- Achieved 85% accuracy and strong ROC AUC performance
- Compared model interpretability and performance across techniques
Skills: Python · scikit-learn · Logistic Regression · PCA · Neural Networks · Predictive Modeling
Apr 2024 | University of Minnesota Duluth
Collaborated with a team to identify optimal locations for eco-friendly fertilizer facilities in Minnesota.
- Used R for regression modeling and KNN imputation
- Integrated geospatial and economic data to inform site selection
- Presented findings to industry panel with actionable recommendations
Skills: R · Regression · KNN · Predictive Analytics · Team Collaboration · Data Integrity
Aug 2023 – Dec 2023 | University of Minnesota Duluth
Evaluated clinical outcomes based on oxytocin administration routes using SAS and Power BI.
- Conducted t-tests, chi-square tests, and multiple regression modeling
- Built Power BI dashboard to visualize Shock Index and hemorrhage risk
- Identified key predictors of postpartum hemoglobin levels
Skills: SAS · Power BI · Regression · Biostatistics · Data Visualization · Clinical Analytics
Jan 2024 – Mar 2024 | University of Minnesota Duluth
Analyzed 6M+ employment records to assess the impact of paid maternity leave on hiring rates of female IT workers in India.
- Used SAS and R for statistical modeling and trend analysis
- Found significant post-policy increases in female hiring in urban tech hubs
- Delivered policy recommendations based on rigorous pre/post comparisons
Skills: SAS · R · Policy Analysis · Gender Equity · Statistical Inference · Data Visualization
Jan 2023 – May 2023 | University of Minnesota Duluth
Conducted cross-country regression analysis using Stata to identify links between life expectancy, GDP, and suicide rates.
- Built predictive models and validated statistical outputs
- Found strong associations between healthcare quality and suicide risk
- Presented findings for public health policy consideration
Skills: Stata · Regression · Econometrics · Mental Health Analytics · Statistical Modeling
Saint Louis University | Aug 2025 – Present
- Teach Intermediate Algebra and Introductory Statistics
- Use Excel and LMS tools to track student performance and identify trends
- Support curriculum development and provide individualized feedback
St. Charles Community College | Aug 2025 – Present
- Teach statistical methods including regression, ANOVA, and hypothesis testing
- Guide students in Excel-based analysis and data interpretation
- Emphasize data literacy and real-world applications
University of Minnesota Duluth | Sep 2021 – Aug 2024
- Supported instruction in Calculus, Linear Algebra, and Differential Equations
- Led computational labs using Mathematica and analyzed student performance data
- Maintained academic records and collaborated on curriculum design
- MS in Mathematical Sciences (Statistics Focus) – University of Minnesota Duluth – Jul 2024
- BA in Mathematical Sciences & IT – Westminster College, MO – May 2018
- Savvy Coders Data Analytics + Python Bootcamp – May 2025
- ICAgile Certified Professional (ICP) – May 2025
- Python · SQL · Tableau · R · SAS · Stata · Excel · Power BI
- Regression · ANOVA · PCA · Clustering · Hypothesis Testing
- Machine Learning · Neural Networks · Logistic Regression
- Git/GitHub · Agile · Jira · Data Cleaning · Data Visualization