GitHub - Codecademy-Curriculum/pytorch-skillpath-capstone-example: Example solution to the PyTorch skillpath capstone project

PyTorch Skill Path - Capstone Project Example

Capstone Overview

🩺 Analyzing Health Factors - Predicting Diabetes and Age

Goal: Predict diabetes diagnosis (binary classification) and patient age (regression) using health-related factors.
Dataset: Subset of a CDC dataset
- A smaller, cleaned version is available in the UCI Machine Learning Repository
- License: CC0 Public Domain
Models:
- Regression Models:
  - Linear regression (baseline)
  - Feedforward Neural Network
- Classification Models:
  - Feedforward Neural Network
Results:
- Regression: The FNN outperformed the linear regression baseline when predicting age, achieving lower MSE scores on the testing set.
- Classification: The FNN achieved 76% accuracy with reasonably balanced performance across both classes (F1-scores of 0.77 for class 1 and 0.74 for class 0).
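
The accuracy and per-class F1-scores reported above can be derived from a binary confusion matrix. The snippet below is a minimal sketch of that calculation. The counts are hypothetical placeholders chosen for illustration, not the notebook's actual results, and `f1_for_class` is an illustrative helper rather than a function from the notebook.

```python
def f1_for_class(tp: int, fp: int, fn: int) -> float:
    """F1 for one class: harmonic mean of precision and recall."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical confusion-matrix counts with class 1 as the positive class.
# These are NOT the values produced by the capstone notebook.
tp, fp, fn, tn = 80, 25, 20, 75

accuracy = (tp + tn) / (tp + fp + fn + tn)

# Class 1: true positives are the correctly predicted 1s.
f1_class_1 = f1_for_class(tp, fp, fn)

# Class 0: the roles swap. Correct 0s (tn) become the "true positives",
# class-1 misses (fn) become false positives for class 0, and
# class-1 false alarms (fp) become false negatives for class 0.
f1_class_0 = f1_for_class(tn, fn, fp)

print(f"accuracy={accuracy:.3f}  F1(class 1)={f1_class_1:.3f}  F1(class 0)={f1_class_0:.3f}")
```

Comparing the two F1-scores side by side is a quick way to see whether a model favors one class, which a single accuracy figure can hide.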

📚 Classifying Medical Text

Goal: Classify medical question-answer pairs from trusted medical sources into focus areas (multiclass classification).
Dataset: MedQuAD dataset from the research paper A Question-Entailment Approach to Question Answering
- License: CC BY 4.0
Models:
- Feedforward Neural Network
- Specialized BERT Transformer (BiomedBERT)
Results:
- The FNN achieved ~83% accuracy on the testing set
- The specialized BERT achieved ~98% accuracy on the testing set, with more balanced performance across all five classes (higher precision, recall, and F1-scores)
- The specialized BERT demonstrated better understanding of medical text
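
"Balanced performance across classes" is usually judged from per-class precision, recall, and F1, often summarized as a macro average. The sketch below shows one way to compute these from lists of true and predicted labels using one-vs-rest counts. The toy labels `"a"`, `"b"`, and `"c"` and the helper names are hypothetical; they are not the dataset's focus areas or the notebook's code.

```python
from collections import Counter


def per_class_report(y_true, y_pred):
    """Return {label: (precision, recall, f1)} using one-vs-rest counts."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # pred was claimed but wrong
            fn[truth] += 1  # the true label was missed
    report = {}
    for label in sorted(set(y_true) | set(y_pred)):
        claimed = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / claimed if claimed else 0.0
        recall = tp[label] / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        report[label] = (precision, recall, f1)
    return report


def macro_f1(report):
    """Unweighted mean of per-class F1, so every class counts equally."""
    return sum(f1 for _, _, f1 in report.values()) / len(report)


# Hypothetical toy labels, for illustration only.
y_true = ["a", "a", "b", "b", "c", "c"]
y_pred = ["a", "b", "b", "b", "c", "a"]

report = per_class_report(y_true, y_pred)
for label, (p, r, f1) in report.items():
    print(f"{label}: precision={p:.2f} recall={r:.2f} f1={f1:.2f}")
print(f"macro F1: {macro_f1(report):.3f}")
```

Because the macro average weights each class equally, a model that neglects a rare class scores noticeably lower on macro F1 than on plain accuracy.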

👁 Classifying Retinal Images for Diabetic Retinopathy

Goal: Classify high-quality retinal fundus images for diabetic retinopathy (binary classification).
Dataset: IDRiD dataset
- License: CC BY 4.0
Models:
- Convolutional Neural Network
- Vision Transformer
Results:
- The CNN achieved ~58% accuracy on the testing set
- The ViT achieved ~82% accuracy on the testing set
- The ViT showed better performance across both classes (higher precision, recall, and F1-scores)

Getting Started

To explore and run the code for this capstone project, we suggest creating an environment with the libraries and versions listed in the requirements.txt file.

The notebooks can run locally or in the cloud. Jupyter-compatible cloud environments (recommended) include:

- Google Colab
- Kaggle Notebooks
- Paperspace Gradient
- Deepnote

💡 Tip: Enabling a GPU is highly recommended, especially for the retinal image classification task.
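
To confirm that a GPU is actually visible to PyTorch, a short check such as the following may help. This is a minimal sketch: `pick_device` is a hypothetical helper name, and the fallback to `"cpu"` when PyTorch is not installed is an assumption made so the snippet runs anywhere.

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch can see a CUDA GPU, otherwise "cpu"."""
    try:
        import torch
    except ImportError:
        # PyTorch is not installed yet; install the requirements first.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


device = pick_device()
print(f"Using device: {device}")
```

Inside a notebook, the resulting string can then be passed to `model.to(device)` and to each batch's `.to(device)` call so that the model and data live on the same device.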

Install Dependencies

All required libraries and their versions can be installed using:

pip install -r requirements.txt
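
After installing, it can be useful to confirm that the environment matches the pinned versions. The sketch below assumes the requirements file uses simple `name==version` lines, which may not hold for every entry; other specifiers are skipped. `parse_pins` and `find_mismatches` are hypothetical helper names, and the sample pin is illustrative, not taken from the repository.

```python
from importlib import metadata


def parse_pins(text: str) -> dict:
    """Collect `name==version` lines; skip blanks, comments, and other specifiers."""
    pins = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        if "==" in line:
            name, version = line.split("==", 1)
            pins[name.strip()] = version.strip()
    return pins


def find_mismatches(pins: dict) -> dict:
    """Map package name -> (wanted, installed or None) for each unmet pin."""
    mismatches = {}
    for name, wanted in pins.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        # Plain string comparison: a local build tag such as "+cu121"
        # would be reported as a mismatch.
        if installed != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches


# Hypothetical file contents, for illustration only.
sample = "torch==2.2.0  # hypothetical pin\n\n# a comment line\nnumpy>=1.20\n"
print(parse_pins(sample))
```

To check a real environment, read the file with `open("requirements.txt").read()`, pass the text to `parse_pins`, and pass the result to `find_mismatches`; an empty dictionary means every simple pin is satisfied.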

Folders and files

- .ipynb_checkpoints
- datasets
- Analyzing Health Factors - Predicting Diabetes and Age.ipynb
- Classifying Medical Text with PyTorch.ipynb
- Classifying Retinal Images for Diabetes with PyTorch.ipynb
- README.md
- requirements.txt
