| 1 | +# Medical Appointment No-Shows Prediction |
| 2 | + |
| 3 | +## Overview |
| 4 | +This project implements a machine learning model that predicts whether a patient will miss a medical appointment, using patient demographics, health indicators, and appointment details as inputs. |
| 5 | + |
| 6 | +## Dataset |
| 7 | +The dataset contains the following information: |
| 8 | +- **Total Records**: Approximately 100,000 medical appointments |
| 9 | +- **Columns**: 14 in total, covering identifiers, patient demographics, health indicators, appointment details, and the target |
| 10 | +- **Target Variable**: `No-show` (1 = No-show, 0 = Show) |
| 11 | + |
| 12 | +### Dataset Features |
| 13 | +1. `PatientId` - Unique patient identifier |
| 14 | +2. `AppointmentID` - Unique appointment identifier |
| 15 | +3. `Gender` - Patient gender (M/F) |
| 16 | +4. `ScheduledDay` - Date when appointment was scheduled |
| 17 | +5. `AppointmentDay` - Actual date of appointment |
| 18 | +6. `Age` - Patient age in years |
| 19 | +7. `Neighbourhood` - Neighbourhood where patient is from |
| 20 | +8. `Scholarship` - Indicates if patient is on any scholarship program |
| 21 | +9. `Hipertension` - Indicates if patient has hypertension |
| 22 | +10. `Diabetes` - Indicates if patient has diabetes |
| 23 | +11. `Alcoholism` - Indicates if patient has alcoholism |
| 24 | +12. `HandiCap` - Indicates if patient has any handicap |
| 25 | +13. `SMS_received` - Indicates if appointment reminder SMS was sent |
| 26 | +14. `No-show` - Target variable (0 = Showed up, 1 = No-show) |
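As a rough illustration of the column layout listed above, the sketch below checks that a DataFrame carries the 14 expected columns and separates the target from the rest. The helpers `check_schema` and `split_features_target` and the placeholder row are hypothetical and are not taken from `model.py`; the column names are copied from the list as written.

```python
import pandas as pd

# Column names as listed in this README.
EXPECTED_COLUMNS = [
    "PatientId", "AppointmentID", "Gender", "ScheduledDay", "AppointmentDay",
    "Age", "Neighbourhood", "Scholarship", "Hipertension", "Diabetes",
    "Alcoholism", "HandiCap", "SMS_received", "No-show",
]


def check_schema(df):
    """Hypothetical helper: return the expected columns missing from df."""
    return [col for col in EXPECTED_COLUMNS if col not in df.columns]


def split_features_target(df):
    """Hypothetical helper: separate the `No-show` target from the other columns."""
    return df.drop(columns=["No-show"]), df["No-show"]


# A single placeholder row (all zeros), only to exercise the helpers.
sample = pd.DataFrame([{col: 0 for col in EXPECTED_COLUMNS}])

missing = check_schema(sample)
X, y = split_features_target(sample)
print(missing, X.shape, y.name)
```

If the column names in the actual data file differ from this list, `check_schema` would report them as missing, which is a cheap check to run before training.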
| 27 | + |
| 28 | +## Files in This Project |
| 29 | +- `model.py` - Main model implementation with preprocessing pipeline and training code |
| 30 | +- `requirements.txt` - Python dependencies |
| 31 | +- `README.md` - This documentation file |
| 32 | + |
| 33 | +## Data Preprocessing |
| 34 | +- Missing values in numerical columns are filled with the column mean |
| 35 | +- Categorical variables are converted to integer labels with `LabelEncoder` |
| 36 | +- Numerical features are standardized (zero mean, unit variance) with `StandardScaler` |
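The three steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code in `model.py`: the `preprocess` helper, the made-up sample rows, and the choice of which columns are treated as numerical or categorical are all hypothetical.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import LabelEncoder, StandardScaler


def preprocess(df, numeric_cols, categorical_cols):
    """Hypothetical helper: impute, encode, and scale a copy of df."""
    out = df.copy()

    # 1. Fill missing numerical values with the column mean.
    out[numeric_cols] = out[numeric_cols].fillna(out[numeric_cols].mean())

    # 2. Encode each categorical column as integer labels.
    for col in categorical_cols:
        out[col] = LabelEncoder().fit_transform(out[col])

    # 3. Standardize numerical columns to zero mean and unit variance.
    out[numeric_cols] = StandardScaler().fit_transform(out[numeric_cols])
    return out


# Made-up sample rows using column names from the feature list.
sample = pd.DataFrame({
    "Age": [25.0, np.nan, 60.0, 35.0],
    "Gender": ["F", "M", "F", "M"],
    "Neighbourhood": ["A", "B", "A", "C"],
})
processed = preprocess(sample, ["Age"], ["Gender", "Neighbourhood"])
print(processed)
```

This sketch fits the encoders and scaler on the full sample for brevity. When evaluating a model, they would normally be fit on the training split only and then applied to the test split, so that test data does not leak into the preprocessing statistics.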
| 37 | + |
| 38 | +## Requirements |
| 39 | +- Python 3.7+ |
| 40 | +- pandas |
| 41 | +- numpy |
| 42 | +- scikit-learn |
| 43 | + |
| 44 | +## Usage |
| 45 | +See `model.py` for implementation details and usage examples. |