- Dataset: Student Performance Dataset.
- Performed data cleaning, EDA using
matplotliband `seaborn". - Created a binary
pass/failtarget column for it. - Trained and evaluated Logistic Regression and Random Forest.
- Evaluated using Accuracy, Confusion Matrix, and F1-score.
📎 See my file: Student_Performance_Project.ipynb
- Database: Chinook Music Store
- SQL queries include:
- Top 5 customers by purchase
- Most popular genre
- Managers and their subordinates
- Most sold album per artist
- Monthly sales trends for 2013
📎 See: chinook_queries.sql
- Dataset: NYC Airbnb Open Data
- Dashboard contains:
- Listings count by neighborhood group
- Price distribution by room type
- Availability trends
- Interactive filters for neighborhood and room type
🔗 Tableau Public Link: CLICK HERE
📎 See My Tableau: Tableau_Dashboard_Link.txt
- Dataset: Online Retail Dataset
- Tasks completed:
- Cleaned data (nulls, duplicates)
- Added
TotalSalescolumn (Quantity × UnitPrice) - Created Pivot Table (Sales by Country and Month)
- Calculated Average Order Value & % contribution
- Highlighted top 5 countries by revenue
- Charted monthly revenue trend
📎 See: OnlineRetail_Analysis.xlsx ar here [https://docs.google.com/spreadsheets/d/1CdaYRAuo1UTDXpTn5AuoZdW5PLg-QLOr/edit?usp=sharing&ouid=115343273173524923923&rtpof=true&sd=true]
📎 See: Bonus_Response.docx
I explained how I would support struggling students and how I'd simplify the concept of Gradient Descent for beginners using analogies and visuals.
📎 See: Section6_AI_Tools.txt
- I used ChatGPT to help with SQL logic and formula suggestions.
- Shared the exact prompt and AI response
- Added a short reflection on what the AI did well and what I tweaked
🎥 Link to 10–15 min Screen Recording: [https://drive.google.com/file/d/18O7NETXnWbVdlpTta4WQnLhMpCXabPik/view?usp=drive_link]
Thank you for this opportunity! I look forward to the possibility of supporting students as a Teaching Assistant.