Welcome! This document is your starting point for building the Dengue Variant Tracker Dashboard.
A professional bioinformatics portfolio project featuring:
- Interactive R Shiny dashboard for dengue virus analysis
- Automated genomic data processing pipeline
- Quality control and motif detection
- Real-world public health application
- Cloud deployment (free hosting)
- Complete GitHub repository
Time Required: 2-3 weeks part-time (~20-30 hours total)
Difficulty: Intermediate (suitable for bioinformatics students/graduates)
RAM Requirement: 4GB minimum (optimized for low resources)
✅ R programming and Bioconductor ecosystem
✅ Genomic sequence analysis (FASTA, quality control, motif detection)
✅ Data visualization with ggplot2 and plotly
✅ Interactive web apps with Shiny
✅ Version control with Git/GitHub
✅ Cloud deployment (shinyapps.io)
✅ Shell scripting for automation
✅ Sequence quality assessment
✅ Motif and pattern matching
✅ Viral genomics basics
✅ Public health data analysis
✅ Reproducible research workflows
Here's what each file does:
| File | Purpose |
|---|---|
| README.md | Complete project documentation |
| QUICKSTART.md | 15-minute setup guide (read this next!) |
| PROJECT_GUIDE.md | Detailed week-by-week walkthrough |
| DEPLOYMENT.md | How to publish your dashboard online |
| download_data.sh | Downloads dengue sequences from NCBI |
| qc_analysis.R | Quality control and motif analysis script |
| app.R | Shiny dashboard application |
| install_packages.R | Installs all required R packages |
| .gitignore | Git configuration (what not to track) |
For those ready to start immediately:
- Read: QUICKSTART.md (15 minutes)
- Install: Run
install_packages.R(10 minutes) - Download: Run
download_data.sh(3 minutes) - Analyze: Run
qc_analysis.R(5 minutes) - Launch: Run
shiny::runApp('app.R')(immediate)
Total time: ~35 minutes to see your dashboard running!
For those who prefer learning before doing:
- Read: README.md - full project overview (20 minutes)
- Study: PROJECT_GUIDE.md - week-by-week plan (30 minutes)
- Then: Follow Path A above
Just browsing? Here's the highlights:
- Skim: README.md - what the project does
- Check: app.R - see the dashboard code
- View: Sample plots would be in plots/ folder after running analysis
Before starting, ensure you have:
- Computer: Windows, Mac, or Linux with 4GB+ RAM
- R: Version 4.0 or higher (download)
- RStudio: Latest version (download) - optional but recommended
- Git: For version control (download)
- Internet: For downloading packages and data
- Time: 2-3 hours for initial setup and first run
Don't have something? The QUICKSTART.md file has installation links.
For best results, read in this order:
- ✅ START_HERE.md (you are here!)
- → QUICKSTART.md - get everything running
- → README.md - understand what you built
- → PROJECT_GUIDE.md - follow day-by-day
- → Review individual script files with comments
- → DEPLOYMENT.md - publish your work
- Search README.md for specific topics
- Check PROJECT_GUIDE.md for troubleshooting
- Refer to code comments in .R files
Week 1: Setup & Analysis
├─ Install R, packages, Git
├─ Download dengue data
├─ Run quality control
└─ Understand Bioconductor
Week 2: Dashboard Development
├─ Learn Shiny basics
├─ Build interactive UI
├─ Add custom features
└─ Test and refine
Week 3: Deployment & Showcase
├─ Deploy to cloud
├─ Create documentation
├─ Share on LinkedIn
└─ Update portfolio
- Start small: Use the test dataset (10 sequences)
- Read comments: Every script has detailed explanations
- Google errors: They're normal! Part of learning
- Ask questions: StackOverflow, RStudio Community
- Don't skip install_packages.R: It sets everything up
- Use RStudio: Makes R much easier
- Try examples: Modify code to see what changes
- Watch tutorials: Links in PROJECT_GUIDE.md
- Follow DEPLOYMENT.md exactly: Step by step
- Free tier is enough: No need to pay
- Test locally first: Make sure it works on your computer
- Save deployment token: You'll need it multiple times
Q: Will this work on my low-spec laptop?
A: Yes! Optimized for 4GB RAM. Uses small datasets.
Q: Do I need to know biology?
A: No! Basic concepts are explained. Project focuses on analysis skills.
Q: How much R do I need to know?
A: Basic R helps, but code is well-commented. You'll learn as you go.
Q: Can I customize this for another virus?
A: Absolutely! Once you understand it, adapt for Zika, flu, etc.
Q: Is this a real research project?
A: It's a portfolio project demonstrating real bioinformatics workflows.
Q: Will this help me get a job?
A: Yes! Shows practical skills employers want. Perfect for interviews.
Q: What if I get stuck?
A: Check PROJECT_GUIDE.md troubleshooting section, ask in R forums, or revisit documentation.
You'll know you're successful when:
✅ Dashboard runs locally without errors
✅ You understand what each script does
✅ App is deployed and accessible via URL
✅ GitHub repo is public and documented
✅ LinkedIn profile shows this project
✅ You can explain it in an interview
Choose your next step:
→ Open QUICKSTART.md and start building (recommended)
→ Open PROJECT_GUIDE.md and review week 1
→ Open README.md and read the complete overview
→ Open app.R and examine the dashboard code
If you encounter issues:
- Check PROJECT_GUIDE.md - Has troubleshooting section
- Search error messages - On Google/StackOverflow
- R Documentation - In R console:
?function_name - Community Forums:
- RStudio Community: https://community.rstudio.com/
- Bioconductor Support: https://support.bioconductor.org/
- Stack Overflow: Tag questions with [r] [shiny] [bioconductor]
This project demonstrates skills that employers look for:
💼 For Bioinformatics Roles:
- "Shows end-to-end pipeline development"
- "Practical application to public health"
- "Modern tools: R, Bioconductor, cloud deployment"
📊 For Data Science Roles:
- "Interactive visualization expertise"
- "Clean, documented code"
- "Real-world problem solving"
🔬 For Research Positions:
- "Reproducible analysis workflow"
- "Open science principles"
- "Public data integration"
Most Important Commands:
# Install packages
Rscript install_packages.R
# Download data
./download_data.sh
# Run analysis
Rscript qc_analysis.R
# Launch dashboard
Rscript -e "shiny::runApp('app.R')"
# Deploy to cloud
rsconnect::deployApp()Key File Paths:
data/raw/ - Downloaded sequences
data/processed/ - Analysis results
plots/ - Generated visualizations
app.R - Dashboard code
This project is designed to be:
- Achievable: Even with limited resources
- Educational: Learn by doing
- Portfolio-ready: Impress employers
- Expandable: Add features as you grow
Remember: Everyone starts somewhere. The goal isn't perfection—it's progress and learning.
Choose one:
- For Quick Results: Open QUICKSTART.md → Start building in 15 minutes
- For Deep Understanding: Open PROJECT_GUIDE.md → Follow week-by-week plan
- For Overview: Open README.md → Read full documentation
Good luck! You're about to create something awesome. 🚀
Questions? Everything is explained in the documentation files.
Start your journey → Open QUICKSTART.md