This project presents a complete downstream analysis pipeline for the PBMC3k single-cell RNA-seq dataset, built with the Scanpy library in Python. The dataset contains ~3,000 peripheral blood mononuclear cells (PBMCs) from 10x Genomics.
- Perform quality control and filtering
- Normalize data and identify variable genes
- Run PCA and UMAP for dimensionality reduction
- Cluster cells and identify marker genes
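
The steps above can be sketched roughly as follows. This is a minimal illustration, not the project's exact code. It assumes Scanpy's bundled `sc.datasets.pbmc3k()` loader as a stand-in for the Bioconductor source, Leiden clustering (which needs the optional `leidenalg` package), and Wilcoxon-based marker ranking. The threshold values in the hypothetical `PARAMS` dictionary follow commonly used Scanpy tutorial defaults and should be tuned for your own data.

```python
# Illustrative thresholds (assumed, not taken from this project).
PARAMS = {
    "min_genes": 200,       # drop cells expressing fewer genes than this
    "min_cells": 3,         # drop genes detected in fewer cells than this
    "max_genes": 2500,      # drop cells with more genes (possible doublets)
    "max_pct_mt": 5.0,      # drop cells with higher mitochondrial percentage
    "target_sum": 1e4,      # library-size normalization target per cell
    "n_neighbors": 10,
    "n_pcs": 40,
}


def run_pbmc3k_pipeline(params=PARAMS):
    """Run QC, normalization, PCA/UMAP, clustering, and marker detection."""
    # Imported lazily so this module can be loaded without Scanpy installed.
    import scanpy as sc

    # Load raw counts (downloads the dataset on first use).
    adata = sc.datasets.pbmc3k()

    # Quality control and filtering.
    sc.pp.filter_cells(adata, min_genes=params["min_genes"])
    sc.pp.filter_genes(adata, min_cells=params["min_cells"])
    adata.var["mt"] = adata.var_names.str.startswith("MT-")
    sc.pp.calculate_qc_metrics(
        adata, qc_vars=["mt"], percent_top=None, log1p=False, inplace=True
    )
    keep = (adata.obs["n_genes_by_counts"] < params["max_genes"]) & (
        adata.obs["pct_counts_mt"] < params["max_pct_mt"]
    )
    adata = adata[keep.to_numpy(), :].copy()

    # Normalization and highly variable gene selection.
    sc.pp.normalize_total(adata, target_sum=params["target_sum"])
    sc.pp.log1p(adata)
    sc.pp.highly_variable_genes(adata, min_mean=0.0125, max_mean=3, min_disp=0.5)
    adata.raw = adata  # keep log-normalized values for marker testing
    adata = adata[:, adata.var["highly_variable"]].copy()
    sc.pp.scale(adata, max_value=10)

    # Dimensionality reduction.
    sc.tl.pca(adata, svd_solver="arpack")
    sc.pp.neighbors(adata, n_neighbors=params["n_neighbors"], n_pcs=params["n_pcs"])
    sc.tl.umap(adata)

    # Clustering (Leiden is an assumption) and marker genes per cluster.
    sc.tl.leiden(adata)
    sc.tl.rank_genes_groups(adata, "leiden", method="wilcoxon")
    return adata
```

Calling `adata = run_pbmc3k_pipeline()` returns an `AnnData` object with cluster labels in `adata.obs["leiden"]` and UMAP coordinates in `adata.obsm["X_umap"]`, which can then be plotted with `sc.pl.umap(adata, color="leiden")`.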
Scanpy
- Source: TENxPBMCData::pbmc3k (Bioconductor), ~3,000 PBMCs processed using 10x Genomics Chromium
Gunjan Sarode
🌐 LinkedIn | 📫 gunjansarode.bioinfo@gmail.com
This project is for educational and research use.