normalization of exported bigwigs files #734
-
Hello! This is more of a naive question and not a bug but I cant specify other labels than 'bug' anymore. When exporting bigwig files using getGroupBW function wouldn't it make sense to normalise by number of cells along with the ReadsInTSS per cluster given that the coverage are represented by sum of fragments across cells ? Thank you! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
@Brawni - We moved questions and feature requests to the Discussions section and only bug reports go into Issues now. I'm migrating your post to Discussions. To answer your question - I'm not sure number of cells makes sense. What if you have multiple samples in your dataset and one of them is sequenced to 1/10th the depth? Then by normalizing to number of cells you are skewing the data. The ReadsInTSS is a pretty stable normalization that ignores how many cells (they are being grouped together into a single track after all) and only cares about the total number of fragments. Thinking of it another way, if you did bulk RNA-seq on 5000 cells and on 50,000 cells, would you normalize the gene expression counts based on the number of cells input? The normal conventions (TPM/RPKM) dont take into account cell number. Same thing applies to ATAC-seq in a way though we use ReadsInTSS instead of total reads because this also accounts for data quality. That being said, you can create arbitrary columns in |
Beta Was this translation helpful? Give feedback.
@Brawni - We moved questions and feature requests to the Discussions section and only bug reports go into Issues now. I'm migrating your post to Discussions.
To answer your question - I'm not sure number of cells makes sense. What if you have multiple samples in your dataset and one of them is sequenced to 1/10th the depth? Then by normalizing to number of cells you are skewing the data. The ReadsInTSS is a pretty stable normalization that ignores how many cells (they are being grouped together into a single track after all) and only cares about the total number of fragments. Thinking of it another way, if you did bulk RNA-seq on 5000 cells and on 50,000 cells, would you normalize the gen…