Hello,
I have a question regarding the binning process in Genes2Genes.
When the distribution of values is skewed (e.g., most cells concentrated around 0.4–0.6), the bins determined by optbinning can cover regions that contain no cells. As a result, a bar plot shows fewer bars than n_bins, and the number of bars differs between the query and reference distributions because their empty bins differ.
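To make the situation concrete, here is a minimal sketch with NumPy. It is not the Genes2Genes or optbinning code: it uses equal-width bins from `numpy.histogram` as a stand-in, and the data and the `bin_counts` helper are made up for illustration. It shows how a range fixed to 0–1 produces empty bins for skewed values, while both count vectors still have length `n_bins`.

```python
import numpy as np

def bin_counts(values, n_bins=10, value_range=(0.0, 1.0)):
    """Count cells per equal-width bin over a fixed range, keeping empty bins.

    Hypothetical helper: equal-width bins stand in for the real binning.
    """
    counts, edges = np.histogram(values, bins=n_bins, range=value_range)
    return counts, edges

rng = np.random.default_rng(0)
# Made-up skewed values, concentrated in the middle of the 0-1 range.
query = rng.uniform(0.4, 0.6, size=500)
reference = rng.uniform(0.3, 0.7, size=500)

q_counts, edges = bin_counts(query)
r_counts, _ = bin_counts(reference)

# Indices of bins that contain no cells in each distribution.
empty_in_query = np.flatnonzero(q_counts == 0)
empty_in_reference = np.flatnonzero(r_counts == 0)

print("query counts:    ", q_counts)
print("reference counts:", r_counts)
print("empty query bins:", empty_in_query)
```

If the zero counts are kept and plotted against the shared `edges`, the query and reference plots have the same number of bars. Dropping bins where the count is zero is what makes the bar counts differ.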
My questions are:
- Should empty bins be excluded from the analysis since they contain no relevant data? It seems that this issue arises because the bin range is fixed between 0 and 1. What are your thoughts on this?
- The first and last bins that contain cells are likely to have fewer cells than the others. Is the matching test (assigning IDMWV) robust enough to handle such sparsely populated edge bins?
I would appreciate your insights on these issues. Thank you!
