Handling empty bins in distribution #13

@kim1117

Hello,

I have a question regarding the binning process in Genes2Genes.

When the distribution of values is skewed (e.g., most cells concentrated around 0.4–0.6), the bins determined by optbinning can cover regions that contain no cells. As a result, when I plot a bar graph, the number of bars does not match n_bins, and the number of bars differs between the query and reference distributions because of these empty bins.
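
To make the situation concrete, here is a minimal sketch of how the empty bins arise. It assumes equal-width bins over the fixed range [0, 1], built with NumPy as a stand-in; it does not call optbinning or Genes2Genes, and the helper name `count_per_bin` and the simulated values are hypothetical.

```python
import numpy as np


def count_per_bin(values, n_bins=10):
    """Count cells in equal-width bins over the fixed range [0, 1]."""
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    counts, _ = np.histogram(values, bins=edges)
    return counts


rng = np.random.default_rng(0)

# Hypothetical skewed data: every cell lies in a narrow central range.
query = rng.uniform(0.4, 0.6, size=500)
reference = rng.uniform(0.3, 0.7, size=500)

query_counts = count_per_bin(query)
reference_counts = count_per_bin(reference)

# A bar plot that skips zero-count bins draws only the non-empty ones.
print("non-empty query bins:", np.count_nonzero(query_counts))
print("non-empty reference bins:", np.count_nonzero(reference_counts))
```

With data like this, both distributions produce fewer bars than n_bins, and the query and reference end up with different numbers of bars.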

Image

My questions are:

  1. Should empty bins be excluded from the analysis since they contain no relevant data? It seems that this issue arises because the bin range is fixed between 0 and 1. What are your thoughts on this?

  2. The first and last non-empty bins are likely to contain fewer cells. Is the matching test (assigning IDMWV) robust enough to handle such sparsely populated edge bins?

I would appreciate your insights on these issues. Thank you!
