Add coverage #273

Phlya · 2025-07-15T15:28:53Z

Adds a simple calculation of coverage from pairs. It can use read1, read2, or both. Each position can be shifted by a fixed distance depending on strand (useful, e.g., for nucleosome dyads from micro-C). If the file contains pos51/pos31/... columns, those can be used instead of pos1/pos2. Coverage can be computed at base-pair resolution or with some binning.
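To make the description above concrete, here is a minimal pure-Python sketch of counting read ends per bin with a strand-dependent shift. It is not the code in this PR: the function names, the `side` values (`"1"`, `"2"`, `"both"`), the tuple layout, and the shift direction (downstream along the read's own strand) are all assumptions made for illustration.

```python
from collections import Counter


def shifted_position(pos, strand, shift):
    """Move a 1-based position `shift` bp downstream along its own strand.

    The direction is an assumption of this sketch: '+' moves right, '-' moves left.
    """
    return pos + shift if strand == "+" else pos - shift


def coverage_from_pairs(pairs, side="1", shift=0, binsize=1):
    """Count read ends per (chrom, 0-based bin start).

    `pairs` is an iterable of hypothetical tuples
    (chrom1, pos1, chrom2, pos2, strand1, strand2) with 1-based positions.
    """
    counts = Counter()
    for chrom1, pos1, chrom2, pos2, strand1, strand2 in pairs:
        ends = []
        if side in ("1", "both"):
            ends.append((chrom1, pos1, strand1))
        if side in ("2", "both"):
            ends.append((chrom2, pos2, strand2))
        for chrom, pos, strand in ends:
            p = shifted_position(pos, strand, shift)
            if p < 1:
                continue  # shifted off the start of the chromosome
            bin_start = (p - 1) // binsize * binsize
            counts[(chrom, bin_start)] += 1
    return counts
```

With `binsize=1` this gives base-pair resolution; a larger `binsize` aggregates the same counts into fixed-width bins.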

Still needs tests.

It would be nice to stream the output instead of generating a giant dataframe for the whole genome, but that is actually only possible for pos1 (since it comes first in the sort order). Also, at the moment multiprocessing doesn't seem to help much, if at all; perhaps it can be optimized somehow.
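As a rough illustration of the streaming idea, the sketch below emits one row per bin as soon as the input moves past it, so nothing genome-wide is held in memory. It assumes the records arrive ordered by (chrom1, pos1) and that no strand shift is applied; if the same bin can reappear later in the stream (for example in a separate chrom2 block), the emitted rows would have to be summed downstream. The function name and tuple layout are hypothetical.

```python
from itertools import groupby


def stream_side1_coverage(sorted_pairs, binsize=1):
    """Yield (chrom, start, end, count) for side 1, one bin at a time.

    `sorted_pairs` is an iterable of tuples whose first two fields are
    chrom1 and a 1-based pos1, assumed to be ordered by (chrom1, pos1).
    """

    def bin_key(pair):
        chrom1, pos1 = pair[0], pair[1]
        return chrom1, (pos1 - 1) // binsize * binsize

    # groupby only merges consecutive records, which is what makes this streamable.
    for (chrom, bin_start), group in groupby(sorted_pairs, key=bin_key):
        yield chrom, bin_start, bin_start + binsize, sum(1 for _ in group)
```

The same approach does not work for pos2, because pos2 values are not ordered along the stream.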

agalitsyna

Comments after a brief checkout; mostly my own misunderstandings, plus some generally mysterious parts of the code highlighted.

agalitsyna · 2025-08-11T15:27:08Z

pairtools/cli/coverage.py

+    type=int,
+    default=0,
+    show_default=True,
+    help="Shift value for strand-specific adjustment",


This description feels a bit incomplete. Maybe add a note on the zero point relative to which the shift will be performed? Also, what is the direction of the shift?

agalitsyna · 2025-08-11T15:28:40Z

pairtools/cli/coverage.py

+    type=click.Choice(["5", "3", "0"]),
+    default="0",
+    show_default=True,
+    help="5', 3' end, or 0 (default) - whatever is reported as pos1 and pos2 in pairs",


What does the default "0" mean here?

How does this parameter interact with the "side" parameter?

Default is whatever is reported in pos1 or pos2 (so it depends on how the pairs were generated).

Well... It calculates coverage of the 5', 3' or default fragment ends on the specified side (i.e. --side 1 --end 5 will use pos51, --side 2 --end 3 will use pos32).
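The interaction between the two options can be spelled out as a small column-name lookup. This is only a sketch of the mapping described in the reply above (pos51, pos32, or plain pos1/pos2 for the default); the helper name and the `"both"` value for side are assumptions, not the PR's actual code.

```python
def position_columns(side, end):
    """Return the pairs columns to read for a given --side and --end.

    side: "1", "2" or "both" (hypothetical spelling of the both-sides option)
    end:  "5", "3" or "0" (0 means whatever is reported as pos1/pos2)
    """
    if side not in ("1", "2", "both"):
        raise ValueError(f"unknown side: {side!r}")
    if end not in ("5", "3", "0"):
        raise ValueError(f"unknown end: {end!r}")
    sides = ["1", "2"] if side == "both" else [side]
    prefix = "pos" if end == "0" else f"pos{end}"
    return [prefix + s for s in sides]
```

For example, `position_columns("1", "5")` gives `["pos51"]` and `position_columns("2", "3")` gives `["pos32"]`, matching the two cases in the reply.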

agalitsyna · 2025-08-11T15:30:12Z

pairtools/cli/coverage.py

+    type=str,
+    required=False,
+)
+@click.option(


Instead of having two separate parameters "side" and "end", maybe it's better to allow the user to submit the list of columns to use for the coverage calculation? It's just a suggestion, and it might not be the smartest one.

Interesting idea. How would you specify it in case you want to have the coverage by midpoint or whole length, as you suggest below and I also wanted to implement anyway?

agalitsyna · 2025-08-11T15:32:07Z

pairtools/cli/coverage.py

+    default="1",
+    show_default=True,
+    help="0: both sides, 1: first side, 2: second side",
+)


Might be a good idea to allow using (1) the midpoints of the reads for calculation and (2) whole length of the read (each nucleotide of the read adds +1 to the coverage)

Yes, will add it! At least the midpoint I was planning anyway for sure.
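For reference, the two modes suggested above could look roughly like this in pure Python. The sketch assumes each read is described by its 5' and 3' positions (1-based, inclusive, in either order depending on strand), such as the pos51/pos31 columns mentioned in the PR description; the function names are hypothetical. Whole-read coverage uses a difference array so that each read costs two updates regardless of its length.

```python
from itertools import accumulate


def midpoint(pos5, pos3):
    """Integer midpoint of a read given its 5' and 3' positions."""
    return (pos5 + pos3) // 2


def whole_read_coverage(reads, chrom_length):
    """Per-base coverage where every nucleotide of a read adds +1.

    `reads` is an iterable of (pos5, pos3) tuples on one chromosome,
    with 1-based inclusive positions not exceeding `chrom_length`.
    Returns a list of length `chrom_length` (index 0 is position 1).
    """
    diff = [0] * (chrom_length + 1)
    for pos5, pos3 in reads:
        start, end = min(pos5, pos3), max(pos5, pos3)
        diff[start - 1] += 1  # coverage rises at the first base of the read
        diff[end] -= 1        # and drops just past its last base
    return list(accumulate(diff[:-1]))
```

Midpoint coverage can then reuse the same counting path as the read-end modes by feeding `midpoint(pos5, pos3)` in place of a single end position.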

agalitsyna · 2025-08-11T15:33:59Z

pairtools/cli/coverage.py

+    "first column lists scaffold names. If not provided, will be extracted "
+    "from the header of the input pairs file.",
+)
+@click.option(


Is there a way to disable the bgzip-/lz4c-compressed output file and write the bigwig only? If the bgzip-/lz4c- output is required, it will put an enormous burden on IO when you only need the bigwig...

I think in pairtools we always have a text output, no? The way bioframe.to_bigwig works, it saves to text and then converts anyway, so the burden is only doubled... But I guess we can at least default to writing to stdout, and then if you don't want it you can dump it to /dev/null?
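A minimal sketch of the "default to stdout" idea: write the text output to whatever stream is passed in, with `sys.stdout` as the default, so the caller can redirect or discard it. The function name and the bedGraph-like four-column layout are assumptions for illustration, not what this PR writes.

```python
import io
import sys


def write_bedgraph(rows, out=None):
    """Write (chrom, start, end, value) rows as tab-separated text.

    If `out` is None, the rows go to stdout, so the text can be piped
    elsewhere or redirected to /dev/null by the caller.
    """
    stream = sys.stdout if out is None else out
    for chrom, start, end, value in rows:
        stream.write(f"{chrom}\t{start}\t{end}\t{value}\n")


# Usage: capture the text in memory instead of printing it.
buffer = io.StringIO()
write_bedgraph([("chr1", 0, 100, 3), ("chr1", 100, 200, 1)], out=buffer)
```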

agalitsyna · 2025-08-11T15:35:13Z

pairtools/lib/coverage.py

+    return coverage_df
+
+
+def save_coverage(


Empty function?

Phlya added 4 commits July 15, 2025 14:54

Add coverage to init

0756eb9

actually add it to cli

358213d

fix bigwig saving

a0aca5a

Fix coverage for both sides

9812834

agalitsyna reviewed Aug 11, 2025

Phlya added 2 commits August 13, 2025 11:42

Address comments, e.g. "both" option for side, and improve help text …

e5374c7

…for shift opt

remove wrong import

1c8b00b
