docs: Add cluster, complement, and subtract to range operations documentation
Update feature comparison table, API comparison table, coordinate system
mermaid diagram, and algorithm description to include the newly implemented
cluster, complement, and subtract operations.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
docs/api.md (1 addition & 1 deletion)
@@ -2,7 +2,7 @@ polars-bio API is grouped into the following categories:
 
 - **[File I/O](#polars_bio.data_input)**: Reading files in various biological formats from **local** and **[cloud](/polars-bio/features/#cloud-storage)** storage.
 - **[Data Processing](#polars_bio.data_processing)**: Exposing end user to the rich **SQL** programming interface powered by [Apache Datafusion](https://datafusion.apache.org/user-guide/sql/index.html) for operations, such as sorting, filtering and other transformations on input bioinformatic datasets registered as tables. You can easily query and process file formats such as *VCF*, *GFF*, *BAM*, *FASTQ*, *Pairs* using SQL syntax.
-- **[Interval Operations](#polars_bio.range_operations)**: Functions for performing common interval operations, such as *overlap*, *nearest*, *coverage*.
+- **[Interval Operations](#polars_bio.range_operations)**: Functions for performing common interval operations, such as *overlap*, *nearest*, *coverage*, *merge*, *cluster*, *complement*, and *subtract*.
 - **[Pileup Operations](#polars_bio.pileup_operations)**: Per-base read depth computation from BAM/SAM/CRAM files using CIGAR operations, similar to mosdepth/samtools depth.
1. There is an [overlap](https://pyranges.readthedocs.io/en/latest/autoapi/pyranges/index.html#pyranges.PyRanges.overlap) method in PyRanges, but its output is only limited to indices of intervals from the other Dataframe that overlap.
docs/supplement.md (1 addition & 1 deletion)
@@ -3,7 +3,7 @@
 This document provides additional information about the algorithms, benchmarking setup, data, and results that were presented in the manuscript.
 
 ## Algorithm description
-`polars-bio` implements a set of *binary* interval operations on genomic ranges, such as *overlap*, *nearest*, *count-overlaps*, and *coverage*. All these operations share the very similar algorithmic structure, which is presented in the diagram below.
+`polars-bio` implements a set of interval operations on genomic ranges, including *binary* operations (*overlap*, *nearest*, *count-overlaps*, *coverage*, *subtract*) and *unary* operations (*merge*, *cluster*, *complement*). The binary operations share a very similar algorithmic structure, which is presented in the diagram below. The unary operations (*merge*, *cluster*, *complement*) take a single set of intervals and produce transformed output: merged intervals, cluster assignments, or gap intervals respectively.
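For reviewers less familiar with the three unary operations named in the added line, the following is a minimal pure-Python sketch of what each one produces. It is not the `polars-bio` implementation or API: the function names, the single-chromosome scope, the half-open `[start, end)` convention, and the choice to treat touching intervals as one group are all assumptions made for illustration.

```python
from typing import List, Tuple

# Assumption: half-open [start, end) intervals on a single chromosome.
Interval = Tuple[int, int]


def merge(intervals: List[Interval]) -> List[Interval]:
    """Collapse overlapping or touching intervals into merged intervals."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Extends the current merged interval.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def cluster(intervals: List[Interval]) -> List[Tuple[int, int, int]]:
    """Return the sorted intervals, each tagged with a cluster id."""
    out: List[Tuple[int, int, int]] = []
    cluster_id = -1
    cluster_end = None
    for start, end in sorted(intervals):
        if cluster_end is None or start > cluster_end:
            # Gap before this interval: open a new cluster.
            cluster_id += 1
            cluster_end = end
        else:
            cluster_end = max(cluster_end, end)
        out.append((start, end, cluster_id))
    return out


def complement(intervals: List[Interval], chrom_start: int, chrom_end: int) -> List[Interval]:
    """Return the gaps within [chrom_start, chrom_end) not covered by any interval."""
    gaps: List[Interval] = []
    cursor = chrom_start
    for start, end in merge(intervals):
        if start > cursor:
            gaps.append((cursor, start))
        cursor = max(cursor, end)
    if cursor < chrom_end:
        gaps.append((cursor, chrom_end))
    return gaps


example = [(1, 5), (3, 8), (10, 12)]
print(merge(example))
print(cluster(example))
print(complement(example, 0, 15))
```

On the example input, the first two intervals overlap and the third stands alone, so *merge* yields two intervals, *cluster* keeps all three rows but assigns two cluster ids, and *complement* returns the three gaps around them.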