Skip to content

Automatically catch intervals which extend beyond the chr? #54

@cafletezbrant

Description

@cafletezbrant

Hi gReLU team,

Here is a reprex of a pair of SNPs that I am trying to predict using Borzoi. The prediction interval of the first is contained within the chr, but not the 2nd.

import grelu 
import grelu.resources
import pandas as pd

df = pd.DataFrame({
    'ref': ['G', 'C'],
    'alt': ['A', 'T'],
    'chrom': ['chr17']*2,
    'start': [80920469, 81005659],
    'end': [80920470, 81005660],
    'pos': [80920469, 81005659]
})
grelu.data.dataset.VariantDataset(variants=df, seq_len=524288, genome='hg19')

This gives the error

AssertionError: All input sequences must have the same length.

It would be great for this to be caught automatically somehow :)

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions