Possibility to extend known gene models by adding UTRs

Dear Bambu Developers,
I am currently using bambu to analyze ONT direct RNA sequencing data from a leafy vegetable species. My dataset comprises 9 samples, each with 3 biological replicates. Unlike Arabidopsis, this species is not well-annotated, though both its genome and GTF annotation have been published.

Using bambu, I have generated an extended annotation (GTF). While the tool performs well in identifying novel genes and transcripts, I noticed that many existing gene models are not extended into their UTRs, despite strong read support (hundreds of reads across multiple libraries). I have attached IGV screenshots illustrating such cases.

![Image](https://github.com/user-attachments/assets/dc4aba6d-6f63-4947-8e94-18488bce2f98)
![Image](https://github.com/user-attachments/assets/d455aa7e-97cb-4360-afba-355a9e9881fb)

In an attempt to improve this, I experimented with different parameter settings in the bambu() function, but the 3' UTRs of many annotated genes remain truncated. Below is my latest command (NDR = 0.378):

```r
se <- bambu(
  reads = bam_files,
  annotations = bambuAnnotations,
  genome = fa.file,
  ncore = 10,
  discovery = TRUE,
  quant = TRUE,
  opt.discovery = list(min.sampleNumber = 2, min.readCount = 5)
)
```

Is there a way to allow bambu to extend known transcripts, particularly in the UTR regions, when there is strong read support? I am especially interested in improving the annotation of known genes rather than only discovering novel ones.

Any suggestions or recommendations would be greatly appreciated.
