Is there a way to combine paired-end and single-read RNASeq for a genome annotation?

elpren · May 7, 2025, 9:07am

Hi!

I’m using published data and have no way of generating new RNA-Seq data. There are two different sets of RNA-Seq data published for my non-model organism, one is paired-end reads and one is single-reads. I’m using HISAT2 to align and BRAKER3 to annotate. I’ve noticed that some annotations generated by the single library are missed from the paired library, and vice versa. Of course, most tools ask you to specify whether your data is single or paired, so these datasets aren’t compatible with each other.

Is there any way to combine these annotations in some way?

Thank you!

jennaj · May 7, 2025, 6:21pm

Hi @elpren

There is probably a way to do this once the data is has been reduced. You can also sometimes create different grouping levels where the sequencing method is factor but this depends on the tool.

What annotation files do you have right now and how do you want these combined?

elpren · May 8, 2025, 10:21am

Hi,

I have two GTF and GFF annotation files, one set from the paired library and one from the single library, both generated with BRAKER3. Ideally I’d like to combine the annotated features into one annotation file, but without duplicates that are annotated from both libraries. I’m not sure if this is possible at all, especially because lots of the attribute IDs are reused (they’re all just for example, g1.t1, g2.t1, g2.t2 etc.)

Currently I’m just viewing them as two different tracks with JBrowse, but I’d ideally like to do some DE analysis using the combined, more complete annotation.

Thanks for your help!

jennaj · May 8, 2025, 4:03pm

Hi @elpren

Thanks for explaining! I would suggest looking at the tool GffCompare.

You’ll be able to control how the merging is done. Maybe run a few times to decide on the best parameter choices for your goals?

elpren · May 19, 2025, 12:00pm

Hi @jennaj

Apologies for the delay in reply! Using GffCompare with default parameters worked great, thanks so much.

jennaj · May 19, 2025, 9:46pm

Great, glad it worked and thanks for letting us know!

Topic		Replies	Views
Generating a New GFF file from a previously merged Stringtie File reference-annotation , genome-annotation	1	34	July 5, 2024
TSEBRA-input files annotation , tool-help , tsebra	4	51	December 18, 2024
paired-end RNA seq with biological replicates usegalaxy.eu support gtn-tutorial , workflow	9	1298	February 7, 2022
RNA-seq reads to counts with pair-end data usegalaxy.eu support workflow	1	302	February 28, 2023
HOw do you create a CTF/GFF file in Galaxy?	7	95	July 9, 2024

Is there a way to combine paired-end and single-read RNASeq for a genome annotation?

Related topics