Stringtie Merge

In Stringtie Merge, do we merge the replicates of the treatment samples?

In short, the answer is usually “yes”. The tool can always merge replicates, but whether you want it to depends on the details of your analysis.

How you use the tool, and which inputs you include (per-sample GTFs and/or a known reference GTF), depends on whether you plan to perform the DE analysis on known transcripts/genes only, on discovered (sample) transcripts/genes only, or on both.

Stringtie Merge can be used to do the following:

  1. Pre-process an existing reference annotation GTF dataset, by itself, so it can be used with Stringtie to guide assembly, and with other downstream tools. Not all public GTFs are formatted in a way other tools can interpret and this tool can be used to “groom” a GTF.

  2. Combine the per-sample GTF results of Stringtie, and optionally the reference GTF (knowns), into a unified GTF assembly that can be used with downstream differential expression tools.
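
To make the second use-case concrete, here is a minimal sketch of how the equivalent command-line call could be put together when every replicate's GTF is merged along with an optional reference annotation. The file names and the helper function `build_merge_command` are hypothetical, made up for illustration. `--merge`, `-G` and `-o` are options of the StringTie command line; in Galaxy the tool form sets these for you, so treat this only as a picture of what goes in and what comes out.

```python
from typing import List, Optional


def build_merge_command(
    sample_gtfs: List[str],
    reference_gtf: Optional[str] = None,
    output_gtf: str = "merged.gtf",
) -> List[str]:
    """Build an argument list for `stringtie --merge` (hypothetical helper).

    sample_gtfs   -- per-sample GTFs produced by Stringtie, one per replicate
    reference_gtf -- optional known annotation, passed with -G
    output_gtf    -- name of the unified GTF assembly to write
    """
    if not sample_gtfs:
        raise ValueError("provide at least one per-sample GTF")
    cmd = ["stringtie", "--merge", "-o", output_gtf]
    if reference_gtf is not None:
        # Include the known transcripts in the merged assembly.
        cmd += ["-G", reference_gtf]
    # Every replicate from every condition goes into the same merge.
    cmd += list(sample_gtfs)
    return cmd


if __name__ == "__main__":
    # Hypothetical file names: two control and two treatment replicates.
    replicates = [
        "control_rep1.gtf",
        "control_rep2.gtf",
        "treated_rep1.gtf",
        "treated_rep2.gtf",
    ]
    print(" ".join(build_merge_command(replicates, reference_gtf="known.gtf")))
```

The sketch only builds the command and does not run it. Leaving out `reference_gtf` corresponds to an analysis based on discovered (sample) transcripts only.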

The different use-cases are summarized in more detail in this prior Galaxy Biostars Q&A post: “Question: StringTie and StringTie merge - when to apply the Guide gff (reference annotation file)?”

Please also see the Stringtie tool manual.

It would probably also help to review the Galaxy Tutorials for DE analysis. Several of the RNA-seq protocols include Stringtie and, importantly, the HISAT2 settings needed to create Stringtie-readable BAMs.