Seeking advice on amount of ambiguity in featureCounts

jennaj · August 28, 2019, 10:48pm

Your read data appears to have high duplication. If you run FastQC you’ll find more details/confirmation about read duplication in those reports.

QA (tool: Trimmomatic) won’t help if the source data actually has high redundancy. The cause could be low-quality sequencing results, or very deep sequencing. Contamination could also be a factor, but removing those reads won’t get more data assigned to a known gene from your reference annotation; it will just reduce the final number of “unassigned-ambiguity” reads later on in the pipeline.
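If it helps to put a number on the ambiguity before and after any changes, here is a minimal sketch that reads a featureCounts summary and reports each status as a fraction of all reads. It assumes the usual tab-separated summary layout (a "Status" column followed by one column per BAM). The function name and the example counts are made up for illustration and are not from your data.

```python
import csv
import io


def summary_fractions(text):
    """Return {sample: {status: fraction}} from featureCounts summary text.

    Assumes a tab-separated table: a header row starting with "Status",
    then one row per status with one count column per sample.
    """
    rows = [r for r in csv.reader(io.StringIO(text), delimiter="\t") if r]
    header, body = rows[0], rows[1:]
    result = {}
    for col, sample in enumerate(header[1:], start=1):
        counts = {row[0]: int(row[col]) for row in body}
        total = sum(counts.values())
        result[sample] = {
            status: (count / total if total else 0.0)
            for status, count in counts.items()
        }
    return result


# Hypothetical summary content, for illustration only.
EXAMPLE = (
    "Status\tsample1.bam\n"
    "Assigned\t600\n"
    "Unassigned_Ambiguity\t300\n"
    "Unassigned_NoFeatures\t100\n"
)

fractions = summary_fractions(EXAMPLE)
for status, frac in fractions["sample1.bam"].items():
    print(f"{status}\t{frac:.1%}")
```

To use it on a real run, download the summary dataset from the history and pass its contents to summary_fractions in place of EXAMPLE.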

One note: It is important to run HISAT2 with the option to output results that are formatted for StringTie. That is covered in the tutorial but is sometimes missed. Worth double-checking. Use the “rerun” (double circle icon) for the mapping jobs to review what options you used.

More DE analysis tutorials can be found here under the group “Transcriptomics” if you want to compare methods/tool choices:

Troubleshooting resources for errors or unexpected results

Hope that helps!
