Unassigned_Ambiguity problem in featureCounts

Nati2208 · May 6, 2021, 10:10pm

Hi, I’m new on processing raw data from RNA-seq experiments but my recent project requieres me to get it on.

So, I’m using the 12 runs from the ENSEMBL accession number: E-MTAB-4929;
I made the trimming using Cutadapt, also I had to figure out if the reads were stranded or not, so I used Infer Experiment for that and they are paired-end unstranded.
For the alignment I used RNA-STAR, I downloaded the reference genome from Genome NCBI (Glycine max (soybean)) as well as the annotation file in gff version 3 format.

My QC, made with MultiQC, from the alignment seem to me OK, here is a ss:

The problem is when I run the featureCounts;
my input files are the BAM files from the alignment and the anotation file gff version 3 of the Glycine max genome.
For the extra info., I followed the tutorial: Reference-based RNA-Seq data analysis

I made the QC on the summary file [one output file of featureCounts] and I obtained this:

I’m getting less than 50% of the total fragments mapped and I don’t know why :S
¿Can someone please help me?

Flow · May 7, 2021, 11:57am

Dear Nati2208,
The results of featurecounts does not show you the mapping statistics it shows you the counting done by feature count. That is to say, You have 50% of the reads that assign to at least two or more features in your annotation file, e.g., two or more transcripts of your gene, or two or more exons because they map to exon-intron boundaries.

I hope I could answer your questions.

Best wishes,
Florian

Nati2208 · May 7, 2021, 1:55pm

Thank you Flow,

I didn’t know, thank you for correcting me.

Is something I can do to have more assigned counts?

Flow · May 10, 2021, 8:02am

Dear Nati2208,
You can select in featureCount under advanced options different modes for ambiguitiy Allow reads to map to multiple features (.e.g, -O) together with the option Largest overlap to Yes. Test it and see what will happen.

Best wishes,
Florian

Nati2208 · May 10, 2021, 7:20pm

Dear Flow,

I ran again featureCounts with those filters and I obtained these results:

Thank you so much for your advise!

Topic		Replies	Views
featureCounts high Unassigned_NoFeatures - New to RNA-seq! usegalaxy.org support gtn-tutorial , troubleshooting	1	2460	November 10, 2020
High unassigned ambiguity counts for featureCounts data on bacterial transcriptomics usegalaxy.org support picard_markduplicates	2	1811	March 25, 2020
Unassigned Multimapping in featurecounts reference-annotation , salmon	9	9916	June 15, 2021
Seeking advice on amount of ambiguity in featureCounts usegalaxy.org support	7	2109	September 9, 2019
FeatureCounts No Features reference-annotation , reference-genome	3	1925	November 10, 2020

Unassigned_Ambiguity problem in featureCounts

Related topics