Dog genome problem with featureCounts

dartagnan32 · March 31, 2021, 11:43am

Following my request to add the dog genome, I ran into a problem when aligning reads and then running featureCounts. The table below shows our counts for CanFam4 (first column) and CanFam3.0 (second column).
We aligned against CanFam3 as provided within Galaxy, and against CanFam4, which we downloaded from NCBI in fna.gz format. Alignment with HISAT2 went well, with a majority of reads aligned.
However, when running featureCounts on the CanFam3 alignments with annotations from UCSC, fewer than 20% of reads are assigned. Most end up in no_features, and many are multimapping.
With CanFam4, which has many more transcripts, we expected the assigned category to increase. Instead, nothing is assigned at all, multimapping and no_features both go up, and the ambiguity group disappears.
Dog is less well annotated than human or mouse, but with CanFam4 we should get better assignment of reads to transcripts. Why do we have 0 assigned reads now, and so many multimapping and no_features? Is it possible that our annotation and genome don’t match?
FYI, our RNA-seq data are paired-end and stranded. Do you need any other info?
We are now trying to download the CanFam4 genome from GenBank instead of NCBI. Maybe we should do the same with the annotation, but I’m not sure which one to use; I only know which one to pick when I am in UCSC.
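
One way to test the mismatch idea is to compare the sequence names in the genome FASTA with those in the first column of the annotation GTF. NCBI FASTA files typically use accession-style names, while UCSC annotations use chr-style names, and featureCounts can only assign a read to a feature when the names agree. Below is a minimal Python sketch of that check. It assumes a plain or gzipped FASTA and a GTF file; the function names and the file paths in the usage comment are hypothetical and are not part of Galaxy or featureCounts.

```python
import gzip


def open_text(path):
    """Open a plain or gzip-compressed text file for reading."""
    if str(path).endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "rt")


def fasta_names(path):
    """Collect sequence names: the first word after '>' on each header line."""
    names = set()
    with open_text(path) as handle:
        for line in handle:
            if line.startswith(">"):
                parts = line[1:].split()
                if parts:
                    names.add(parts[0])
    return names


def gtf_names(path):
    """Collect the sequence names found in column 1 of a GTF/GFF file."""
    names = set()
    with open_text(path) as handle:
        for line in handle:
            # Skip comments, blank lines, and UCSC track/browser lines.
            if not line.strip() or line.startswith(("#", "track", "browser")):
                continue
            names.add(line.split("\t", 1)[0])
    return names


def compare_names(fasta_path, gtf_path):
    """Return the names shared by both files and the names unique to each."""
    genome = fasta_names(fasta_path)
    annotation = gtf_names(gtf_path)
    return {
        "shared": genome & annotation,
        "genome_only": genome - annotation,
        "annotation_only": annotation - genome,
    }


# Usage (hypothetical file names):
# result = compare_names("canfam4_genome.fna.gz", "canfam4_annotation.gtf")
# print(len(result["shared"]), "shared sequence names")
# print(sorted(result["genome_only"])[:5])
# print(sorted(result["annotation_only"])[:5])
```

If "shared" comes back empty, the genome and the annotation use different naming schemes, which would be consistent with 0 assigned reads.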

Status	CanFam4	CanFam3.0
Assigned	0	10 824 955
Unassigned_Unmapped	1 462 466	2 285 520
Unassigned_Read_Type	0	0
Unassigned_MultiMapping	17 176 987	11 328 584
Unassigned_NoFeatures	51 033 117	40 504 328
Unassigned_Overlapping_Length	0	0
Unassigned_Ambiguity	0	13 927
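
To put these counts in proportion, here is a small Python sketch that converts each column into percentages of its own total. The counts are copied from the table above, with CanFam4 as the first value and CanFam3.0 as the second; the names `COUNTS` and `percent_by_status` are only illustrative.

```python
# Counts copied from the featureCounts summary: (CanFam4, CanFam3.0)
COUNTS = {
    "Assigned": (0, 10_824_955),
    "Unassigned_Unmapped": (1_462_466, 2_285_520),
    "Unassigned_Read_Type": (0, 0),
    "Unassigned_MultiMapping": (17_176_987, 11_328_584),
    "Unassigned_NoFeatures": (51_033_117, 40_504_328),
    "Unassigned_Overlapping_Length": (0, 0),
    "Unassigned_Ambiguity": (0, 13_927),
}


def percent_by_status(counts, column):
    """Return each status as a percentage of the column total."""
    total = sum(values[column] for values in counts.values())
    return {
        status: 100.0 * values[column] / total
        for status, values in counts.items()
    }


canfam4 = percent_by_status(COUNTS, 0)
canfam3 = percent_by_status(COUNTS, 1)

for status in COUNTS:
    print(f"{status:32s} {canfam4[status]:6.1f}% {canfam3[status]:6.1f}%")
```

For CanFam3.0 this gives about 16.7% assigned, in line with the "fewer than 20%" figure, while CanFam4 stays at 0%.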

Thanks for any help.
