FastQC Overrepresented sequences percentage

jennaj · October 26, 2022, 10:23pm

These are true percentages with as many significant digits as there is space to write them out.

Your example

Five percent == 5.0
Point zero five percent == 0.05

Real data, graph view

One point five five percent == “1.5504756818485261”

Same data, raw view

>>Overrepresented sequences	fail
#Sequence	Count	Percentage	Possible Source
CGGTGCTCGACCCCTCCGACCCCCGCCGGCCGCTTCGAGCCTGAGCCCTT	76412	1.5504756818485261	No Hit

The usage/display in Galaxy is the same as the original tool, so the documentation linked from the bottom of the tool form can be a really useful reference wherever you are running it. There is per-module help plus example reports of “good” and “bad” data and other common use cases.

Babraham Bioinformatics - FastQC A Quality Control tool for High Throughput Sequence Data → Index of /projects/fastqc/Help → Overrepresented Sequences

This module lists all of the sequence which make up more than 0.1% of the total. To conserve memory only sequences which appear in the first 100,000 sequences are tracked to the end of the file. It is therefore possible that a sequence which is overrepresented but doesn't appear at the start of the file for some reason could be missed by this module.

Topic		Replies	Views
RNASeq analysis, FastQC usegalaxy.org support quality-control	0	553	April 7, 2019
FastQ ASCII table for raw sequences' QC usegalaxy.org support fastqc , fastq-format	2	10	July 4, 2025
Problems with the tutorial data gtn-tutorial , quality-control	4	1712	December 6, 2018
Percent abundance of nucleotide at specific position variant-analysis	0	435	March 18, 2021
FastQC Troubleshooting tool-help , quality-control , fastqc	4	156	August 26, 2024

FastQC Overrepresented sequences percentage

Related topics