FastQC Troubleshooting

jennaj · August 26, 2024, 6:25pm

Hi @Sonenshine Thanks for explaining, and great screenshots will all the details, very helpful!

Loading read data should always use default settings. If you don’t get the expected datatype, then this nearly always indicates some content problem to address.

The technical issue is how the + quality score lines are annotated: they are in a legacy Illumina format, and the quality scores themselves may have a legacy Illumina scaling. This has major scientific implications as well. Why? The wrong scaling (from what tools are expecting to process) will throw off all other statistical calculations.

All tools but a few tools will expect Sanger Phred +33, designated as fastqsanger and fastqsanger.gz in Galaxy.

You can load up other formats, and make adjustments (tools will be used to standardize the quality score + annotation line, then re-scale the quality scores themselves). I wrote up some FAQs back when this was a more common necessity, and those should all still work if you want to try. Warning: a bit complicated!

Let the Upload tool detect the format
Adjust the + lines first (this should repair the FastQC issue)
Then “groom” if needed
See here for all → Galaxy FAQs.
And, there was another recent discussion about this if you want more details about the “why” and exact steps. → Faster Download and Extract Reads in FASTQ and ENA reads are slightly different - #2 by jennaj

Or, you can get the SRR reads from NCBI already in fastqsanger format. These will already have the + annotation line standardized, and data points rescaled (if needed based on the original sequencing protocol), directly from the archives, with either of these two tools:

Into collection folders with → Faster Download and Extract Reads in FASTQ format from NCBI SRA
Into individual datasets with → Download and Extract Reads in FASTQ format from NCBI SRA

Please give those a review, thanks!

Topic		Replies	Views
FastQC fails in MRSA Genome Assembly tutorial usegalaxy.eu support gtn-tutorial , assembly	6	195	February 20, 2024
First time user - Genome comparison usegalaxy.org support gtn-tutorial , dropbox	2	288	October 11, 2023
Data upload problem usegalaxy.eu support upload	4	1001	October 26, 2021
Trouble with workflow usegalaxy.org support upload , workflow-extract , troubleshooting	5	26	October 7, 2024
FastQC doesn't work usegalaxy.org support quality-control	15	8733	July 16, 2020

FastQC Troubleshooting

Related topics