Hello everybody,
I’ve been reading many tutorials and educational materials about basic .fastq files quality control procedures but I’m confused because it doesn’t seem to apply to my experimental data. I was able to upload some .ab1 files from sanger semi-automated sequences to the Galaxy platform and to convert them into .fastq files using https://www.usegalaxy.org.au
Until now any problems. In fact, that’s where they begin, because after retrieving the results I realized something weird about them because it seems they included symbols from 2 different ASCII tables used for this purpose: at the same time my files contain symbols like the characters “#” and “%”, which belong to the 33-base ASCII table, they also contain uppercase letters of the end of the alphabet (R, S, T, U, V, W, X, Y and Z), which belong to the 64-base ASCII table only.
Please note I’m taking as reference the ASCII quality scores corresponding to the tables available at Quality (Phred) scores .
I’m confused! If .fastq sequence scores go from 1 (or zero) to 40, how can it be that I have a quality score composed of 64 characters?
Any clarification is greatly appreciated.
P.S.: some time after having posted I remembered to have seen somewhere the option to share my Galaxy history with other people, an option I didn’t find anymore after the question was posted.