Thank you for the response. I was able to remove the white spaces in the reference genome. I did this by replacing all whitespaces with underscores. I also made sure that the line started with a “>”.
After running it again, I received a new error message:
The mapped reference id NC_060925.1 from file /home/galaxy/galaxy/database/datasets/000/185/dataset_185260.dat is not an id of the genome file /home/galaxy/galaxy/database/datasets/000/186/dataset_186828.dat
Try cleaning up the fasta file so that just the sequence identifers are on the title lines. You can use the tool NormalizeFasta. Be sure to toggle the option to strip everything after the first white space.
Tools are trying to “match up” the sequence identifiers across different files. So the idea is to isolate the identifiers in each file to make that easier for the tool to process.
So, leave just the sequence identifier in the fasta file > title lines.
Fasta files are best formatted as >identifier
Not like this (what your first error was complaining about) >identifier description maybe several words
And not like this (what your second error was about) >identifier_description_maybe_several_words
The identifier value is what would be in other files:
BAM, in the @SN lines of the header and the third column of data lines.