cordial greetings
It turns out that I am starting to use galaxy so far the quality control of the sequences and the assembly has gone well, but at the time of making the quality control of the assembly comes out error in the data. I do not know if I am selecting the data wrong. Can someone tell me how to solve the problem?
Hi @Laura_Ximena_Nunez_R
click at ( i ) icon and examine the standard output and standard error log files in the middle window. Do you see any useful messages? Click at View Error icon, the one looking like lady bird beetle. Any useful info?
Kind regards,
Igor
My data are soil metagenomes, I assemble them and directly select the megahit output to upload to quast.
here is the work history
The tool completely died out, probably because of the large, fragmented assembly.
It also doesn’t appear to be quality filtered yet, and includes contigs like this!
k91_0 flag=1 multi=1.0000 len=331
GGGGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACCCAAACAAAAAAAAACAAAAAAAAAACCCCACACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAAAAAAAATAAATTAAAAAAAAAAAAAAAAAAAAAAACACAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
Try reviewing tutorials for methods. This one in particular seems promising for you. Starts at the initial contig step. Find more linked from the bottom of tool forms or review categories at the GTN. Hands-on: Binning of metagenomic sequencing data / Microbiome
Actually it was done in quality control analysis with fastq and in general it was >30, then if I did the assembly with MEGAHIT but at the time of making the quality control of the assembly it comes out in error that I tell you. What can you recommend me?
I would recommend filtering that assembly for low information sequences to see if that helps.
One tool to do this for fasta data is → Remove sequencing artifacts Link at UseGalaxy.org
Another is → Filter FASTA Link at UseGalaxy.org
Any sequence with strings of AAAAAAAAA is not going to be a meaningful result.
Overall, I suspect something went wrong with the upstream QA if this is the result from MEGAHIT. You can try to clean up the data after but … this data might have impacted other results. That is why I pointed you to the tutorials for alternative methods for a comparision.