Error quast metagenoma

cordial greetings
It turns out that I am starting to use galaxy so far the quality control of the sequences and the assembly has gone well, but at the time of making the quality control of the assembly comes out error in the data. I do not know if I am selecting the data wrong. Can someone tell me how to solve the problem?

Captura de pantalla 2024-03-17 210902

Hi @Laura_Ximena_Nunez_R
click at ( i ) icon and examine the standard output and standard error log files in the middle window. Do you see any useful messages? Click at View Error icon, the one looking like lady bird beetle. Any useful info?
Kind regards,
Igor


Although I also have the doubt if I am uploading the data wrong and that can cause the error.


My data are soil metagenomes, I assemble them and directly select the megahit output to upload to quast.

here is the work history

Hi @Laura_Ximena_Nunez_R

The tool completely died out, probably because of the large, fragmented assembly.

It also doesn’t appear to be quality filtered yet, and includes contigs like this!

k91_0 flag=1 multi=1.0000 len=331
GGGGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACCCAAACAAAAAAAAACAAAAAAAAAACCCCACACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAAAAAAAATAAATTAAAAAAAAAAAAAAAAAAAAAAACACAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

Try reviewing tutorials for methods. This one in particular seems promising for you. Starts at the initial contig step. Find more linked from the bottom of tool forms or review categories at the GTN. Hands-on: Binning of metagenomic sequencing data / Microbiome

Actually it was done in quality control analysis with fastq and in general it was >30, then if I did the assembly with MEGAHIT but at the time of making the quality control of the assembly it comes out in error that I tell you. What can you recommend me?

Hi @Laura_Ximena_Nunez_R

I would recommend filtering that assembly for low information sequences to see if that helps.

One tool to do this for fasta data is → Remove sequencing artifacts Link at UseGalaxy.org

Another is → Filter FASTA Link at UseGalaxy.org

Any sequence with strings of AAAAAAAAA is not going to be a meaningful result.

Overall, I suspect something went wrong with the upstream QA if this is the result from MEGAHIT. You can try to clean up the data after but … this data might have impacted other results. That is why I pointed you to the tutorials for alternative methods for a comparision.