RNA/DNA converter on fasta.gz format

eduardofox2 · May 25, 2020, 7:56pm

Hello,
I have one very large fasta.gz file uploaded from FTP link which is in RNA format. I would like to convert it to DNA, which is naturally using RNA/DNA converter tool.

However that tool returns a 0 byte result, which I suspect is an input format issue.
I have uncompressed the fasta.gz to fasta, but RNA/DNA converter returns the following error: […]tool_script.sh: line 25: -d: command not found

Please, does anyone know a workaround for this situation?

jennaj · May 26, 2020, 6:17pm

Hello @eduardofox2

There was a problem with all tools from the Fastx-toolkit. Those issues were just now corrected and testing is in progress … but you could also just try a rerun for quicker results

If there are any lingering issues uncovered on our side with those tools, we’ll post an update back here.

These tools should convert compressed fasta to uncompressed at runtime, but that will be also something that I’ll double check. Since you already have the data uncompressed, rerun using those inputs to avoid more delays.

Thanks for reporting the problem!

eduardofox2 · May 27, 2020, 3:31pm

Hello jennaj, thanks for the quick reply !

However it still does not work as expected, it seems. Running RNA/DNA converter on the compressed (original) fasta will only covert the first sequence. And running on the uncompressed fasta will return an error – first in that it requires single-line formatting which I eventually corrected with “fasta width formatter”, and then finally it complains of some invalid sequence within the corrected fasta file(s).

Eventually I have used sed manipulation substitution which is not ideal but seems to have done the trick.

Please, double check with RNA/DNA converter is working fine on large compressed datasets.

Thanks!

jennaj · May 29, 2020, 9:16pm

Hi @eduardofox2

Doubled checked and you are correct, FASTX-toolkit tools will not work with compressed data. Input uncompressed and use unwrapped formatting. These are older tools wrappers – originally designed for an older short read input type. But they can be still be used with updated formats, it will just take some adjustments on your end.

The invalid sequences probably contain IUPAC characters – not just AUCGN/ATCGN. Change IUPAC bases to N as needed.

Another alternative tool is Manipulate FASTQ reads on various attributes (Galaxy Version 1.1.5).

Thanks!

Topic		Replies	Views
Galaxy fastq to fasta conversion gives coded output	1	281	July 17, 2020
How to convert fastqsanger to fasta usegalaxy.org support download , fastqgz , fastqsanger	3	5354	April 11, 2019
How can I gunzip a file? usegalaxy.eu support fastqsanger	2	295	March 10, 2024
fastq.gz.fastsanger.gz to fastq.gz in Galaxy and FastQC usegalaxy.org support troubleshooting	1	336	January 6, 2024
Do I need to convert genomic.fna.gz file to fasta for custom genomes, if so, how? usegalaxy.org support custom-genome	4	3259	June 29, 2021

RNA/DNA converter on fasta.gz format

Related topics