I have a hairpin dataset and i cant convert it with RNA to DNA convertor because this dataset has some things like Y or R in their sequence, So i cant remove them one by one, is there any tool which can do it all together.
For example clear all sequence except the ACTGactgNn ?
You could use the ‘Text transformation with sed’ tool with this SED Program
/^[^>]/s/[^ACTGNactgn]+//g
This will remove all non-ACTGnactgn characters from all non-header lines in FASTA.
1 Like
Thanks 