Remove lines that have 'NA' as a gene symbol

Hi,

I’d like to remove all lines for which no gene symbol is known, so that I end up with a dataset containing only RNAseq expression data annotated with gene symbols. I tried several options with the compute package, but either it doesn’t work or my syntax is wrong. Any suggestions?

Henk

Dear @Henk,
I assume you are asking about a solution in Galaxy. Have you tried the Filter Tabular tool, choosing "regex replace value in column", selecting the column with the gene names, and using NA as the regex pattern?
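
If you can also work outside Galaxy, the same filtering can be sketched in a few lines of Python. This is only a minimal sketch under some assumptions: the file is tab-separated, the gene symbol sits in the first column, and missing symbols are written literally as NA. The function name `drop_na_symbols` and the example data are made up for illustration. Note that it compares the whole field to NA rather than searching for NA as a substring, so symbols such as NANOG are kept.

```python
import csv
import io


def drop_na_symbols(lines, symbol_col=0, delimiter="\t"):
    """Yield rows whose gene-symbol field is neither empty nor exactly 'NA'.

    symbol_col is the 0-based index of the gene-symbol column (assumed to be
    the first column here; adjust for your own file).
    """
    for row in csv.reader(lines, delimiter=delimiter):
        # Skip blank lines and rows too short to contain the symbol column.
        if len(row) <= symbol_col:
            continue
        symbol = row[symbol_col].strip()
        # Exact comparison: 'NANOG' or 'DNAJB1' must not be dropped.
        if symbol and symbol != "NA":
            yield row


# Hypothetical example data standing in for an expression table.
example = io.StringIO(
    "gene_symbol\tsample1\tsample2\n"
    "TP53\t10\t12\n"
    "NA\t3\t4\n"
    "NANOG\t5\t6\n"
    "BRCA1\t7\t9\n"
)

kept = list(drop_na_symbols(example))
for row in kept:
    print("\t".join(row))
```

For a real file you would pass an open file handle instead of the `io.StringIO` object and write the kept rows out with `csv.writer`.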

Kind regards,
Florian
