remove lines that has 'NA' as a gene symbol

Henk · January 27, 2022, 9:19pm

Hi,

I’d like to remove all lines for which no gene symbol is known in order to obtain a dataset with only gene symbol annotated RNAseq expression data. Tried several options with the compute package but it either doesn’t work or my syntax is wrong. Any suggestions?

Henk

Flow · January 28, 2022, 1:22pm

Dear @Henk,
I assume your question is regarding a solution in Galaxy. Have you tried the tool Filter Tabular, choosing regex replace value in column, selecting the column of the gene names, and as a regex pattern NA.

Kind regards,
Florian

Topic		Replies	Views
Goseq NA NA NA NA values usegalaxy.org support troubleshooting , transcriptomics , goseq	13	93	September 8, 2024
Analyzing RNA-seq data	1	1138	August 20, 2019
UniProt SignalP Predictions: How tobautomatically remove predicted signal equence from FASTA uniprot , fasta-manipulation , bed , text-manipulation	5	1245	December 17, 2018
Deleting sequence identifier line usegalaxy.org support fasta-manipulation	1	78	April 9, 2024
How to change from gene_id to GeneID usegalaxy.org support text-manipulation , transcriptomics , deseq2	2	310	September 4, 2023

remove lines that has 'NA' as a gene symbol

Related topics