Hello,
I used Salmon for quatification and now I’m trying to run DESeq2. I’m using a mapping file from Transcript ID to GeneID but I get the following error:
reading in files with read.delim (install ‘readr’ package for speed up)
1 2 3 4 5 6
removing duplicated transcript rows from tx2gene
Error in $<-.data.frame(*tmp*, “TXNAME”, value = character(0)) :
replacement has 0 rows, data has 997581
Calls: get_deseq_dataset → $<- → $<-.data.frame
I think my mapping file does not have header bur I do not know how to set header=false and I do not know if that is the real problem.
Would anyone help me with this issue?
This implies that your mapping file has more than two columns if I am understanding the log.
The mapping file should contain:
No header line
Two columns separated by a single tab
TranscriptIDs in the first column
GeneIDs in the second column
No extra spaces, tabs, or empty lines
The datatype should be assigned as “tabular”. If Galaxy doesn’t guess this datatype with the detecting-the-datatype-file-format function, then there is a format problem.
General help
This same mapping file should have been used when running Salmon (parameter = “File containing a mapping of transcripts to genes”)
If you used a GTF file instead with Salmon, you should also use that with DESeq2.
Make sure that you are inputting the correct output from Salmon == “quant.genes.sf”
I added more tags to your post that link to prior Q&A that helped others, and tutorials are here:
If you need more help – you can either create a share link to your history and post that back, or it might be enough to post back just the first few lines from each of your inputs.