Using the salmon tool I´ve got UCSC ids instead of gene_ids

nanohacker1 · December 3, 2024, 3:18pm

Hello everybody,
I´m veeeeeryyy new to the bioinfomatic tools. I want to investigate my geneexpression changes via nanopore sequencing.
Now, after running the created workflow starting with single fastq files (attached as a screenshot), I´ve resulted with UCSC ids in tabular format instead of the gene_ids I require for DESeq2. for salmon, I´ve used the Homo_sapiens.GRCh38.cDNA.all.fa file

How do I transform the files into the correct format? Did I add the wrong input? What went wrong?

Many many thanks to you all!!!

jennaj · December 3, 2024, 10:14pm

Welcome, @nanohacker1

It looks like you used the reference genome (chromosomes) instead of the reference transcriptome (transcripts). I also do not see your reference annotation included for the Salmon step – you will usually want to include it for the gene-transcript mapping, at the Salmon step then later for DESeq2.

The best advice I have is to get all of your reference data organized at the very start. The UseGalaxy servers will host the genome indexed, but you’ll need to supply the two other files, and UCSC hosts all of this data.

We have discussions about this, so please have a review and let us know if you have any followup questions.

Start here Correct reference transcriptome for Salmon quant on existing RNASTAR alignments - #2 by jennaj
Then this post has an example where I loaded all the file choices from UCSC, reformatted, and tagged the “matched files”. Correct reference transcriptome for Salmon quant on existing RNASTAR alignments - #4 by jennaj
Suggested data formatting for these tools. FAQ: Extended Help for Differential Expression Analysis Tools
More is under reference-transcriptome and reference-genome and reference-annotation
Human data has extra tips in this guide Reference genomes at public Galaxy servers: GRCh38/hg38 example

Hopefully this helps!

nanohacker1 · December 6, 2024, 11:22pm

Many many thanks for your help!!

Topic		Replies	Views
Getting partial conversion from transcript to gene when using Salmon on Galaxy usegalaxy.org support salmon	0	350	July 12, 2019
Salmon quantification using Ensembl references usegalaxy.eu support salmon	4	1124	October 21, 2021
Salmon quant to DESeq2 usegalaxy.org support salmon	0	501	November 12, 2020
Correct reference transcriptome for Salmon quant on existing RNASTAR alignments usegalaxy.org support troubleshooting , reference-annotation , reference-genome , resources , reference-transcriptome , salmon	3	548	May 16, 2024
Salmon Output File "Name" column gene id format usegalaxy.org support transcriptomics , salmon	3	288	July 25, 2023

Using the salmon tool I´ve got UCSC ids instead of gene_ids

Related topics