No options available (Select Reference Genome)

cw951 · June 13, 2022, 4:14pm

Hi everyone,

We recently installed the Galaxy Server to our mainframe. However, we are having difficulty in using RNA STAR as there is no reference genome available (see attached picture). How can we rectify this?

Many thanks,
Chris

gbbio · June 14, 2022, 7:19am

Did you added references already?

If not maybe it is good to check out this first:

cw951 · June 14, 2022, 5:00pm

Is there any video tutorial on how we can do this? We just simply want to add the reference genome (human) with built-in index and without the built-in gene model.

Apologies, we are very novice to this.

Thanks,
Chris

jennaj · June 14, 2022, 7:38pm

Hi @cw951

The slides that @gbbio shared are part of a larger tutorial suite. Admin topics are here:

Review the section “Data Management & Reference Data”. In short, you have several choices for the method. These include:

Create your own indexes using Data Managers. These are like other tools Galaxy, but only available for local server admins. You’d need to install the tools, then access them under the “Admin” tab, then either run the tools in a specific order directly or use our batch processing utility.
Link in or copy of the indexes available at public Galaxy servers from our data server. This would be configured by a local admin. Human genome builds are included.

The how-to is covered in the tutorials and some include videos. I also added a few tags to your post that cover prior Q&A about this topic (troubleshooting, etc). If you get stuck or have problems, feel free to ask more questions.

Each genome build is associated with

the genome fasta and several technical indexes. Some optional, some required. Example: items like the Samtools index would be needed to interpret BAM outputs produced by mapping tools.
and tool specific indexes (including RNA-Star for most).

Having a built-in indexes is different from using a Custom genome/build where just the fasta is add to the working history. Why? Galaxy creates those technical and tool specific indexes at runtime as single-use temporary files. Custom genomes/builds are fine for smaller genomes (or any “-ome”) but larger genomes would require much more time/resources to create those indexes every time the tool is run. Convenient for some use cases but not practical for others.

For human, definitely use a built-in indexes. And if you chose to use the pre-computed indexes we host, you won’t need to scale up your local resources to create the indexes with Data Managers. You will still need to have sufficient resources to execute a tool. For `RNA-star, I think the minimum working memory would be about 60 GB – but you can check with your own data. Run the tool at a public server with a representative dataset query and against a genome build target, click into the Job Details ( icon within the result dataset), and review the resources used in the lower part of the report.

Hope that helps!

Topic		Replies	Views
"genomeDir": an invalid option server-admin , reference-index , galaxy-local , reference-annotation , reference-genome	3	290	October 17, 2023
RNA STAR Reference Genome at UseGalaxy.org -- Resolved usegalaxy.org support	7	3036	January 8, 2021
RNA STAR : -- genomeDir not available at UseGalaxy.org -- Resolved usegalaxy.org support gtn-tutorial , public-galaxy-server	3	887	January 8, 2021
Indexing reference genomes with Data Managers: Resources, tutorials, troubleshooting galaxy-local , data-manager , picard_markduplicates	28	7612	July 7, 2021
Add a new reference genome for RNAstar usegalaxy.org support custom-genome , transcriptomics	1	321	February 1, 2024

No options available (Select Reference Genome)

Related topics