Diamond UniRef90

Hi everyone,

I’m running a metagenomics workflow on Galaxy and have reached the functional annotation step using DIAMOND.
Currently, the only databases available in the DIAMOND tool are RefSeq and UniProtKB/Swiss-Prot.

For compatibility with HUMAnN, I need to use the UniRef90 database for protein family annotation.

Could you please let me know if UniRef90 is available on this Galaxy instance, or if it can be added to the DIAMOND database options?

They said “If your database of interest is not listed, contact your Galaxy admin”, so I guess something can be done. Anyone knows who is the Galaxy admin in this case?

Thanks a lot for your help and support

Best,
Elena

Welcome @elena.peresani

Which Galaxy server are you working at? The server URL please – find this at the top of your browser window. Thanks! :slight_smile:

Then, if you would like to move over to a UseGalaxy server, most will have the UniRef90 database already included. Screenshot from the UseGalaxy.eu server.

If you are following this GTN tutorial, how to check which servers will support the tools/data is in the Available at these Galaxies menu at the top. The EU server is probably the best choice since that is where the scientists who developed the workflow are working but any should be fine while learning.


Who to contact to install new reference data will vary by server and there are ways to connect to most the indexes hosted at public servers though a shared resources. We can help you to find the administrators and let them know. :scientist:

Hope this helps!


Resources

Hi @jennaj,

thanks for your help.

I’m working on the Galaxy Europe server: Galaxy

My issue is that I’m trying to perform an assembly-based functional annotation — starting from scaffolds, not from raw reads, from my metagenomic sequencing.

The workflow I was following is:
Prodigal (gene prediction)DIAMOND (protein annotation using UniRef90 database)HUMAnN (metabolic pathway reconstruction)UniRef to KEGG KO conversion.

The problem is that I cannot find the UniRef90 database among the available reference databases in the DIAMOND tool on Galaxy.

I understand that a read-based profiling directly with HUMAnN is simpler, but since I already have assembled scaffolds, I would like to perform a more targeted assembly-based analysis.

Could you please tell me if the UniRef90 database can be made available for DIAMOND on Galaxy Europe, or who I should contact to request it?

Thank you very much for your time and support.

Best regards,
Elena

Thanks for explaining more @elena.peresani

Yes, I understood your request but was hoping to alert about some of the details involved. I’m not sure if your planned route through these tools is going to produce the result you want. All three analysis paths involve abundance calculations, and interpretations. But I’m guessing that you know this and want to try anyway!

I’ve made a request for the UniRef90 index to be installed natively at the UseGalaxy servers for Diamond. How quickly this can be created and published out has a variable timeline. You can track along with the request here. → Request: index UniRef90 for Diamond at the UseGalaxy servers · Issue #76 · galaxyproject/idc · GitHub

Hope this helps! :slight_smile:

Hi @jennaj,

thank you very much for your help and for submitting the request.

I’m trying a few different approaches for my analyses and I hope that at least one of them will work out successfully.

Thanks again for your time and support!

Best regards,
Elena

1 Like