Re-using NetMHC Runs?

daForce93 · April 5, 2019, 6:25pm

Hi,

I’m building a Galaxy tool that uses NetMHC, which takes in a file containing peptides, and an MHC allele, and predicts peptide-MHC binding affinity. On something the size of the human proteome, running it takes quite a while, so I’d like to cache the results for when my tool needs NetMHC ran on the same set of peptides and the same MHC allele.

I know about data managers, but those seem to be aimed towards data that I as the tool creator/administrator upload ahead-of-time, rather than caching the results of user’s runs. Does Galaxy have any kind of scaffolding for what I’m trying to do? I tried to look in the documentation, but I didn’t find anything.

Thanks,

Jordan

innovate-invent · April 5, 2019, 6:34pm

You are correct about the data manager approach. Datamanagers usually have an associated tool to ingest and retrieve data from the manager.
One approach would be to have two outputs on this tool, one for emitting the cached result and another for forwarding the input dataset to your analysis tool. Have that tool link back to another input of your datamanager tool. If this tool does not produce an output, the linked tools will not be run.

daForce93 · April 23, 2019, 4:06pm

Ah, okay, thanks!

Topic		Replies	Views
MetaPhlAn Database reference-index , galaxy-local , metagenomics	1	66	September 16, 2024
Training: Peptide and Protein ID using SearchGUI and PeptideShaker	2	435	April 16, 2021
Clustering of protein sequences on Galaxy usegalaxy.org support	1	106	February 20, 2024
First time user - Genome comparison usegalaxy.org support gtn-tutorial , dropbox	2	293	October 11, 2023
Request to update BLASTP nr database usegalaxy.eu support reference-index , blastp	5	19	March 14, 2025

Re-using NetMHC Runs?

Related topics