Fasta format genome file in mirdeep2 (custom genome)

maham_hamid · May 10, 2019, 11:25pm

I’m using mirdeep2 for analysis of human miRNA data and in this tool, the input requirements of this tool is 2 files from mirdeep2 quantifier, mature.fa and precursor.fa from mirBase and a coresponding genome fasta file.
i am not sure which type of file this is? is this reference genome of humans i.e hg38 ? if yes than why do i have to upload it again while galaxy already has a builtin genome.

thanks

jennaj · May 13, 2019, 5:48pm

Hi @maham_hamid & thanks for posting this to the forum!

In short, for these tools, as installed at https://usegalaxy.eu:

MiRDeep2 identification of novel and known miRNAs does not make use of the general indexed reference genomes (and probably could/should)
MiRDeep2 Mapper process and map reads to a reference genome does have some genomes available as indexes, but not all – and I’m not really sure why yet.

Using a Custom reference genome fasta might work with both of the “Genome” inputs. It depends on your reference genome. Larger genomes are sometimes problematic (are too large to process against). Smaller genomes would likely work fine.

The first FAQ covers how to format a reference genome/transcriptome/exome used from the history. The second covers some common issues people encounter when mixing natively indexed genomes with custom genomes or externally mapped data or when incorporating other inputs (reference annotation, etc).

ping @hexylena @bjoern.gruening for advice about usage and future plans for indexes. Maybe we could reach out to the tool authors and see if they can update the tools to use the genomes already indexed on a server for this type of input? (“all_fasta” data table ??).

A few other tools have needed changes around using build-in indexes (were initially pulling data from a tool-specific data table, instead of a global data table) but I can’t remember exactly how at the end that was remedied – tool wrapper update or a Galaxy update/config change? Thanks & we can ping the authors at their development repository (once clarified && if needed).

amir · September 2, 2019, 7:48am

i used the reference genome i.e hg38 but it came up with an erorr with no explanation.
can anyone help me ?

jennaj · September 4, 2019, 6:05pm

Duplicate, question is now in this post:

Topic		Replies	Views
mirdeep2 errors -- Solution: verify input format usegalaxy.org support custom-genome , troubleshooting , reference-genome	1	467	April 13, 2023
Create reference genome from my WGS data (custom reference) usegalaxy.org support workflow , tool-dev	0	404	April 10, 2019
Adding new Reference genomes to the DeepVariant deep learning-based variant caller usegalaxy.eu support custom-genome , mapping , transcriptomics , reference-annotation , reference-genome , custom-build , featurecounts	1	776	January 26, 2023
miRNA seq differential analysis usegalaxy.eu support workflow	0	591	August 20, 2020
Tool form does not list expected indexed reference genome: Solution -- assign the "database" to the inputs usegalaxy.org support database , metadata , datatype	1	597	December 2, 2019

Fasta format genome file in mirdeep2 (custom genome)

Related topics