I’m trying to run MiRDeep2 to identify novel miRNAs from my non-model species. I have successfully mapped my cleaned small RNA reads to my genome using MiRDeep2 Mapper and individually run the resulting collapsed read files and mapping ARF file, along with my genome, in MiRDeep2. I didn’t add any known miRNAs or precursors and used the default settings. I have four samples, so I tried to pool these together in MiRDeep2 Mapper, which seemed to work fine. However, when I tried to run the pooled collapsed read file and mapping ARF file in MiRDeep 2, it showed an error (Fatal error: Exit code 2 ()) and aborted the run after less than 2 minutes. This is the run log:
As a cross check, maybe compare the usage to a public Galaxy server?
The second tool test in here makes use of two pooled samples. Notice how each was given a sample identifier and input in a separate block. Click into the pencil icon then the details tab to see the command string.
In short, you are either not getting any hits at all, or the tool cannot match up the sample query to that sample’s mapped hits correctly, and you’ll need to determine why.
For line command help, this forum isn’t the best place to get advice, but to get you started, check the usage manual here. See example 6. The config.txt file is used to organize the pooled samples. This is what the Galaxy tool form is doing for you by breaking out the inputs – creating that sample sheet directly. Maybe the extra help in Galaxy can flush out how to use the tool in other ways?
Thanks for your reply! I get hits when I run the samples individually, so it’s likely a matching problem. The usage manual looks very informative, thanks
I ran different samples pooled and didn’t come across the above issue at all, but the bottom of my tabular output is a bit strange. After the novel predictions, it has a section for the known miRNAs I’ve supplied. It looks a bit odd though, with the section “#miRBase miRNAs not detected by miRDeep2” followed by many empty rows,and then lists 4 of the known miRNAs I supplied. However, the columns the data are in don’t seem to match up and I’m not sure how to interpret it. Has anyone come across this before?
I haven’t but maybe someone else will recognize it and comment.
My first instinct when I see a file like this, is to open it in an editor where I can review the whitespace (like vim).
Then, as a guess, content from one or more of the inputs is getting pushed out into this file. If you are running pooled without the sample designation (config.txt) it certainly seems possible that you may have data points that may not be failing the tool outright but that are still not being interpreted correctly. I’m not sure if I would trust this result!
All that said, this looks like a screenshot from Galaxy. Did you upload the result from the command line tool to view it? The whitespace in the file, if not standard “tabular” and not another supported report-file flavor of datatype, could have be compromised. Some of this depends on how you loaded it. The advanced option for “converting whitespace to tabs” in the advanced options can be very useful, or it can produce a view like yours, and other variations. Choosing “txt” is usually a safer choice to preserve everything without changes.
But this loops back to reviewing the original file in an editor, then reviewing the command string versus the manual. Tools can very happily do all sorts of odd things and still not have it be “bad enough” to fail. Galaxy has an Visualization → Editor function but I’m not sure if it will be helpful for this case or not from what you have shared so far. I guess you could try? But maybe start with a clean txt upload first.