Hi, I submitted three datasets to the MethylDackel tool and it is running for the past two days (other users said it took them 2-3 hours to get results). does anyone know what’s the problem?
Thanks
Hi, I submitted three datasets to the MethylDackel tool and it is running for the past two days (other users said it took them 2-3 hours to get results). does anyone know what’s the problem?
Thanks
Hi @Guy_Haim,
I have this here in your logs.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_load] build FASTA index.
[fai_fetch_seq] The sequence "chr1" not found
[fai_fetch_seq] The sequence "chr1" not found
faidx_fetch_seq returned -2 while trying to fetch the sequence for tid chr1:0-1000000!
Note that the output will be truncated!
[fai_fetch_seq] The sequence "chr1" not found
[fai_fetch_seq] The sequence "chr1" not found
faidx_fetch_seq returned -2 while trying to fetch the sequence for tid chr1:1000000-2000000!
Note that the output will be truncated!
[fai_fetch_seq] The sequence "chr1" not found
[fai_fetch_seq] The sequence "chr1" not found
faidx_fetch_seq returned -2 while trying to fetch the sequence for tid chr1:3000001-4000001!
Note that the output will be truncated!
[fai_fetch_seq] The sequence "chr1" not found
[fai_fetch_seq] The sequence "chr1" not found
faidx_fetch_seq returned -2 while trying to fetch the sequence for tid chr1:4000001-5000001!
Note that the output will be truncated!
[fai_fetch_seq] The sequence "chr1" not found
[fai_fetch_seq] The sequence "chr1" not found
faidx_fetch_seq returned -2 while trying to fetch the sequence for tid chr1:5000001-6000001!
Note that the output will be truncated!
[fai_fetch_seq] The sequence "chr1" not found
[fai_fetch_seq] The sequence "chr1" not found
faidx_fetch_seq returned -2 while trying to fetch the sequence for tid chr1:6000001-7000001!
Note that the output will be truncated!
[fai_fetch_seq] The sequence "chr1" not found
[fai_fetch_seq] The sequence "chr1" not found
faidx_fetch_seq returned -2 while trying to fetch the sequence for tid chr1:10000002-11000002!
Note that the output will be truncated!
Are you supplying your own index? Which reference genome are you using?
Ciao,
Bjoern
Yes, my own index.
do you think I should use another index?
Any reason you can not use …
not that I can think of (except I used it for the previous stages of the reduced representation bisulfite sequencing files I got).
anyway, I’ll try…
Please let me know if this speeds up your processing.
No problem, thanks.
Hi again, it’s still running. How long does it take normally?
Also, do you think I should re-do the previous stages? This time with this ref genome:
Hi @Guy_Haim
Using the same exact reference genome (build + version) throughout analysis is important.
The natively indexed version of hg38
was sourced from UCSC. If that is the same as what you used for upstream steps (in Galaxy or not), all should be fine.
Other reference genome sources can be used, however, some data adjustments are usually needed to avoid problems. This FAQ has more details:
Hi, thanks for replying…
Actually, I was told to use the built-in genome index in galaxy instead of the index I created on my own, but it didn’t work and I used it for the upstream steps and for the MethylDackel tool as well.
Still, it’s running for almost two weeks now.
If you can think of anything else I can do I would appreciate it.
Thanks again,
Guy
בתאריך יום ו׳, 4 ביוני 2021, 19:37, מאת Jennifer Hillman-Jackson via Galaxy Community Help <galaxy@discoursemail.com>:
Using mammalian or some plant genomes, that are large, tends to exceed resources at public Galaxy servers when input as a custom genome (fasta) from the history. Why? Because before the tool is run, the genome may need indexing. I saw that you are now using the built-in index, so that will resolve part of the problem.
If you are still having issues, check these three things first:
Other than that, if a job is queued, it is waiting for resources to become available. How long a job queues depends on how busy the server is and how many jobs you have that are queued (grey) + executing (yellow). If all that doesn’t help, maybe @bjoern.gruening or @gallardoalba can help more since you are working at the EU server.
OK, I’ll try…
Thank you so much for the help.
Guy
בתאריך יום ו׳, 4 ביוני 2021, 22:34, מאת Jennifer Hillman-Jackson via Galaxy Community Help <galaxy@discoursemail.com>:
Methyldackel has some problems at the moment we are updating the tool to the latest version. Please check again in 24h and try the latest version. Sorry for the inconvenience.
2 posts were split to a new topic: Searching the tool panel – Example at UseGalaxy.eu for tool MethylDackel
Hi @bjoern.gruening, I also appear to be having problems with long MethylDackel run times. I am using the bam outputs from the Bismark mapper, and for some of these MethylDackel finishes quickly (a few minutes in some cases!), but for most of the others it has been running for a long time (some for more than a few days). Is this normal? Do you have an idea as to how long these jobs should normally take?
Many thanks,
Lewis
Have you tried the new version as well? 0.5.2?
Yep, all of the jobs are running using version 0.5.2+galaxy0.
Still no luck with the new version however, pretty much all of the jobs have been running for 2 days plus now. Do you have any ideas why they might be taking so long to complete?
I resubmitted my files and they are running for the past 8 days, with the new version…