I ran busco to evaluate the draft of a bacterial genome and got this error:
“FileNotFoundError: [Errno 2] No such file or directory: ‘/cvmfs/data.galaxyproject.org/byhand/busco/v5/lineages/bacilli_odb10/refseq_db.faa.gz’”
So, instead, try one of the other combinations. Some can be unexpected. But if a database “isn’t available” in the error message, that means the underlying tool cannot process data that way.
Hum, if you can show an example of where that worked before, I’d be willing to take a look.
Now, the prior versions with simplified form all required a linage selection (most like the first example in dataset 8-9 above) so off hand that may have been it. But now, with the compound form, I think it is all working as expected.
Hello again, in the post you mentioned above, there is something that doesn’t make sense.
Prokaryotic data is used, but the ortholog database used is from plants (liliopsida).
BUSCO works well if you use version 5.8.0+galaxy0.
Therefore, the error does not stem from the absence of orthologous protein databases for each lineage, but rather from the version of the program used. It is clear that the latest version does not work.
I was just setting the default combinations at the top level to show what would be produced – my example isn’t a scientific example, more about the settings. I should have worded this better! I’ll try again.
You have bacterial data, correct? Using Metaeuk is the only requirement, the remainder you can set as needed. That can be “auto” for the linage as I used – but you can also choose a linage. In short, using Miniprot won’t work with the “auto” choice because it only functions with a linage selection.