Hello! While doing work on Funnanotate, it has failed in several ways without performing its task:
- Arthropoda 2023 data base does not work:
[Jun 21 10:21 AM]: OS: Rocky Linux 8.6, 125 cores, ~ 438 GB RAM. Python: 3.8.10
[Jun 21 10:21 AM]: Running funannotate v1.8.9
[Jun 21 10:21 AM]: ERROR: arthropoda busco database is not found, install with funannotate setup -b arthropoda
- While using the 2022 database:
- Names are too long:
[Jun 21 11:23 AM]: OS: Rocky Linux 8.6, 125 cores, ~ 438 GB RAM. Python: 3.8.10
[Jun 21 11:23 AM]: Running funannotate v1.8.9
[Jun 21 11:23 AM]: GeneMark not found and $GENEMARK_PATH environmental variable missing. Will skip GeneMark ab-initio prediction.
[Jun 21 11:23 AM]: Skipping CodingQuarry as no --rna_bam passed
[Jun 21 11:23 AM]: Parsed training data, run ab-initio gene predictors as follows:
e[4mProgram Training-Methode[0m
augustus busco
glimmerhmm busco
snap busco
[Jun 21 11:24 AM]: Genome assembly error: headers contain more characters than the max (16), reformat headers to continue.
[Jun 21 11:24 AM]: First 5 headers that failed names:
JAAFCF010006596.1
JAAFCF010006680.1
JAAFCF010005931.1
JAAFCF010005748.1
JAAFCF010003739.1
- When names are just fine:
[Jun 21 11:25 AM]: OS: Rocky Linux 8.6, 125 cores, ~ 438 GB RAM. Python: 3.8.10
[Jun 21 11:25 AM]: Running funannotate v1.8.9
[Jun 21 11:25 AM]: Can’t find Repeat Database at /data/db/data_managers/funannotate/2022-01-17-193541/repeats.dmnd, you may need to re-run funannotate setup
In conclusion, the tool seems to be failing no matter the configuration selected. Thanks for your attention and sorry for the inconveniences!