I have installed Kraken, Kraken2, and other necessary tools successfully. Then I went on to install the kraken2 database using the data_manager_build_kraken_database from the Install Tool section. Then, after uploading a fastq file for classification, I could not find any database to classify the reads.
Please see the attached photo below:
Does the database show up on the Kraken tool form? I’m asking since the indexes are specific to the tool release and your screenshot looks like Kraken2.
Later on, be sure to not mix up data outputs generated between the two – they have been incompatible in the past.
The ToolShed is undergoing maintenance today, but you could also get the Kraken2 data manager and install indexes with that for the matching tool once it is backup.
Hi @jennaj Thank you very much for the reply. Unfortunately, it did not work.
I have now installed kraken2 and also the data_manager for building the kraken2 database. However, after installing this, I do not know how to run the data manager. Because it does not show up anywhere in my local Galaxy interface.
So, still I do not see any database for running kraken2.
These data administration tools will not show up in the “regular” tool panel.
Instead, find this under the Admin masthead menu while logged into your administrative account. This is where the Kraken (1) data manager would have been, too. The section is Admin → Server → Local Data.
Hi @jennaj thanks a lot. Yes, I have found it now and it is working.
But I have another question:
I was trying to build the database from Pre-built RefSeq Indexes and the index files are dated back to June 2022 latest. Is there a way to get updated Indices I see already from here https://benlangmead.github.io/aws-indexes/k2 that they continuously update the database indices.
Please let me know.
Thank you very much for the cooperation.
There isn’t a current way to capture the daily/weekly updates from RefSeq (that I know of!) but if this is something that you want to help with – meaning, improve the data manager to handle that – I’m sure others would be interested.
Part of the reason the public servers don’t do this is practical. Any data that is ever hosted, is then retained forever and available for reproducibility reasons. Obviously you can set your own data retention policies on a private server to do this differently!