I have uploaded a species genome to Galaxy, this is a conifer genome which is quite large (for example Picea abies genome size is: https://www.ncbi.nlm.nih.gov/Traces/wgs/CBVK01?display=contigs). When I try to create a BLAST database (with the NCBI BLAST+ makeblastdb tool) I receive an error message. The message is about duplicated sequence IDs in the file, however, the problem in not this, there is no duplication for sure, the problem is the file size. I tried with a smaller genome and everything works fine, there Galaxy says: Maximum file size: 1000000000B.
According to this I was thinking to cut the genome somehow and than create two or three BLAST databases? Is it a good approach? Is it possible to achieve this in Galaxy (is there a tool to cut)?
How am I able to create such a large BLAST database?
I’m extremely appreciate any help!