What is the maximum number of input smiles strings possible to calculate, since I have to create descriptors for 2.5 million compounds?
I don’t think there is a limit. It might take a long time
I would recommend to split your data into smaller chunks, we have tools for this. And then try with one chunk to see how long it takes.