Unable to import my files as .qza

I have a R1 and R2 reads as a fastqsanger, and I need to run a taxonomic analysis of the sequencing results. The 2 files are in Galaxy, but every time I use the tool to import into qiime2 artifact, it gives me a message, “No list data set collections available”.

I also have a data set with all the different reads that are already demultiplexed, and the same message appears every time.

What should I do to be able to get my files to become a .qza?

Welcome @isabelle_miserani

For this analysis, you could use the tools directly if you want to. Some potential tutorials for follow are here → Galaxy Training!. If you had a tool in mind, check the form since more tutorials will be linked there, plus you can search that site directly. This would avoid the need to create the artifacts.

However, if you still want to use the Qiime/Qiime2 tools instead, you can get your artifact organized by putting the data into a collection. The reads will need to have the original sequencing identifiers. Then you input to the import tool, and proceed to downstream tools.

This topic here has the links to the resources we have:

To inspect artifacts, you will need to download the file then upload to the site hosted by the original authors. This is another good topic since one of the authors was helping.

Hope this helps to get you started, and we can follow up more! :slight_smile:

1 Like

I am new to this computational approach. So when you say to put the data into a collection, should I create a collection of paired ends? One more question, when you say original sequencing identifiers, do you mean the reads still multiplexed?

1 Like

Yes. For the collection part, see the tutorials here to learn how to create them. You will want the paired end version for what you are doing.

No, you can have the reads separated into sample groups. I’m referring to the sequence identifiers on the @ lines. This is a nice reference guide about how to interpret those.

Hope this helps! See the other topics I referenced. One has a walk through of adjusting the sequence identifiers to match the pattern this tool is expecting.