Iterate/loop through items in lists for workflow

phagepower · June 1, 2021, 11:47am

Hi,

I was wondering how I could setup a workflow to iterate through a list of FASTA files, in addition to a list of pairs (PE Illumina reads), and run a tool (bowtie2 for alignment) so that the first FASTA file is used with the first pair of reads, second FASTA file used for second pair of reads, and so on. I’ve attached some screenshots below with my current history structure for reference.
I couldn’t find any way to loop through collections (ideally using a “for”-style of loop). Closest i could find was the “Extract Dataset” tool which requires you enter an index; so this would become a pretty manual process if I were to do this for all 30+ items spread over 2 lists.
Just wanting to make sure that there is no way to do this within Galaxy already before I go out and start programming a custom tool for this.

Thanks in advance!

David · June 1, 2021, 12:14pm

Do you really need each PE pair to a specific reference? Are you doing this to check contaminations?
Is there a problem if you merge your fastas and run bowtie on the merged reference?
What about run [all the PE pairs] x fastas [separated, as collection] ?

phagepower · June 1, 2021, 12:30pm

Thanks for your reply. The reason I’m doing this type of read mapping specifically on a per-sample basis (and not combining all my FASTA files/contig as into one file) is part of the bacterial genome binning process and MAG assembly pipeline we are following.

If I merge my FASTAs and create a Bowtie2 index/reference from that, I run the risk of reads aligning to contigs present from another sample, potentially altering coverage which is used in the binning steps.

David · June 1, 2021, 12:50pm

Got it.
Afaik, there is no way to do this specific loop.

phagepower · June 1, 2021, 2:35pm

Alright thanks the same!

David · June 4, 2021, 4:43pm

@phagepower, I can’t test this now, but, maybe Galaxy API - Galaxy Community Hub can help:

When the analysis involves complex control, such as looping and branching

phagepower · June 6, 2021, 7:02pm

Thanks @David, I forgot about the API. I’ll consider it, although I think having something on the graphical front-end for users would be ideal. Shouldn’t be too hard to write a simple script to take care of it and wrap it as a Galaxy tool.

Topic		Replies	Views
First time user - Genome comparison usegalaxy.org support gtn-tutorial , dropbox	2	290	October 11, 2023
De-multiplex paired-end reads usegalaxy.org support collections , tool-help , __apply_rules__	1	26	October 30, 2024
Tool request: Get data from Genbank/RefSeq by accession usegalaxy.org support tool-dev	1	545	August 6, 2019
Bowtie 2 - in Workflow no custom reference genome available -- Solution: Create a Custom Build usegalaxy.eu support custom-genome , reference-annotation , reference-genome , custom-build	1	1154	August 27, 2019
Paird-end Fastq-dump Manipulation - Fastq De-Interlacer	3	2513	May 7, 2019

Iterate/loop through items in lists for workflow

Related topics