Addition of several plots to MultiQC

Hi, does anybody have an idea how I can combine several plots into one MultiQC report? I've been trying to create a collection from them, but sadly it doesn't work on EU Galaxy.

Thanks!

Hi @Orange_Pomeranian

Thanks for sharing the workflow! Very helpful.

To input all of the reports of the same type into the same section of a MultiQC report, consider putting all of those reports into the same collection, then inputting that collection to MultiQC.

You are already using collections, so this would involve adding in the Merge collections tool.

You could merge the similar data together before processing the data through the tools (that are generating the reports) or you could merge after the data passes through those tools (merging the report outputs instead as a preparatory step for MultiQC).

In either case, all of the reports for the same report type should be organized together into a single collection. Then, configure the target tool’s section on the MultiQC form to accept a collection input.

Whenever adding a new collection to a workflow, remember to disconnect then reconnect any downstream tools! This resets the workflow metadata. :hammer_and_wrench:

Note: if the data to be merged started out as individual files (not in a collection yet!), then Build list would be used to create the new collection instead!

Did I understand your question correctly? Then, does this actually help? Please let us know! :slight_smile:

Thanks for your reply. I tried connecting the inputs to the ‘Merge collections’. Sadly most of the inputs don’t work (even when I remove downstream connections).

I want to merge several plots into one collection so that it can be added to the Image Content section of the final MultiQC report. Merge collections doesn’t seem to work with that. Do you have an example workflow that shows how it can be connected?

Hi @Orange_Pomeranian

For the inputs that won’t connect to Merge collections, you can try a few other things:

  1. Disconnect all noodles, then reconnect them in the order of job execution
  2. Double check that the collections you are merging all contain data of the same datatype. For your case, this would be the txt formatted reports for MultiQC: one merge per report type
  3. Then connect that merged collection to MultiQC, after setting the section's input to a collection and choosing the tool/report type
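To make the logic of items 2 and 3 concrete, here is a minimal Python sketch of what "one merge per report type" means. It is only a conceptual model, not Galaxy's implementation of Merge collections: the `Element` type, the `merge_collections` function, and the sample names are all hypothetical. Rejecting duplicate identifiers is a choice made here for illustration.

```python
from typing import NamedTuple


class Element(NamedTuple):
    """Hypothetical stand-in for one dataset in a collection."""
    identifier: str  # element identifier (sample name)
    datatype: str    # for example "txt" or "png"


def merge_collections(*collections):
    """Concatenate collections, insisting on one datatype and unique names."""
    merged = []
    seen = set()
    for collection in collections:
        for element in collection:
            # All merged elements must share the first element's datatype.
            if merged and element.datatype != merged[0].datatype:
                raise ValueError(
                    f"mixed datatypes: {merged[0].datatype} and {element.datatype}"
                )
            # Illustration-only rule: refuse repeated element identifiers.
            if element.identifier in seen:
                raise ValueError(f"duplicate identifier: {element.identifier}")
            seen.add(element.identifier)
            merged.append(element)
    return merged


control_reports = [Element("control_1", "txt"), Element("control_2", "txt")]
case_reports = [Element("case_1", "txt")]

merged_reports = merge_collections(control_reports, case_reports)
print([element.identifier for element in merged_reports])
```

A merge that mixes, say, txt reports with png plots fails the datatype check in this model, which is the same situation to look for in the workflow editor.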

Then, for an example workflow: this tutorial has a general merge example

And this workflow template includes MultiQC with a merge in the data preparation steps.

Notice how I ensured that all samples had a unique element identifier (sample name) at the very start. Paired fastq data is a bit different from BAM and BigWig data, and involves an extra step to add forward/reverse notation inside of the workflow, but the basic logic applies to all data: use simple, standardized formats for the element identifiers in the input collections. This makes it easier to manipulate the collections later.
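As a rough illustration of what "simple, standardized" identifiers could look like, here is a small Python sketch. The naming rules (drop common file extensions, collapse other characters into underscores) and the function names are assumptions made for this example, not something Galaxy enforces.

```python
import re

# Hypothetical list of extensions to strip; adjust for your own data.
EXTENSION_PATTERN = re.compile(r"\.(bam|bw|bigwig|fastq|fq)(\.gz)?$", re.IGNORECASE)


def standardize_identifier(name):
    """Turn a file-like name into a plain sample identifier."""
    base = EXTENSION_PATTERN.sub("", name.strip())
    # Replace every run of non-alphanumeric characters with one underscore.
    return re.sub(r"[^A-Za-z0-9]+", "_", base).strip("_")


def standardize_all(names):
    """Standardize a list of names and fail if any two collide."""
    result = [standardize_identifier(name) for name in names]
    duplicates = sorted({name for name in result if result.count(name) > 1})
    if duplicates:
        raise ValueError(f"identifiers are not unique: {duplicates}")
    return result


print(standardize_all(["Control rep1.bam", "case-Rep2.bigWig", "sample.3.fastq.gz"]))
```

The uniqueness check matters most: two samples that collapse to the same identifier become hard to tell apart once collections are merged.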

Finally, I have some unsolicited advice :nerd_face:: your workflow contains duplicated processing paths. Could this be simplified? Maybe input all BAMs together, all BigWigs together, run the statistics together, then split the control/case data out into collections afterwards (using the element identifiers from a tabular input?) for the downstream MACS2 jobs?
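The split-by-identifier idea can be sketched in a few lines of Python. This only models the logic, assuming a two-column tab-separated table of identifier and group; the `split_by_group` function and the sample names are hypothetical, and inside Galaxy this would be done with collection tools rather than code.

```python
import csv
import io
from collections import defaultdict


def split_by_group(identifiers, tabular_text):
    """Group element identifiers using an 'identifier<TAB>group' table."""
    reader = csv.reader(io.StringIO(tabular_text), delimiter="\t")
    group_of = {row[0]: row[1] for row in reader if len(row) >= 2}
    groups = defaultdict(list)
    for identifier in identifiers:
        # A KeyError here means the table is missing a sample.
        groups[group_of[identifier]].append(identifier)
    return dict(groups)


table = "ctrl_1\tcontrol\ntreat_1\tcase\nctrl_2\tcontrol\n"
print(split_by_group(["ctrl_1", "treat_1", "ctrl_2"], table))
```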

But either way should work. You are welcome to share back a link to your workflow (and screenshots showing what to pay attention to) as you work through this. But please try the items above first! Thanks! :slight_smile: