Want to retrieve all bacterial sequences from whole genome

santatra · October 11, 2024, 6:34pm

Hi all,

I would like to retrieve all bacterial sequences from a whole-genome dataset. All I have is an assembled FASTA file (contigs, not yet a single long genome sequence). How can I retrieve the bacterial sequences from these contigs?

Thanks
Santatra

jennaj · October 11, 2024, 7:52pm

Welcome, @santatra

I’m not sure I have fully understood your question. Could you explain a bit more about your goals?

Meanwhile, I can share some analysis protocols through tutorials that may help to frame the kinds of questions we can answer here.

If you are completely new to Galaxy, this is a good place to start.

Get Started - Galaxy Community Hub

Then, this tutorial is an example of a bacterial genome assembly. It involves WGS and ONT reads.

Hands-on: Unicycler Assembly / Unicycler Assembly / Assembly

You can find more tutorials using keywords or by navigating the training site directly.

Prokaryote → GTN Materials Search
Assembly → Assembly / Tutorial List
Microbiome → Microbiome / Tutorial List

Hope this helps!

santatra · October 11, 2024, 10:14pm

Hi @jennaj,

Thank you for your response.

I would like to know the bacterial community of my sample. However, my data (shotgun metagenomics reads) are still contigs (more than 50 contigs per sample, in a FASTA file). Do I need to assemble these contigs into scaffolds before running the microbiome analysis, or can I retrieve all bacteria from these contigs directly?

Thank you

jennaj · October 11, 2024, 10:27pm

Hi @santatra

There are a few tools you can use for metagenomic profiling. See that same training site for examples and the different tools you can try.

But in short, Kraken2 is usually a good choice for WGS reads. For amplicon data, there is Mothur. Both are covered in the examples, along with all the other little steps: data preparation, assembly (if any), then result interpretation with graph generation.
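For anyone who wants to pull the bacterial contigs out of a FASTA file after classification, here is a minimal Python sketch of one way it might be done outside Galaxy. It is not a tool from the tutorials. It assumes Kraken2 was run on the contigs and produced (a) the standard tab-separated per-sequence output with numeric taxids in the third column (that is, without the scientific-names option) and (b) the standard report, where taxon names are indented to show hierarchy. The function names and the toy data are hypothetical and only for illustration.

```python
def clade_taxids(report_lines, root_name="Bacteria"):
    """Collect the taxids listed under root_name in a Kraken2-style report.

    Assumes six tab-separated columns with the taxid in column 5 and a
    space-indented name in column 6, listed in depth-first order.
    """
    taxids = set()
    root_depth = None
    for line in report_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 6:
            continue
        name_field = fields[5]
        depth = len(name_field) - len(name_field.lstrip(" "))
        if root_depth is None:
            if name_field.strip() == root_name:
                root_depth = depth
                taxids.add(fields[4].strip())
        elif depth > root_depth:
            taxids.add(fields[4].strip())  # still inside the clade
        else:
            break  # indentation dropped back: the clade has ended
    return taxids


def classified_ids(kraken_lines, taxids):
    """Return IDs of sequences classified ('C') to one of the given taxids."""
    ids = set()
    for line in kraken_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) >= 3 and fields[0] == "C" and fields[2].strip() in taxids:
            ids.add(fields[1])
    return ids


def filter_fasta(fasta_lines, keep_ids):
    """Keep only FASTA records whose ID (first word of the header) is in keep_ids."""
    kept, keep = [], False
    for line in fasta_lines:
        if line.startswith(">"):
            words = line[1:].split()
            keep = bool(words) and words[0] in keep_ids
        if keep:
            kept.append(line)
    return kept


# Toy data, invented for illustration only (real reports have more ranks).
report = [
    " 10.00\t1\t1\tU\t0\tunclassified\n",
    " 90.00\t9\t0\tR\t1\troot\n",
    " 60.00\t6\t0\tD\t2\t  Bacteria\n",
    " 60.00\t6\t6\tS\t562\t    Escherichia coli\n",
    " 30.00\t3\t0\tD\t2759\t  Eukaryota\n",
    " 30.00\t3\t3\tS\t9606\t    Homo sapiens\n",
]
kraken_output = [
    "C\tcontig_1\t562\t1200\t562:50\n",
    "C\tcontig_2\t9606\t900\t9606:30\n",
    "U\tcontig_3\t0\t500\t0:10\n",
]
fasta = [">contig_1 len=1200\n", "ACGT\n", ">contig_2\n", "GGCC\n", ">contig_3\n", "TTAA\n"]

bacterial = classified_ids(kraken_output, clade_taxids(report))
print("".join(filter_fasta(fasta, bacterial)), end="")
```

With real data, the three lists would be replaced by open file handles for the report, the Kraken2 output, and the contigs FASTA. Contigs left unclassified are dropped by this sketch, so the result depends on how complete the chosen Kraken2 database is.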
