Fool seeks aid for Lefsa analysis in Galaxy

dan_ja · February 9, 2024, 5:25pm

Hi all,

I’m having some trouble with analysing my sequencing results using Lefsa, I’ve been trying to get my head around all the ways in which I’m getting it wrong but honestly I keep just hitting road blocks, whether it’s Galaxy, Conda, Python or R.

I think (hope) it’s down to my actual data structure.

My primary data file looks like this:

while my metadata file is structured like so:

is there anything glaringly wrong with my data structure here that I need to change?

thank you!

jennaj · February 9, 2024, 7:02pm

Hi @dan_ja

Mostly a guess → labels in the first file are like this “High_Protein” and labels in the second file are like this “High Protein”. And, “KD10__” versus “KD10”. Then, “Sample” versus “NAME”. Then, “Condition” versus “Protein_group”.

Tools that are merging data between files want exact matches for the labels. Plus, R tools don’t like values that include spaces, odd characters, or that start with a number. So – all OneWord, not starting with a number, and only use underscores (optional) as One_Word for compound names.

In Galaxy, the tools have an extra component added in that can smooth that naming out, but it is impossible to be perfect about that, especially values common between different files, so simplify the naming yourself if trouble comes up.

I’d start with addressing that first.

Topic		Replies	Views
LEfSe after update on the Galaxy	9	569	June 9, 2021
using LEfse on the galaxy usegalaxy.org support qiime2	0	665	April 20, 2021
Question：an error in LefSe	2	980	December 3, 2018
Problems with format data for LEfSe usegalaxy.org support metagenomics , lefse	1	1318	April 23, 2021
LEfSe Formatting Error galaxy-local , lefse , mothur	1	1572	March 25, 2019

Fool seeks aid for Lefsa analysis in Galaxy

Related topics