Scanpy Find Marker Genes groupby formatting

Kashish_Kumar · June 21, 2023, 5:00am

I am looking to find differentially expressed genes between the disease severity sample groupings within each cell type cluster (scRNAseq with scanpy) rather than across all clusters. Using two separate strings in the groupby section yields an error, so how can this specific subgrouping be specified through a file?

jennaj · June 21, 2023, 7:33pm

Hi @Kashish_Kumar

Did you try using the file option for groupby instead? I think the field directly on the form requires a single term. Or it may require a dash between the values. I'm not sure, but the tutorials here will probably help.

https://training.galaxyproject.org/training-material/search2?query=scanpy

Kashish_Kumar · June 21, 2023, 7:48pm

Thank you for your response! Yes, I am unable to find the information on the format/parameters I should pass through the file groupby.

jennaj · June 21, 2023, 7:56pm

Hmm, the file is probably one key/value per line.

The direct input seems to be comma separated.

Many ways to slice up and label data, then to reference the same for calculations/plots, are covered in this specific tutorial: Clustering 3K PBMCs with Scanpy

Kashish_Kumar · June 21, 2023, 10:11pm

I received KeyErrors both when separating the values with a comma and when using the line-separated file. The tutorial does not specify a method for what I am aiming for. I will try to separate it based on groups, but I don’t think this will provide cell-type-specific marker DE genes.
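
A KeyError on groupby usually means the name passed is not a column of `adata.obs`. The sketch below is one way to check that outside Galaxy, assuming the dataset can be downloaded as an h5ad file and that `anndata` is installed. The helper names `missing_obs_keys` and `check_groupby_keys`, and the column names in the usage note, are illustrative and not part of any tool.

```python
def missing_obs_keys(obs_columns, requested):
    """Return the requested names that are not columns of adata.obs."""
    available = set(obs_columns)
    return [key for key in requested if key not in available]


def check_groupby_keys(h5ad_path, requested):
    """Load an AnnData file and report which requested obs keys are absent."""
    # Imported lazily: only needed when reading a real file.
    import anndata as ad

    adata = ad.read_h5ad(h5ad_path)
    print("obs columns:", list(adata.obs.columns))
    return missing_obs_keys(adata.obs.columns, requested)
```

For example, `check_groupby_keys("input.h5", ["celltype", "CoVID-19 severity"])` would list any of the two names that are missing. A comma-joined string such as `"celltype,CoVID-19 severity"` is treated here as a single name, so it would be reported as missing unless a column with exactly that name exists.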

jennaj · June 21, 2023, 10:53pm

Hi @Kashish_Kumar

The tutorials are just examples of converting over methods from the tool developer – or some publication – into the Galaxy version of the tools. You can do this directly, too.

Meaning, the underlying tools are the same; Galaxy just puts a GUI on top of them. Most functions are usually available, and if one is not, the help section will usually state why.

This sounds similar to what you are trying to do: Visualizing marker genes — Scanpy documentation

Kashish_Kumar · June 21, 2023, 11:30pm

Thank you. I have changed some of the “Advanced Settings” inputs so that the command being run is the following. This might result in what I require. I need DE gene expression tables rather than plots.

scanpy-find-markers --save diffexp.tsv --n-genes '500' --groupby 'CoVID-19 severity' --method 't-test_overestim_var' --use-raw --groups 'celltype' --reference 'rest' --filter-params 'min_in_group_fraction:0.25,max_out_group_fraction:0.5,min_fold_change:2.0' --input-format 'anndata' input.h5 --show-obj stdout --output-format anndata output.h5
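
For comparison, in scanpy's Python API `groupby` takes a single `adata.obs` column and `groups` restricts the comparison to levels of that column, so it does not select a second column. Whether the command-line flags above behave the same way is an assumption on my part. One way to get severity comparisons inside each cell type is to subset the data per cell type and run the ranking on each subset. The following is a minimal sketch of that idea, not the Galaxy tool's method. It assumes `scanpy` and `pandas` are installed, that the obs columns are named `celltype` and `CoVID-19 severity` as in the command, and a scanpy version where `group=None` returns all groups. The function names are hypothetical.

```python
def indices_by_label(labels):
    """Group positional cell indices by label."""
    groups = {}
    for i, label in enumerate(labels):
        groups.setdefault(str(label), []).append(i)
    return groups


def de_within_cell_types(adata, celltype_key="celltype",
                         severity_key="CoVID-19 severity",
                         method="t-test_overestim_var", n_genes=500):
    """Rank genes between severity groups separately inside each cell type."""
    # Imported lazily so indices_by_label can be used without scanpy.
    import pandas as pd
    import scanpy as sc

    tables = []
    for cell_type, idx in indices_by_label(adata.obs[celltype_key]).items():
        sub = adata[idx].copy()
        # Keep only severity levels with at least two cells in this subset.
        severity = sub.obs[severity_key].astype(str)
        counts = severity.value_counts()
        usable = counts[counts >= 2].index
        if len(usable) < 2:
            continue  # nothing to compare within this cell type
        sub = sub[severity.isin(usable).to_numpy()].copy()
        # Rebuild the categorical so levels absent from the subset are dropped.
        sub.obs[severity_key] = (
            sub.obs[severity_key].astype(str).astype("category")
        )
        sc.tl.rank_genes_groups(sub, groupby=severity_key, reference="rest",
                                method=method, n_genes=n_genes)
        table = sc.get.rank_genes_groups_df(sub, group=None)
        table.insert(0, celltype_key, cell_type)
        tables.append(table)
    return pd.concat(tables, ignore_index=True) if tables else pd.DataFrame()
```

The result is a table rather than a plot, so something like `de_within_cell_types(adata).to_csv("diffexp_by_celltype.tsv", sep="\t", index=False)` would write one row per gene, severity group, and cell type. Filtering equivalent to `--filter-params` is not included here.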
