using published genomic data

seq2022 · January 1, 2023, 6:21pm

Hi,

I would like to combine samples from different studies to do differential gene expression. For example, paper 1 has published 10 nasal samples from healthy children and 10 nasal samples from children with COVID infections; and paper 2 from a different research group has published 10 nasal samples with influenza infections. They may or may not have used the same sequencing platform/kits and seq depths.

Can I download the samples from both papers and do the the following DEG/pathway analyses etc on the dataset:

healthy children from paper 1 (as Factor 1), COVID from paper 1 (as factor 2)
healthy children from paper 1 (as Factor 1) and influenza from paper 2 (as Factor 2)?
COVID from paper 1 (as Factor 1) and influenza from paper 2 (as factor 2)
Healthy children (factor 1) vs COVID (factor 2) vs. Influenza (factor 3)

Since I will be using the same bioinformatic tools and commands to process both sets in the same analysis and all transcripts will be aligned to the same assembly, will this normalize data correctly and allow for valid comparisons?

Thanks,
Swati

igor · January 3, 2023, 12:45am

Hi @seq2022
maybe ask people at a bioinformatics forum or DGE software developers. How do you distinguish between batch effect and potential biological effect in the described scenario with one condition coming from a different source?
Kind regards,
Igor

Topic		Replies	Views
Comparing SE and PE read samples	2	560	March 8, 2019
DESeq2 2x2 Design and Interaction Effect transcriptomics , limma , deseq2 , edger	1	442	March 16, 2023
Paired Samples in Lima Voom for Bulk RNAseq usegalaxy.org support	0	343	August 8, 2020
Deseq2 Biological v. Sequencing replicates	0	286	March 4, 2021
Deseq normalized values and RNA seq analysis	0	264	April 1, 2020

using published genomic data

Related Topics