goseq error in length and/or count files?

Hello there,
I did some RNAseq analysis, which actually worked well.
Now I wanted to move on with DEG analysis. I took the result files from the former analysis and everything worked fine until the goseq analysis:

Error in [.default(summary(map), , 1) : incorrect number of dimensions

This happens either I manipulate my data or use the originals from the former feature count analysis.

I know that two coworkers also used these exact workflows without any issues.
May you help with some expertise? Thanks :slight_smile:

History:

Workflow invocation: 9dbd24ecdd3fcec8

Welcome @mars142

Thanks for sharing your history, super helpful!

The problem is with the Ensembl gene identifiers – these have an extra version .N and the goseq tool doesn’t understand how to parse those correctly.

If you scroll down on the tool form, you’ll see the expected format, and those happen to use Ensembl too.

I’m guessing that your coworkers were either using a different annotation source, or they removed the versioning.

This recent topic here has more details (a different Bioconductor tool, but all are in R and work the same at a technical level). →


What to do

Hope this helps! :slight_smile:

Hey, thanks for your help. Seemed to work. Now I just got the problem of having absolute zero differences in expression values, which should not be the case. I can not find the case for that. Maybe I switched data or parameters. But I did not find any differences, while going through the training material.
For your information, I want to analyse 5 treated cell cultures against one control. So looking over all experiments, that were done before, there must be a difference in gene expression.

Hi @mars142

It is hard to see what is going on with this result. But I do see that you had problems linking back in the annotation. You could switch to using the UCSC version of this annotation instead. It already has the simplified Ensembl genes/transcripts and will work with all these tools without extra manipulations.

Other than that, maybe only having one control sample is the root issue.

There is probably a lot of discussion about this online, but we have one captured in our FAQ here as a starting place.

Hope this helps! :scientist: