i have 28 samples each containing gene Ensembl IDs and read counts. i want to use edgeR to normalize to TPM here on galaxy but when i load my data and run it the following error appears.
Fatal error: Exit code 1 ()
Error: Sample IDs in factors file and count matrix don’t match
This may seem obvious, but review the Sample IDs in both inputs and make sure they are a match.
Be sure to not start IDs with numbers – and use only alphanumeric characters (a-z, A-Z, 0-9), dashes, and underscores in the names.
Also check for the following, some which may look “hidden” when viewing the datasets (“whitespace” characters):
inconsistent capitalization
blank fields, internal or at the end of lines
blank lines
extra or trailing spaces/tabs
The first three you’ll need to fix yourself, before uploading the data to Galaxy or sometimes using Text Manipulation tools within Galaxy (you’ll need to experiment).
The last can often be remedied by using the “convert spaces to tabs” option in the Upload tool.
Let’s start there, and the EU team can jump in if problems persist.