Ways to make a variant calling for RNA Seq (paired-end)

Sofi · February 22, 2023, 4:57pm

Hello everyone! I’m a newer in bioinformatic and galaxy - sorry
I trying to use RNA STAR tool. I have full human transcriptom (RNAseq) in 8files (with R1 and R2 reads). So I have to get names of genes and their characteristics (pathogenic etc) in the end for clinicians.
I want to create vcf file for 1-snpsift variant type and 2-snpsift annotate SNPs from dbsnp. And after this I wanted download this vcf file (for ClinVar annotation etc).

I dont need the Trimmomatic so I was trying to do next:
Fastq–>RNA-STAR(hg38)–>Samtoolsmerge(for mapped.bam)—>FreeBayes

But FreeBayes show me the error:

{ “code_desc”: “”, “desc”: “Fatal error: Exit code 101 ()”, “error_level”: 3, “exit_code”: 101, “type”: “exit_code” }
Job API ID: bbd44e69cb8906b5339b2fad2d9562d3

Please explain me what I do wrong and which tools I need to use for my purpose?

jennaj · March 1, 2023, 12:38am

Hi @Sofi

You should at least run FastQCand probably Fastq Info on your read data to make sure there is not a content/format problem.

These prior topics may also be helpful: Search results for 'FreeBayes order:likes' - Galaxy Community Help

Does the Dataset Details view contain any other informative messages? If not, check the prior topics for common help with this protocol, along with the tutorials below.

If you need more help, please post back a share link to the history containing this error. Please leave all inputs and outputs undeleted, along with the outputs from the QA checks. Sharing your History

Tutorials

Sofi · March 27, 2023, 10:25am

Thank you for answer. I’m sure for 1000% about high quality of my fastq-files. I made FastQC for files to demonstration it.
Here’s my history Galaxy
My problem is 106. FreeBayes.
I’ll be waiting for your commentaries

jennaj · March 27, 2023, 7:38pm

Hi @Sofi

It looks like you mapped against a different reference genome when running RNAStar (likely by accident).

Try rerunning those alignment jobs using the exact same reference database that is used for other steps → hg38. Then proceed with downstream steps using the newer BAM outputs.

Sofi · April 2, 2023, 11:56pm

Thank you, it really was a mistake. I used Samtools merge for my 4 .bam files after RNA STAR and file “samtools merge” is 2.3GB. But after FreeBayes file contains ~ 18k lines (7Mb)
Tell me please is it normal for RNAseq?
Can I use next tools for this file:
SnpSift Filter → SnpSift Annotate (+dbSNP_146_hg38.vcf)

jennaj · April 12, 2023, 7:03pm

Do you mean the VCF output? Yes, those are usually much smaller than BAMs.

Examples are included in the variant tutorials linked above.

Topic		Replies	Views
Variant calling with RNA-seq	3	1393	March 17, 2021
Comparing SNPs across samples and sample groups variant-analysis	3	30	July 19, 2024
Variant calling from VCF files chrominfo , vcf	3	718	October 16, 2023
RNAseq variant calling usegalaxy.eu support	4	586	August 28, 2021
variant calling samtools vcf	5	933	December 13, 2021

Ways to make a variant calling for RNA Seq (paired-end)

Related topics