import dataset NCBI

Dear all, I cannot import dataset from UNITE or NCBI…is there any tutorial about this issue?
thanjs

Welcome, @Anna_Maria

These FAQs have some tips about loading data to Galaxy. https://training.galaxyproject.org/training-material/faqs/galaxy/#data%20upload

And many tutorials have examples that include an Upload step. You can also navigate the training site by analysis domain. https://training.galaxyproject.org/training-material/search?query=upload

If you still need more help after reviewing, please share more details. Troubleshooting errors

Dearc jennifer tha nks for the support but I really do not know how to porceed. Could you send me a protocol for the analysis of the 18S (fungi)
thanks
anna Maria

Please see Analyses of metagenomics data - The global picture

Dear Jennifer many thanks, the document refers to 16s analysis . I repeat all the steps but I do not know how to go on when I have to align my ITS sequences (18s) . what is the :

Select Reference Template
“reference” to the reference file in the ncbi nrDNA database ? how can upload the file?

many thanks

Anna Maria

Hello,

Tutorials are examples. Going through them will help you to learn and understand what appropriate inputs and parameters are involved. None of this is exact, and is what you would need to do even if working outside of Galaxy.

In short, you’ll need to adapt methods for your own data. Suggestions:

  1. find a few publications that do something similar to what you want to do
  2. consider visiting general bioinformatics forums
  3. tool author forums
  4. public data repositories
  5. sites hosted by scientific communities focused on your research domain
  6. leveraging resources that are not currently using Galaxy is expected. GTN tutorials cover a small slice of available bioinformatics tools/methods.

Others are welcome to add more comments!

dear Jennifer finally I got a results but it seems that something is wrong . If I copy that sequence obtained (fileterd…merged…ecc) they blast on NCBI from position 200nt…as only 1 of the paired sequence has been processed. I add here the steps I did

Dear Jenniferfinally I got a results but it seems that something is wrong . If I copy that sequence obtained (fileterd…merged…ecc) they blast on NCBI from position 200nt…as only 1 of the paired sequence has been processed. I add here the steps I did and the files.The sections in yellow are more critical pointsù

Could you please …thank you!!!

data
PS8_1.fastq.gz
PS8_2.fastq.gz
PS8_F_filt.fastq.gz
PS8_R_filt.fastq.gz
PS8count
PS8dadaehisotry.Rhistory
PS8taxa
tabella
UNITE.fasta

help me to understand what is wrong?

fnFs ← sort(list.files(pattern=“PS8_1.fastq”))

fnRs ← sort(list.files(pattern=“PS8_2.fastq”))

sample.names ← sapply(strsplit(fnFs, “_”), [, 1)

fnFs ← file.path(fnFs)

fnRs ← file.path(fnRs)

plotQualityProfile(fnFs[1:2])

> filt_path <- file.path( "filtered")
> filtFs <- file.path(filt_path, paste0(sample.names, "_F_filt.fastq.gz"))
> filtRs <- file.path(filt_path, paste0(sample.names, "_R_filt.fastq.gz"))
> out <- filterAndTrim(fnFs, filtFs, fnRs, filtRs,
+                      maxN=0, maxEE=c(2,2), truncQ=2, rm.phix=TRUE,
+                      compress=TRUE, multithread=FALSE)
Creating output directory: filtered
> errF <- learnErrors(filtFs, multithread=TRUE)
14339212 total bases in 50672 reads from 1 samples will be used for learning the error rates.
> errR <- learnErrors(filtRs, multithread=TRUE)
11466811 total bases in 50672 reads from 1 samples will be used for learning the error rates.
> derepFs <- derepFastq(filtFs, verbose=TRUE)
Dereplicating sequence entries in Fastq file: filtered/PS8_F_filt.fastq.gz
Encountered 29001 unique sequences from 50672 total sequences read.
> derepRs <- derepFastq(filtRs, verbose=TRUE)
Dereplicating sequence entries in Fastq file: filtered/PS8_R_filt.fastq.gz
Encountered 36009 unique sequences from 50672 total sequences read.
> names(derepFs) <- sample.names
Warning message:

Hi @Anna_Maria

Are you still working in Galaxy?

Are you following/adapting a published protocol?

dear Jennifer, I am following this protocol
https://training.galaxyproject.org/training-material/topics/metagenomics/tutorials/mothur-miseq-sop/tutorial.html

Dear Jennifer, could you please help me?
thanks
Anna Maria

I’m not sure why you are having problems when mapping at NCBI. I can let you know that BLAST excepts fasta data as an input, not fastq.

The tutorial you are following 16S Microbial Analysis with mothur (extended) includes a workflow. Maybe you can adapt that to fit your analysis?