Nanopore Preprocessing

Before starting any analysis, it is always a good idea to assess the quality of your input data and to discard poor-quality base content by trimming and filtering reads.

Generally, we are not interested in the host sequences, but rather only those originating from the pathogen itself. It is important to get rid of all host sequences and to only retain sequences that might include a pathogen, both in order to speed up further steps and to avoid host sequences compromising the analysis.

Input Datasets

Collection of sequenced Nanopore reads of all samples to be analysed in a fastqsanger or fastqsanger.gz format.

Output Datasets

Collection of Pre-Processed Sequenced reads of all samples, ready for further analysis with the other workflows, in a fastqsanger or fastqsanger.gz format.
Tables indicating total number of reads before and after host sequences trimming, and the host sequences percentages found in each sample.

If you're unsure how to use this workflows, or if you want to see it in action with test datasets, it is included in our detailed training material for foodborne pathogen detection and tracking. You can find step-by-step instructions and practical examples in the following GTN tutorial

Changelog

[0.1] 2024-04-25

First release.

The following tools are required to run this workflow.

This will eventually be a pretty page with links to each tool in the (new) toolshed, etc.

toolshed.g2.bx.psu.edu/repos/iuc/porechop/porechop/0.2.4+galaxy0
toolshed.g2.bx.psu.edu/repos/iuc/nanoplot/nanoplot/1.42.0+galaxy1
toolshed.g2.bx.psu.edu/repos/devteam/fastqc/fastqc/0.74+galaxy0
toolshed.g2.bx.psu.edu/repos/iuc/fastp/fastp/0.23.4+galaxy0
toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.11+galaxy1
toolshed.g2.bx.psu.edu/repos/iuc/minimap2/minimap2/2.28+galaxy0
toolshed.g2.bx.psu.edu/repos/iuc/bamtools_split_mapped/bamtools_split_mapped/2.5.2+galaxy2
Grep1
toolshed.g2.bx.psu.edu/repos/iuc/samtools_fastx/samtools_fastx/1.15.1+galaxy2
toolshed.g2.bx.psu.edu/repos/nml/collapse_collections/collapse_dataset/5.1.0
__FILTER_FAILED_DATASETS__
toolshed.g2.bx.psu.edu/repos/iuc/kraken2/kraken2/2.1.1+galaxy1
Cut1
toolshed.g2.bx.psu.edu/repos/iuc/krakentools_extract_kraken_reads/krakentools_extract_kraken_reads/1.2+galaxy1
toolshed.g2.bx.psu.edu/repos/iuc/collection_column_join/collection_column_join/0.0.3
toolshed.g2.bx.psu.edu/repos/devteam/column_maker/Add_a_column1/2.0
toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regexColumn1/1.0.3