Gene-based Pathogen Identification

Nanopore datasets analysis - Phylogenetic Identification - antibiotic resistance genes detection and contigs building

Gene-based Pathogen Identification

In this workflow, we determine whether the samples are pathogenic or not, by looking for genes known to be linked to pathogenicity or to the pathogenecity character of the organism.

  • Virulence Factor (VF): gene products, usually proteins, involved in pathogenicity. By identifying them, we can call a pathogen and its severity level

  • Antimicrobial Resistance genes (AMR).

    These type of genes have three fundamental mechanisms of antimicrobial resistance that are enzymatic degradation of antibacterial drugs, alteration of bacterial proteins that are antimicrobial targets, and changes in membrane permeability to antibiotics, which will lead to not altering the target site and spread throughput the pathogenic bacteria decreasing the overall fitness of the host.

In this workflow we:

  1. Perform genome assembly to get contigs, i.e. longer sequences, using metaflye (Kolmogorov et al. 2020) then assembly polishing using medaka consensus pipeline and visualizing the assembly graph using Bandage Image (Wick et al. 2015)
  2. Generate reports with AMR genes and VF using ABRicate

Input Datasets

  • Collection of Pre-Processed Sequenced reads of all samples, ready for further analysis with the other workflows, in a fastqsanger or fastqsanger.gz format, the output of Nanopore Preprocessing workflow.

Output Datasets

  • FASTA and Tabular files to track genes and visualise our pathogenic identification through out all samples.

If you're unsure how to use this workflows, or if you want to see it in action with test datasets, it is included in our detailed training material for foodborne pathogen detection and tracking. You can find step-by-step instructions and practical examples in the following GTN tutorial