Proteogenomics
purlPURL: https://gxy.io/GTN:P00023Comment: What is a Learning Pathway?We recommend you follow the tutorials in the order presented on this page. They have been selected to fit together and build up your knowledge step by step. If a lesson has both slides and a tutorial, we recommend you start with the slides, then proceed with the tutorial.
This learning path aims to teach you the basics of how to perform proteogenomics analysis of the Mass spectrometry data within the Galaxy platform. You will learn how to use Galaxy for analysis and will be guided through the most common first steps of any proteogenomics database generation to searching the database, followed by novel peptide data analysis.
New to Galaxy and/or the field of metaproteomics? Follow this learning path to get familiar with the basics!
Module 1: Database generation
Get a first look at the Galaxy platform for data analysis. We start with a short introduction to familiarize you with the Galaxy interface, and then proceed with understanding how to generate a customized database for proteogenomics.
Time estimation: 30 minutes
Learning Objectives
- Generating a customized Protein sequence database with variants, indels, splice junctions and known sequences.
Lesson | Slides | Hands-on | Recordings |
---|---|---|---|
Proteogenomics 1: Database Creation |
Module 2: Database searching
This section helps to guide the users through an MSMS dataset search against the customized database generated in the first module. The identified peptides and proteins will be then analyzed later in the novel peptide analysis.
Time estimation: 15 minutes
Learning Objectives
- A proteogenomic data analysis of mass spectrometry data to identify and visualize variant peptides.
Lesson | Slides | Hands-on | Recordings |
---|---|---|---|
Proteogenomics 2: Database Search |
Module 3: Novel Peptide Analysis
The last module in the proteogenomics tutorial is to identify “novel peptides” using BlastP and to localize the peptides to their genomic coordinates. Both inputs from modules 1 and 2 are required to run this tutorial.
Time estimation: 30 minutes
Learning Objectives
- How to assign and visualize the genomic localization of these identified novel proteoforms?
Lesson | Slides | Hands-on | Recordings |
---|---|---|---|
Proteogenomics 3: Novel peptide analysis |
Editorial Board
This material is reviewed by our Editorial Board:
Subina Mehta