Assembly

DNA sequence data has become an indispensable tool for Molecular Biology & Evolutionary Biology. Study in these fields now require a genome sequence to work from. We call this a ‘Reference Sequence.’ We need to build a reference for each species. We do this by Genome Assembly. De novo Genome Assembly is the process of reconstructing the original DNA sequence from the fragment reads alone.

Requirements

Before diving into this topic, we recommend you to have a look at:

Material

You can view the tutorial materials in different languages by clicking the dropdown icon next to the slides (slides) and tutorial (tutorial) buttons below.
Lesson Slides Hands-on Recordings Input dataset Workflows
An Introduction to Genome Assembly
An introduction to get started in genome assembly and annotation
Assembly of metagenomic sequencing data
Assembly of the mitochondrial genome from PacBio HiFi reads
Chloroplast genome assembly
De Bruijn Graph Assembly
Decontamination of a genome assembly
Deeper look into Genome Assembly algorithms
ERGA post-assembly QC
Genome Assembly Quality Control
Genome Assembly of MRSA from Oxford Nanopore MinION data (and optionally Illumina data)
Genome Assembly of a bacterial genome (MRSA) sequenced using Illumina MiSeq Data
Genome assembly using PacBio data
Large genome assembly and polishing
Making sense of a newly assembled genome
Unicycler Assembly
Unicycler assembly of SARS-CoV-2 genome with preprocessing to remove human genome reads
Using the VGP workflows to assemble a vertebrate genome with HiFi and Hi-C data
Vertebrate genome assembly using HiFi, Bionano and Hi-C data - Step by Step

Frequently Asked Questions

Common questions regarding this topic have been collected on a dedicated FAQ page . Common questions related to specific tutorials can be accessed from the tutorials themselves.

Follow topic updates rss-feed with our RSS Feed

Community Resources

Community Home Maintainer Home

Editorial Board

This material is reviewed by our Editorial Board:

orcid logoSimon Gladman avatar Simon GladmanAnton Nekrutenko avatar Anton Nekrutenkoorcid logoDelphine Lariviere avatar Delphine Lariviereorcid logoCristóbal Gallardo avatar Cristóbal Gallardo

Contributors

This material was contributed to by:

orcid logoDelphine Lariviere avatar Delphine LariviereFabian Recktenwald avatar Fabian Recktenwaldorcid logoWolfgang Maier avatar Wolfgang Maierorcid logoPolina Polunina avatar Polina PoluninaMarcella Sozzoni avatar Marcella SozzoniBrandon Pickett avatar Brandon Pickettorcid logoLinelle Abueg avatar Linelle Abuegorcid logoMiaomiao Zhou avatar Miaomiao Zhouorcid logoBjörn Grüning avatar Björn Grüningorcid logoAnthony Bretaudeau avatar Anthony Bretaudeauorcid logoLaura Leroi avatar Laura Leroiorcid logoTom Brown avatar Tom BrownMatúš Kalaš avatar Matúš KalašDeepti Varshney avatar Deepti Varshneyorcid logoStéphanie Robin avatar Stéphanie Robinorcid logoSaskia Hiltemann avatar Saskia Hiltemannorcid logoHelena Rasche avatar Helena Rascheorcid logoNate Coraor avatar Nate Coraororcid logoAlexandre Cormier avatar Alexandre CormierTeresa Müller avatar Teresa MüllerAnton Nekrutenko avatar Anton NekrutenkoGiulio Formenti avatar Giulio Formentiorcid logoCristóbal Gallardo avatar Cristóbal Gallardoorcid logoAnna Syme avatar Anna Symeorcid logoAlex Ostrovsky avatar Alex OstrovskyBazante Sanders avatar Bazante Sandersorcid logoBérénice Batut avatar Bérénice Batutorcid logoErwan Corre avatar Erwan Correorcid logoSimon Gladman avatar Simon Gladman

Funding

These individuals or organisations provided funding support for the development of this resource