Assembly

DNA sequence data has become an indispensable tool for Molecular Biology & Evolutionary Biology. Study in these fields now require a genome sequence to work from. We call this a ‘Reference Sequence.’ We need to build a reference for each species. We do this by Genome Assembly. De novo Genome Assembly is the process of reconstructing the original DNA sequence from the fragment reads alone.

Requirements

Before diving into this topic, we recommend you to have a look at:

Material

You can view the tutorial materials in different languages by clicking the dropdown icon next to the slides (slides) and tutorial (tutorial) buttons below.
Lesson Slides Hands-on Recordings Input dataset Workflows
An Introduction to Genome Assembly
An introduction to get started in genome assembly and annotation
Assembly of metagenomic sequencing data
Assembly of the mitochondrial genome from PacBio HiFi reads
Chloroplast genome assembly
De Bruijn Graph Assembly
Decontamination of a genome assembly
Deeper look into Genome Assembly algorithms
ERGA post-assembly QC
Genome Assembly Quality Control
Genome Assembly of MRSA from Oxford Nanopore MinION data (and optionally Illumina data)
Genome Assembly of a bacterial genome (MRSA) sequenced using Illumina MiSeq Data
Genome assembly using PacBio data
Large genome assembly and polishing
Making sense of a newly assembled genome
Unicycler Assembly
Unicycler assembly of SARS-CoV-2 genome with preprocessing to remove human genome reads
Using the VGP workflows to assemble a vertebrate genome with HiFi and Hi-C data
Vertebrate genome assembly using HiFi, Bionano and Hi-C data - Step by Step

Frequently Asked Questions

Common questions regarding this topic have been collected on a dedicated FAQ page . Common questions related to specific tutorials can be accessed from the tutorials themselves.

Follow topic updates rss-feed with our RSS Feed

Editorial Board

This material is reviewed by our Editorial Board:

orcid logoSimon Gladman avatar Simon Gladman Anton Nekrutenko avatar Anton Nekrutenko orcid logoDelphine Lariviere avatar Delphine Lariviere orcid logoCristóbal Gallardo avatar Cristóbal Gallardo

Contributors

This material was contributed to by:

orcid logoSimon Gladman avatar Simon Gladman orcid logoMiaomiao Zhou avatar Miaomiao Zhou orcid logoLaura Leroi avatar Laura Leroi orcid logoAlex Ostrovsky avatar Alex Ostrovsky Marcella Sozzoni avatar Marcella Sozzoni orcid logoDelphine Lariviere avatar Delphine Lariviere Teresa Müller avatar Teresa Müller orcid logoSaskia Hiltemann avatar Saskia Hiltemann orcid logoStéphanie Robin avatar Stéphanie Robin orcid logoLinelle Abueg avatar Linelle Abueg orcid logoBérénice Batut avatar Bérénice Batut orcid logoAnna Syme avatar Anna Syme orcid logoPolina Polunina avatar Polina Polunina Deepti Varshney avatar Deepti Varshney orcid logoWolfgang Maier avatar Wolfgang Maier orcid logoHelena Rasche avatar Helena Rasche Matúš Kalaš avatar Matúš Kalaš orcid logoAlexandre Cormier avatar Alexandre Cormier Bazante Sanders avatar Bazante Sanders Fabian Recktenwald avatar Fabian Recktenwald orcid logoAnthony Bretaudeau avatar Anthony Bretaudeau orcid logoBjörn Grüning avatar Björn Grüning Anton Nekrutenko avatar Anton Nekrutenko orcid logoTom Brown avatar Tom Brown orcid logoCristóbal Gallardo avatar Cristóbal Gallardo Brandon Pickett avatar Brandon Pickett Giulio Formenti avatar Giulio Formenti orcid logoErwan Corre avatar Erwan Corre orcid logoNate Coraor avatar Nate Coraor

Funding

These individuals or organisations provided funding support for the development of this resource