New Tutorial: Single-cell ATAC-seq standard processing with SnapATAC2

Author(s) orcid logoTimon Schlegel avatar Timon Schlegel
new tutorial single-cell epigenetics

Posted on: 12 July 2024 purlPURL: https://gxy.io/GTN:N00094

We are proud to announce that a new training, explaining the analysis of single cell ATAC-seq data with SnapATAC2 and Scanpy, is now available in the Galaxy Training Network.

SnapATAC2 pipeline. Open image in new tab

Figure 1: SnapATAC2 standard processing pipeline. The single cell data is first preprocessed and a high-quality count matrix is produced. Next, the dimensionality of the data is reduced and clusters visualized. Finally, clusters are manually annotated with marker genes.

The tutorial consists of three sections: Preprocessing, Dimensionality Reduction and Clustering, and Cluster Annotation. During the preprocessing, cell-by-feature count matrices are created, which are filtered to produce high-quality AnnData count matrices. After that, the dimensionality of the data is reduced through nonlinear spectral embedding. The data is then projected onto two-dimensional space and clusters are identified. To annotate the clusters, the gene activity for each cell is measured and the activity of marker genes is visualized with the Scanpy package. The activity profile of each cluster is then used to determine the correct cell type.

View Material

Recent News

See all news

Scaling Up Hands-On Bioinformatics Training with TIAAS – An Open University Perspective

3 July 2025   gtn TIAAS

Back in September 2024, we ran the Open University Bioinformatics Bootcamp—a free, five-day online course introducing students to the core tools and techniques used in single-cell biology. We were genuinely delighted by the level of interest: 120 students signed up, 100 showed up, and around 80 worked through the hands-on tutorials during the week. That’s a fantastic level of engagement, especially for a course that’s entirely optional and doesn’t count towards their degree.

Enhancing Scientific Training: The Galaxy Training Network's Role in the ELIXIR Training Life-Cycle

1 July 2025   gtn

In the rapidly evolving landscape of data science, continuous learning and skill development are crucial. The Galaxy Training Network (GTN) plays a pivotal role in this educational ecosystem, particularly within the ELIXIR Training Life-Cycle. This blog post explores how the GTN contributes to each phase of the life-cycle and aligns with the SPLASH recommendations, ensuring high-quality training for researchers worldwide.

An Ode to the Galaxy Community and all I learned

19 June 2025   gtn

This is my ode to the Galaxy Community to say how grateful I have been for your welcome, your energy, and your support. I have learned so very, very much, about bioinformatics; about software development; and most of all, about open-source communities in this complex scientific world.