ATACseq

This workflow takes as input a collection of paired fastq. It will remove bad quality and adapters with cutadapt. Map with Bowtie2 end-to-end. Will remove reads on MT and unconcordant pairs and pairs with mapping quality below 30 and PCR duplicates. Will compute the pile-up on 5' +- 100bp. Will call peaks and count the number of reads falling in the 1kb region centered on the summit. Will compute 2 normalization for coverage: normalized by million reads and normalized by million reads in peaks. Will plot the number of reads for each fragment length.

Author(s):
Lucille Delisle
Release: 0.17
License: MIT
UniqueID: 9debc3c9-ccbd-4ce6-a7de-80580e663cac

ATACseq Workflow

This workflow is highly concordant with the corresponding training material. You can have more information about ATAC-seq analysis in the slides and the tutorial.

Inputs dataset

The workflow needs a single input which is a list of dataset pairs of fastqsanger.

Inputs values

reference_genome: this field will be adapted to the genomes available for bowtie2 and the genomes available for bedtools slopbed (dbkeys table)
effective_genome_size: this is used by macs2 and may be entered manually (indications are provided for heavily used genomes)
bin_size: this is used when normalization of coverage is performed. Large values will allow to have smaller output files but with less resolution while small values will increase computation time and size of output files to produce more resolutive bigwigs.

Processing

The workflow will remove nextera adapters and low quality bases and filter out any read smaller than 15bp.
The filtered reads are mapped with bowtie2 allowing dovetail and fragment length up to 1kb.
The BAM is filtered to keep only MAPQ30, concordant pairs and pairs outside of the mitochondria.
The PCR duplicates are removed with Picard (only from version 0.8).
The BAM is converted to BED to enable macs2 to take both pairs into account.
The peaks are called with macs2 which at the same time generates a coverage file.
The coverage file is converted to bigwig
The amount of reads 500bp from summits and the total number of reads are computed.
Two normalizations are computed:
- By million reads
- By million reads in peaks (500bp from summits)
Other QC are performed:
- A histogram with fragment length is computed.
- The evaluation of percentage of reads to chrM or MT is computed.
A multiQC is run to have an overview of the QC.

Warning

The reference_genome parameter value is used to select references in bowtie2 and bedtools slopbed. Only references that are present in bowtie2 and bedtools slopbed are selectable. If your favorite reference genome is not available ask your administrator to make sure that each bowtie2 reference has a corresponding len file for use in bedtools slopbed.

Changelog

[0.17] 2024-09-23

Automatic update

toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.9+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.9+galaxy1
toolshed.g2.bx.psu.edu/repos/iuc/samtools_view/samtools_view/1.15.1+galaxy2 was updated to toolshed.g2.bx.psu.edu/repos/iuc/samtools_view/samtools_view/1.20+galaxy3
toolshed.g2.bx.psu.edu/repos/devteam/column_maker/Add_a_column1/2.0 was updated to toolshed.g2.bx.psu.edu/repos/devteam/column_maker/Add_a_column1/2.1
toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.11+galaxy1 was updated to toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.24.1+galaxy0

Manual update

Add a step to remove comments lines from histogram to be compatible with new multiQC version

[0.16] 2024-07-15

Automatic update

toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.8+galaxy1 was updated to toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.9+galaxy0

[0.15] 2024-05-27

Automatic update

toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.8+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.8+galaxy1
toolshed.g2.bx.psu.edu/repos/devteam/bowtie2/bowtie2/2.5.3+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/devteam/bowtie2/bowtie2/2.5.3+galaxy1
toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_bamtobed/2.30.0+galaxy2 was updated to toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_bamtobed/2.31.1+galaxy0
toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_slopbed/2.30.0+galaxy1 was updated to toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_slopbed/2.31.1+galaxy0
toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_mergebed/2.30.0 was updated to toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_mergebed/2.31.1
toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_coveragebed/2.30.0+galaxy1 was updated to toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_coveragebed/2.31.1+galaxy0

[0.14] 2024-04-22

Automatic update

toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.7+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.8+galaxy0

[0.13] 2024-04-08

Automatic update

toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.3+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.3+galaxy1
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.3+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.3+galaxy1

[0.12] 2024-03-25

Automatic update

toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.6+galaxy1 was updated to toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.7+galaxy0

[0.11] 2024-03-18

Automatic update

toolshed.g2.bx.psu.edu/repos/devteam/bamtools_filter/bamFilter/2.5.2+galaxy1 was updated to toolshed.g2.bx.psu.edu/repos/devteam/bamtools_filter/bamFilter/2.5.2+galaxy2

[0.10] 2024-03-14

Automatic update

toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.4+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.6+galaxy1
toolshed.g2.bx.psu.edu/repos/devteam/bowtie2/bowtie2/2.5.0+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/devteam/bowtie2/bowtie2/2.5.3+galaxy0
toolshed.g2.bx.psu.edu/repos/devteam/samtools_idxstats/samtools_idxstats/2.0.4 was updated to toolshed.g2.bx.psu.edu/repos/devteam/samtools_idxstats/samtools_idxstats/2.0.5
toolshed.g2.bx.psu.edu/repos/devteam/picard/picard_MarkDuplicates/2.18.2.4 was updated to toolshed.g2.bx.psu.edu/repos/devteam/picard/picard_MarkDuplicates/3.1.1.0
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/1.1.2 was updated to toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.3+galaxy0
toolshed.g2.bx.psu.edu/repos/iuc/samtools_view/samtools_view/1.15.1+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/iuc/samtools_view/samtools_view/1.15.1+galaxy2
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/1.1.1 was updated to toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.3+galaxy0

[0.9] 2023-10-23

Fix the normalization factor. It was coverage per reads and per reads in peaks instead of per million reads and per million reads in peaks.

[0.8] 2023-10-19

Fix the remove duplicate step! In all previous versions, due to an error, PCR duplicates were not removed.

[0.7] 2023-10-17

Automatic update

toolshed.g2.bx.psu.edu/repos/iuc/macs2/macs2_callpeak/2.2.7.1+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/iuc/macs2/macs2_callpeak/2.2.9.1+galaxy0

[0.6] 2023-09-27

Automatic update

toolshed.g2.bx.psu.edu/repos/bgruening/deeptools_bigwig_average/deeptools_bigwig_average/3.5.2+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/bgruening/deeptools_bigwig_average/deeptools_bigwig_average/3.5.4+galaxy0

[0.5.1] 2023-09-22

Fix bug in normalize profiles when used with multiple samples (in 0.5.0 it is averaging samples instead of normalizing each sample).

[0.5] 2023-03-17

Automatic update

toolshed.g2.bx.psu.edu/repos/devteam/picard/picard_MarkDuplicates/2.18.2.3 was updated to toolshed.g2.bx.psu.edu/repos/devteam/picard/picard_MarkDuplicates/2.18.2.4
toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_coveragebed/2.30.0 was updated to toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_coveragebed/2.30.0+galaxy1
toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.0+galaxy1 was updated to toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.4+galaxy0

Manual update

add normalization steps for coverage

[0.4] 2023-01-16

Automatic update

toolshed.g2.bx.psu.edu/repos/devteam/bamtools_filter/bamFilter/2.5.1+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/devteam/bamtools_filter/bamFilter/2.5.2+galaxy1

[0.3] 2022-12-17

Automatic update

toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.11+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.11+galaxy1

[0.2] 2022-11-28

Automatic update

toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.0+galaxy0 was updated to toolshed.g2.bx.psu.edu/repos/lparsons/cutadapt/cutadapt/4.0+galaxy1
toolshed.g2.bx.psu.edu/repos/devteam/bowtie2/bowtie2/2.4.5+galaxy1 was updated to toolshed.g2.bx.psu.edu/repos/devteam/bowtie2/bowtie2/2.5.0+galaxy0

[0.1] 2022-10-12

First release.

ATACseq

ATACseq Workflow

Inputs dataset

Inputs values

Processing

Warning

Changelog

[0.17] 2024-09-23

Automatic update

Manual update

[0.16] 2024-07-15

Automatic update

[0.15] 2024-05-27

Automatic update

[0.14] 2024-04-22

Automatic update

[0.13] 2024-04-08

Automatic update

[0.12] 2024-03-25

Automatic update

[0.11] 2024-03-18

Automatic update

[0.10] 2024-03-14

Automatic update

[0.9] 2023-10-23

[0.8] 2023-10-19

[0.7] 2023-10-17

Automatic update

[0.6] 2023-09-27

Automatic update

[0.5.1] 2023-09-22

[0.5] 2023-03-17

Automatic update

Manual update

[0.4] 2023-01-16

Automatic update

[0.3] 2022-12-17

Automatic update

[0.2] 2022-11-28

Automatic update

[0.1] 2022-10-12

The following tools are required to run this workflow.