Working with very large fastq datasets


  • Run FastQC on your data to make sure the format/content is what you expect. Run more QA as needed.
    • Search GTN tutorials with the keyword “qa-qc” for examples.
    • Search Galaxy Help with the keywords “qa-qc” and “fastq” for more help.
  • How to create a single smaller input. Search the tool panel with the keyword “subsample” for tool choices.
  • How to create multiple smaller inputs. Start with Split file to dataset collection, then merge the results back together using a tool specific for the datatype. Example: BAM results? Use MergeSamFiles.
Persistent URL
Resource purlPURL: https://gxy.io/GTN:F00051
Still have questions?
Gitter Chat Support
Galaxy Help Forum