Galaxy: collection patterns
Use this MOC to choose corpus-grounded Galaxy collection transformation patterns.
Pattern maps are the entry points; operation and recipe pages are the concrete Galaxy authoring moves that Molds can reference and casts can condense.
Use this MOC to choose corpus-grounded Galaxy collection transformation patterns.
Use this MOC to choose corpus-grounded Galaxy when and pick_value conditional patterns.
Use this MOC to choose corpus-grounded Galaxy genomic interval operations and recipes on coordinate features.
Use this MOC to choose corpus-grounded Galaxy operations on sequence records (FASTA) — interconvert, reformat, merge, length, extract/mask by region.
Use this MOC to choose corpus-grounded Galaxy tabular transformation patterns.
Use Apply Rules to promote identifier columns into a list:paired collection, with optional cleanup first.
Use BUILD_LIST to assemble named outputs into a collection bundle for publishing or downstream fan-in.
Use FILTER_EMPTY or FILTER_FAILED after map-over when bad elements would break downstream collection steps.
Use FLATTEN to collapse nested collection outputs to a flat list once the outer axis no longer matters.
Use SORTLIST with sort_type:file to reorder one collection by another collection's identifiers.
Derive collection element identifiers in a tabular mapping, then apply them with RELABEL_FROM_FILE.
Use Apply Rules, identifier extraction, find/replace, and relabeling for structural fan-out cleanup.
Use Apply Rules regex columns to split one collection identifier into nested list identifiers.
Use Apply Rules to regroup a list:list collection by swapping outer and inner identifier columns.
Use collection_element_identifiers with FILTER_FROM_FILE or RELABEL_FROM_FILE to align sibling collections.
Use __EXTRACT_DATASET__ with which: first when a one-element collection must become a dataset.
Derive a boolean from empty or non-empty data, then use when to skip reporting or export steps.
Use when-gated alternatives plus pick_value to merge binary or one-of-N routes into one downstream value.
Use a workflow boolean connected as inputs.when to skip an optional Galaxy step or branch.
Gate an optional transform, then use pick_value to pass transformed data when present or original data otherwise.
Clean sparse mapped outputs, keep sibling collections aligned, then gate report publishing on non-empty results.
Bundle parallel outputs into a collection consumer, then flatten nested results for pooled downstream processing.
Use a manifest or table to build a collection, map a tool per row, then relabel or reshape outputs.
Use Apply Rules and deterministic relabeling when domain fan-out creates the wrong map-over axis.
Compute regions from regions: concatenate candidate intervals, merge into non-overlapping spans, then subtract the set to keep. The gops_* set-algebra recipe.
Two coverage modes: genome-wide depth as a bedgraph (genomecoveragebed) and reads counted in given regions (coveragebed). Same family, different question.
Find features reproducible across replicates: multi-intersect per-replicate sets, threshold by replicate count, then intersect back against the merged call.
Keep, drop, or annotate coordinate features by overlap with a second feature set; bedtools intersect (BED) or vcfvcfintersect (VCF), mapped over a collection.
Collapse overlapping or book-ended intervals within one set into single spans; bedtools mergebed or the gops_merge Operate-on-Genomic-Intervals tool.
Extend features by a fixed or fractional amount to build neighborhood windows, clamped to chromosome ends; bedtools slopbed with a genome file.
Quantify signal in fixed neighborhoods around point features: window the features (slop), collapse overlaps (merge), then count reads in each window (coverage).
Use compose_text_param to build connected text expressions from constants plus runtime scalar values.
Read a one-value dataset with param_value_from_file, including count recipes that feed typed parameters.
Use map_param_value to translate workflow enum values into downstream tool codes, flags, or snippets.
Emit a (id, length) table from a FASTA so downstream tabular steps can filter, sort, or threshold records by length; fasta_compute_length.
Turn coordinates into sequence: extract FASTA at BED intervals (getfasta), mask regions by BED (maskfasta), or extract transcripts from a GFF (gffread).
Move sequence records between FASTA and a (header, sequence) table so tabular tools can edit them; fasta2tab one way, tab2fasta back.
Concatenate several FASTA files into one and drop duplicate records by sequence identity in a single step; fasta_merge_files_and_filter_unique_sequences.
Rewrap FASTA records to a fixed sequence-line width so downstream tools and viewers get canonical 60/70/80-column output; cshl_fasta_formatter.
Edit FASTA headers you cannot easily regex in place: fasta2tab, rewrite column 1 with find/replace, then tab2fasta back. The high-value sequence recipe.
Use column_maker (Add_a_column1) with strict error_handling to insert/replace a computed column. Per-expression-kind auto_col_types rule.
Use collapse_dataset to row-bind a collection of tabulars into one table, with optional element IDs and header dedupe.
Use Cut1 with a comma-separated cN list to project — and reorder — columns. Listing out of order is the canonical reorder idiom.
Use Filter1 with a Python expression over cN columns to drop rows. Highest-frequency tabular row filter in IWC.
Use tp_grep_tool for whole-line regex row filters on tabular input. Grep1 is the legacy alternative.
Use datamash_ops for grouped tabular aggregation: multi-column grouping, collapse, countunique, min/max, and reductions.
Use tp_easyjoin_tool for two-tabular key joins; use tp_multijoin_tool for many files and query_tabular for SQL joins.
Use collection_column_join to outer-join a collection of 2-column id/value tables into one wide table.
Use tp_awk_tool to prepend a constant header line, optionally skipping or reformatting an existing first row.
Use tp_awk_tool to replace each row or label with deterministic sample_N values from awk NR.
Use tp_awk_tool to split semicolon-delimited taxonomy strings into explicit rank columns with missing-rank handling.
Use query_tabular when SQL semantics justify it: windows, joins, anti-joins, or fused project+compute over tabulars.
Use tp_awk_tool to convert chrom/start/end rows into 6-column BED, subtracting 1 from start and setting constants.
Use split_file_to_collection split_by:col to fan a tabular into collection elements by row/key.