Galaxy: genomic interval patterns
The runtime-facing map for Galaxy coordinate-feature choices — operations that understand chrom/start/end/strand, as opposed to opaque-column galaxy-tabular-patterns or container-shaped galaxy-collection-patterns. Use it before loading raw survey notes; iwc-interval-operations-survey is the evidence backing, these pages are the actionable references.
This is the smallest of the three data-shape MOCs by design. Interval algebra is a real but moderate cluster in IWC — concentrated in epigenetics peak-consensus and SARS-CoV-2 masking — and its highest-value units are recipes, not single operations. Reach for the recipes first when your need is a multi-step construction.
Overlap
- interval-overlap-filter — keep/drop/annotate features by overlap with a second set (
bedtools intersect, or VCF-nativevcfvcfintersect).
Set operations
- interval-merge-overlapping — collapse a set’s own overlapping features into single spans (
bedtools merge/gops_merge). - interval-mask-by-set-algebra — (recipe) compute regions from regions: concat → merge → subtract (the
gops_*set-algebra mask).
Windows & coverage
- interval-window-flank — extend features into neighborhood windows (
bedtools slop). - interval-coverage — genome-wide depth (bedgraph) or reads-in-given-regions counts (
bedtools genomecov/coverage).
Recipes
- interval-consensus-by-multi-intersect — reproducible features across replicates: multi-intersect → count-threshold → intersect back.
- interval-mask-by-set-algebra — compute a mask: union (concat+merge) then difference (subtract).
- interval-windowed-coverage — quantify signal in fixed windows: slop → merge → coverage.
Bridges
- interval ↔ collection —
bedtoolsops map over per-sample collections viareduce_or_iterate: iterate; multi-intersect consumes a whole collection at once. See galaxy-collection-patterns. - interval ↔ tabular — coordinate construction (tabular-synthesize-bed-from-3col, and the indel-aware variant-to-BED arithmetic in interval-mask-by-set-algebra) and coordinate filtering (
Filter1 c4 < Non a bedgraph, tabular-filter-by-column-value). The line: tabular treats columns as opaque; interval understands coordinates. See galaxy-tabular-patterns.
Gaps (no corpus exemplar, no page)
Per corpus-first, these have zero IWC uptake and get no pattern page; documented here so the absence is explicit, not an oversight. GTN training-corpus counts below are grounded in iwc-interval-operations-survey §GTN cross-reference (non-IWC signal):
closest/ proximity — the “nearest feature + distance” operation. The interval-algebra form (bedtools closest, fetch-closest,windowbed) is absent from both IWC and GTN. The task is absent from IWC entirely; in GTN it appears only as a domain step (deeptools computeMatrixreference-point mode, TSS-relative signal), which #268 scopes out. This is the operation that motivated the MOC (#268). The natural tool —bedtools_closestbed(-dfor distance) — exists in the sameiuc/bedtoolssuite this MOC’s other tools come from; it simply has no corpus exemplar, so per corpus-first there is no recurring pattern to author, and an agent that needs nearest-feature-plus-distance reaches for it directly. (computeMatrixcovers only the narrower ChIP/ATAC TSS-proximity case, not a general substitute.)complement— taught in one GTN tutorial (assembly), zero IWC.- coordinate-aware
sort— taught across three GTN tutorials; IWC sorts intervals with tabularsort1instead. makewindows,map,window(-wneighborhood join),annotate,jaccard— zero in both corpora.
These are tracked as IWC-input-blocked candidates (GitHub requires-iwc-inputs); a page follows only when an IWC workflow uses the operation.
See also
- iwc-interval-operations-survey — interval-operation survey and evidence trail.
- galaxy-sequence-patterns — companion MOC for sequence-record operations; shares the extract/mask-by-region bridge (
getfasta/maskfastaproduce sequence from these intervals). - galaxy-tabular-patterns — companion MOC for opaque-column tabular operations.
- galaxy-collection-patterns — companion MOC for collection-container operations.