Home Mold

cwl-summary-to-galaxy-template

gxformat2 skeleton with per-step TODOs from a CWL summary and prior Galaxy design briefs.

Mold health

error
  • Source layout

    1 non-index Markdown file with frontmatter.

  • Axis fields

    source-specific fields are coherent.

  • Eval plan

    eval.md declares cases and check type.

  • Typed refs

    13 typed references; 0 resolver issues.

  • On-demand triggers

    All on-demand references describe triggers.

  • Evidence checks

    Hypothesis references include verification.

axis
source-specific
source
cwl
name
cwl-summary-to-galaxy-template
contract

Reference Loading

Typed Mold references describe what casting consumes and when the generated skill should load each artifact.

Schemagalaxy-workflow-draft

Structured contract copied for validation or lookup.

Purpose
Output contract: the emitted gxformat2 draft conforms to [[galaxy-workflow-draft]]. Cast bundles the JSON Schema so the skill carries its output shape alongside the [[draft-validate]] CLI checks.
CLI Commanddraft-validate

Command reference. Usually cast to a sidecar and loaded only when invoked.

Purpose
Validate the emitted draft against draft-contract rules (sentinel form, topology, _plan_* placement) before handing off.
Trigger
After writing or modifying the draft workflow file.
Verify
Cast the skill, run on a representative CWL workflow, confirm draft-validate diagnostics route back.
Researchcomponent-cwl-workflow-anatomy

Background synthesis loaded by explicit progressive-disclosure metadata.

Purpose
Preserve the lightweight CWL boundary and avoid re-inferring structure already present in the summary.
Verify
Run after the first CWL interface/data-flow briefs and confirm gxformat2 placeholder steps preserve CWL step boundaries unless the data-flow brief explicitly asks to split them.
Researchgalaxy-workflow-draft-format

Background synthesis loaded by explicit progressive-disclosure metadata.

Purpose
Emit the gxformat2 draft superset: TODO tool_id, optional tool_state / tool_shed_repository, and per-step _plan_state / _plan_context planning fields.
Verify
Promote after a downstream per-step implementation Mold consumes _plan_state and _plan_context without round-tripping back through the source summary.
Researchgalaxy-workflow-testability-design

Background synthesis loaded by explicit progressive-disclosure metadata.

Purpose
Choose stable workflow input/output labels, testable checkpoint outputs, and fixture-compatible workflow interfaces while drafting the skeleton.
Trigger
When the template decides workflow inputs, workflow outputs, promoted checkpoints, or collection output identifiers that future tests will need to address.
Researchgalaxy-collection-semantics

Background synthesis loaded by explicit progressive-disclosure metadata.

Purpose
Translate CWL arrays, records, scatter, and secondary-file shapes into Galaxy collection typing and map-over/reduction semantics.
Trigger
When creating workflow inputs, outputs, and placeholder connections involving collections.
Researchgalaxy-data-flow-draft-contract

Background synthesis loaded by explicit progressive-disclosure metadata.

Purpose
Respect the handoff from abstract data-flow draft to gxformat2 skeleton.
Trigger
When translating abstract nodes, unresolved tool needs, and placeholder transformations into template TODOs.
Verify
Promote after two worked CWL-to-Galaxy templates preserve the data-flow/template split without schema changes.
Patterngalaxy-tabular-patterns

Workflow-construction idiom. Usually condensed during casting.

Purpose
Use corpus-grounded tabular pattern guidance for unresolved skeleton steps.
Trigger
When adding TODO steps for tabular filtering, projection, joins, aggregation, text-processing recipes, or tabular-collection bridges.
Patterngalaxy-conditionals-patterns

Workflow-construction idiom. Usually condensed during casting.

Purpose
Use corpus-grounded conditional pattern guidance for unresolved skeleton steps.
Trigger
When adding TODO steps for optional steps, gating on non-empty results, routing between alternative outputs, or transform-or-pass-through branches.
Patterngalaxy-interval-patterns

Workflow-construction idiom. Usually condensed during casting.

Purpose
Use corpus-grounded genomic-interval pattern guidance for unresolved skeleton steps.
Trigger
When adding TODO steps for interval overlap, merge, coverage, windowing, masking, or set-algebra on coordinate features.

Cast artifacts

How to install →

Artifact handoffs

/ pipeline contract

Produces

Consumes

  • Structured CWL summary emitted by [[summarize-cwl]]; consulted while emitting placeholder steps.

    produced by
  • Galaxy interface brief from [[cwl-summary-to-galaxy-interface]] that pins workflow inputs, outputs, labels.

    produced by
  • Galaxy data-flow brief from [[cwl-summary-to-galaxy-data-flow]] that pins abstract operations and collection choices.

    produced by
  • Structural diff guidance from [[compare-against-iwc-exemplar]] (run on the design briefs); steers the skeleton toward IWC-aligned structure before per-step authoring. Carries an inline gxformat2 excerpt of the nearest exemplar.

    produced by
  • Cleaned gxformat2 view of the nearest IWC exemplar's relevant subgraph from [[compare-against-iwc-exemplar]]; pattern-match the draft's input/collection shapes, map-over wiring, output promotion, and post-job actions against this concrete idiom. Absent when no nearest exemplar was found.

    produced by

cwl-summary-to-galaxy-template

Read the original CWL source artifact, the CWL summary, the CWL-to-Galaxy interface brief, and the CWL-to-Galaxy data-flow brief. Emit a gxformat2 skeleton with workflow inputs, workflow outputs, placeholder steps, rough connections, and TODO slots for later implementation Molds.

CWL already carries structured workflow shape, so this Mold should be lighter than nextflow-summary-to-galaxy-template. Treat the prior-step index as the working context: CWL source, CWL summary, interface brief, data-flow brief, and any open questions carried forward.

Topology is this Mold’s job to settle. The output must be concrete gxformat2: workflow inputs with their final collection shapes and formats, workflow outputs, the step set, the producer→consumer edge graph, branches, and when: guards are all decided here. The upstream interface and data-flow briefs guide those decisions, but if they hedge or leave a topology choice open, this Mold makes the call from source evidence, IWC exemplars, and pattern pages — never emit a topology TODO. Wrapper resolution, by contrast, is evidence-gated, not source-gated: resolve each tool step to the tier its evidence supports — Resolved (fully concrete, no _plan_*), Identity-pinned (concrete tool_id, parameters and changeset left to the per-step Mold), or Deferred (tool_id: TODO) — as defined in galaxy-workflow-draft-format. Capture whatever you defer in the _plan_* family (_plan_state, _plan_context, _plan_in, _plan_out) so the per-step Mold has the source evidence and constraints it needs.

Source tendency: a CWL CommandLineTool carries baseCommand / arguments, DockerRequirement / SoftwareRequirement hints, and explicit input/output bindings — so identity is often inferable to Identity-pinned, and a step reaches Resolved when a pattern page or IWC exemplar covers the operation (fill tool_id, parameters, and port names from the worked example). A custom-script tool with no Galaxy equivalent stays Deferred — pack _plan_context with the baseCommand / arguments, DockerRequirement URIs, SoftwareRequirement packages, and EnvVarRequirement / ResourceRequirement constraints the per-step Mold needs to pick a wrapper. Emitting TODO over a pattern-covered recipe discards real evidence the per-step Mold cannot recover.

Output shape is gxformat2 with wrapper-tier relaxations and _plan_state / _plan_context / _plan_in / _plan_out per tool step — see galaxy-workflow-draft-format. Refinement open work for those planning fields lives in refinement.md.

Use CWL step ids as the first pass for placeholder labels, then revise labels only when the interface/data-flow briefs or IWC comparison notes give a clearer Galaxy convention. Preserve one placeholder per logical CWL step unless the data-flow brief explicitly asks to split an expression, nested workflow, or collection operation into Galaxy-native steps.

Incoming References (13)

  • Galaxy: collection patternsrelated mold— Use this MOC to choose corpus-grounded Galaxy collection transformation patterns.
  • Galaxy: conditionals patternsrelated mold— Use this MOC to choose corpus-grounded Galaxy when and pick_value conditional patterns.
  • Galaxy: genomic interval patternsrelated mold— Use this MOC to choose corpus-grounded Galaxy genomic interval operations and recipes on coordinate features.
  • Galaxy: sequence patternsrelated mold— Use this MOC to choose corpus-grounded Galaxy operations on sequence records (FASTA) — interconvert, reformat, merge, length, extract/mask by region.
  • Galaxy: tabular patternsrelated mold— Use this MOC to choose corpus-grounded Galaxy tabular transformation patterns.
  • CWL → GALAXYphase of pipeline— Path from a CWL Workflow to a Galaxy gxformat2 workflow. Lighter upstream extraction.
  • CWL workflow anatomyrelated mold— CWL structure relevant to summarize-cwl: normalized documents, steps, scatter, conditionals, requirements, and dependency handling.
  • CWL pickValue → Galaxy pick_value (post galaxy#22222)related mold— CWL `pickValue` (first_non_null / the_only_non_null / all_non_null) → Galaxy's native `pick_value` workflow step added by galaxyproject/galaxy#22222.
  • CWL when:/pickValue → Galaxy branching translationrelated mold— CWL `when:`/`pickValue` → Galaxy. Three honest translations (paired_or_unpaired input, native pick_value step, sibling workflows) plus how to pick among them.
  • Galaxy data-flow draft contractrelated mold— Defines the proposed boundary between Galaxy data-flow drafts, gxformat2 templates, and concrete step implementation.
  • Galaxy workflow draft formatrelated mold— gxformat2 draft superset: wrapper-tier TODOs (tool_id, tool_state, port names) plus _plan_state / _plan_context / _plan_in / _plan_out per tool step.
  • Galaxy workflow draft (gxformat2 superset)related mold— JSON Schema for `class: GalaxyWorkflowDraft` — gxformat2 with `TODO_*` sentinels and `_plan_*` planning fields per draft step.
  • CWL workflow summaryrelated note— JSON Schema for the structured summary emitted by the summarize-cwl Mold.