Nitrate DMQC for autonomous platforms such as Argo floats

Author(s) Marie Josse avatar Marie JosseVirginie Racape avatar Virginie Racape
Reviewers Marie Josse avatarBjörn Grüning avatar
Overview
Creative Commons License: CC-BY Questions:
  • How to access Argo data?

  • What hat is a DMQC analysis / QCV analysis?

  • How to qualify and calibrate nitrate in the ocean?

Objectives:
  • Learn how to do a QCV analysis on the nitrate variable of the ocean ?

Requirements:
Time estimation: 1 hour
Supporting Materials:
Published: Aug 8, 2025
Last modification: Sep 16, 2025
License: Tutorial Content is licensed under Creative Commons Attribution 4.0 International License. The GTN Framework is licensed under MIT
purl PURL: https://gxy.io/GTN:T00548
version Revision: 3

The next sections explain how to qualify and calibrate the nitrate sensor mounted on the Argo float 4903881 either tool by tool or by running a pre-defined Galaxy workflow. This requires two preliminary steps to prepare the environment.

For your information, execution time for running tools depends on the number of files, their size and/or the number of people working on Galaxy.

Warning: Do not change file name

DO NOT change file names to ensure that any changes you make with ODV will be carried over. However, it is possible to change the name of Galaxy collections to make them easier to find.

Agenda

In this tutorial, we will cover:

  1. Prerequisites
    1. Galaxy
    2. Nitrate calibration
  2. Manage your data
    1. Get your data
    2. Organize your data
  3. Run a predefined workflow
    1. Workflow human in the loop
    2. Available public workflows
  4. Tool by tool
    1. Harmonize your data
    2. Qualify your data
    3. Calibrate your data
    4. Validate your data
  5. Extra information

Prerequisites

Galaxy

Create or login to your Galaxy Europe account.

Once you are logged in, Galaxy is divided into 4 vertical panels:

  • Far left: Galaxy action list (Upload, Tools, Interactive Tools…)
  • Center: Main panel
  • Far right: History section

Galaxy organisation.

All symbols have a tooltip when you hover them. These names will be indicated with {} in this tutorial.

The history section stores all your job results with the following color code:

  • Grey = cancelled
  • Orange = running
  • Green = finished with success
  • Red = failed

It is possible to organise your history with sub-directories:

  • New history: click on + at the top-right of history, edit the name with the pencil, save
  • Switch history: click on the icon and select a history
  • Copy datasets: click on OperationsCopy datasets and choose source/destination histories

Galaxy operation symbol.

  • Quota per user account: check your quota. Free space by clicking the database icon in the history panel and “Review and clear” GB. (Note: deleting a dataset in the history does not free quota until you purge it.)

Galaxy quota info.

  • Time limit for interactive tools: 24h. After that, ODV closes automatically.
  • Avoid weekends: upgrades are performed and can stop tools/workflows.
  • Max interactive tools simultaneously: 2.

Nitrate calibration

Comment: Oxygen prerequisite

Because oxygen concentration is implied in the estimation of the nitrate adjustment from neural network method, platforms to be calibrated must have good-quality oxygen (adjusted in real time or delayed mode for Argo floats or gliders).

Nitrate calibration needs a potential reference dataset such as the WOA annual climatology. You can download it here and upload it into Galaxy.

Manage your data

Get your data

Hands On: Get your data from the S3 server
  • Click on Upload in the vertical panel on the left
  • In the pop-up window, click on Choose from repository
  • In the top search bar, search for argo
  • In the result list, select Argo marine floats data and metadata from Global Data Assembly Centre (Argo GDAC)
  • Navigate to pub/dac/coriolis/4903881
  • Tick the following:
    • 4903881_meta.nc
    • All files under profiles/
  • Click on Select (bottom-right)
  • Remove the synthetic files SR4903881_*.nc (by clicking the trash icon)
  • Click on Start
  • Wait until all files are stored in your history on the right (green color)

S3 upload interface.

Hands On: Get your data from your computer
  • Click on Upload in the vertical panel on the left
  • In the pop-up, click on Choose local file
  • Select the files on your system (Argo float or WOA)
  • Click on Open then Start
  • Wait until all files are stored in your history (green color)

Local upload interface.

Organize your data

Hands On: Creating a dataset collection with autobuild
  • Click on galaxy-selector Select Items at the top of the history panel Select Items button
  • Check n files of interest
  • Click n of N selected and choose Auto build List

    Collection building with autobuild

  • Enter a name for your collection to 4903881
  • Turn off Remove file extension

    Put a name and remove extension

  • Click Build to build your collection
  • Click on the checkmark icon at the top of your history again

Once the collection is created, all files turn green. You can limit visible files using the eye icons in the history panel.

Hands On: Change file extension (for WOA file)
  1. Change the datatype of the WOA file to NetCDF
  • Click on the galaxy-pencil pencil icon for the dataset to edit its attributes
  • In the central panel, click galaxy-chart-select-data Datatypes tab on the top
  • In the galaxy-chart-select-data Assign Datatype, select NetCDF from “New Type” dropdown
    • Tip: you can start typing the datatype into the field to filter the dropdown menu
  • Click the Save button

Change extension.

Hands-on: Choose Your Own Tutorial

This is a "Choose Your Own Tutorial" (CYOT) section (also known as "Choose Your Own Analysis" (CYOA)), where you can select between multiple paths. Click one of the buttons below to select how you want to follow the tutorial

Do you want to run a workflow ‘human in the loop’ or to discover the tools one by one?

Run a predefined workflow

Hands On: Run a public workflow
  1. Click on Workflows on the vertical panel on the left of the Galaxy action list
  2. At the top of the resulting page you will have the option to switch between the My workflows, Workflows shared with me and Public workflows tabs. Select the tab you want to see all workflows in that category..
  3. Search for Argo-Glider Nitrate QCV.

Select workflow

  1. Click on the workflow name: a pop-up window opens with a preview of the workflow
  2. To run it directly: click Run (top-right)

  3. Recommended: click Import (left of Run) to make your own local copy under Workflows / My Workflows

You may have to refresh your history to see the queued jobs

  • When you click Run, a panel asks you for the input files:
    • Select the Argo files and climatology uploaded during the data management phase
    • Once filled in, click Run again

Select workflow.

Is the workflow not available on your Galaxy instance? You can upload the workflow yourself as follows:

  1. Click on galaxy-workflows-activity Workflows in the Galaxy activity bar (on the left side of the screen, or in the top menu bar of older Galaxy instances). You will see a list of all your workflows
  2. Click on galaxy-upload Import at the top-right of the screen
  3. Provide your workflow
    • Option 1: Paste the URL of the workflow into the box labelled “Archived Workflow URL”
    • Option 2: Upload the workflow file in the box labelled “Archived Workflow File”
  4. Click the Import workflow button

Below is a short video demonstrating how to import a workflow from GitHub using this procedure:

Video: Importing a workflow from URL

Workflow human in the loop

Now that your workflow is running, remember it contains both non-interactive and interactive steps. For the interactive tools (ODV), user action is required.

  • When the workflow arrives at the ODV interactive step for qualification:
    • Open ODV
    • Apply QC following
    • Export history
  • Once the qualification step is finished and the history exported, close ODV. The workflow will then continue automatically.

  • When the workflow arrives at the ODV interactive step for validation:
    • Open ODV again
    • Follow the validation phase as described in

Expand the ODV left panel (3 vertical dots) to access the clipboard. Paste text into ODV forms via this clipboard. Click outside the panel to collapse it.

ODV clipboard.

If ODV becomes grey with the red banner ODV - Disconnected, refresh the tab.

Warning: ODV execution limit

ODV has a 24h time limit. If you launch a workflow and wait longer than 1 day before opening ODV, the workflow will continue automatically without your qualification step.

Available public workflows

Workflow to perform Qualification, Nitrate Calibration, and Validation of one Argo float or one Glider. Uses Neural Network + Climatology. Copy it before running and adjust calibration parametrizations if necessary.

Tool by tool

Warning: Before starting

Before running the QCV procedure tool by tool, be sure that your environment (see Prerequisites and Manage your data) is ready!

Harmonize your data

The original files must be pre-processed in order to use the same tools regardless of their origin. This is performed by the tool QCV harmonizer.

This tool:

  • harmonizes vocabulary and format
  • aggregates multiple files from the same platform into one single file

Its outputs:

  • a single NetCDF file named ###_harm.nc (since version 3.0)

This tool is currently operational for Argo floats, World Ocean Atlas, and Gliders. It should be run as many times as there are datasets/platforms.

Hands On: Run QCV harmonizer
  1. Finding your tool
    • Click on Tools just below Upload on the left panel
    • Search for QCV harmonizer in the search bar
    • Select the tool. Its configuration page opens.
  2. QCV harmonizer ( Galaxy version 3.0+galaxy1) with the following parameters:
    • In Input the NetCDF data files:
      • If using a dataset collection (e.g. 4903881): Select Dataset collection → click on Select Value → choose 4903881
      • If using WOA: Select Single dataset → click on Select Value → choose woa###.nc
    • Click Run

QCV harmonizer tool.

If your file is not listed in the selection window, you can drag and drop it from the history panel into the Select value box.

Qualify your data

Create the ODV collection

The visualization tools for the qualification, the validation and the extraction or reporting of the user actions include ODV software, the ODV collection manager and the ODV history manager.

This tool creates:

  • ODV spreadsheet collections from harmonized file(s)
  • ODV view for nitrate qualification

Capabilities:

  • maps input QC flags with expected output QC flag scale
  • subsets reference datasets (option: 1=subset, 0=no, -1=inverse)

Default parameters:

  • operator name = anonymous
  • QC convention = ARGO QC flag scale
  • Subsetting = 1 (yes, even if no reference data)
  • plt = 0 (no plot)

Outputs:

  • odv_collection.txt
  • qualification_startingPoint_nitrate.xview
  • log file (YYYY-MM-DDTHHMM_galaxy_odv-coll-manager_QV.log)
Hands On: Run ODV collection manager
  1. Finding your tool
    • Click on Tools in the left panel
    • Search for ODV collection manager
    • Select the tool. Its configuration page opens
  2. ODV collection manager ( Galaxy version 1.3+galaxy2) with the following parameters:
    • For Input raw data:
      • Select Multiple Datasets → click Select Value → choose 4903881_harm.nc
    • (Optional) Add a harmonized WOA reference dataset
    • (Optional) Change default parameters: operator name, QC convention, subsetting, plt
    • Click Run

Outputs:

  • ODV spreadsheet collection odv_collection.txt
  • View qualification_startingPoint_nitrate.xview
  • Log file with actions

ODV collection manager.

Launch ODV automatically

ODV is a software for visualizing and/or qualifying scientific data. It can be used locally or directly on the Galaxy Ocean Data View.

Hands On: Run ODV interactive tool
  1. Finding your interactive tool
    • Click on Tools or Interactive Tools in the left panel
    • Search ODV interactive tool
    • Select the tool. Its configuration page opens
  2. Ocean Data View with the following parameters:
    • Ensure the tool version is v5.8_1 (change via the 3 cubes on the left if needed)
    • Set parameters:
      • Load data automatically: Yes
      • NetCDF or tabular text file: select ODV collection manager output or drag and drop odv_collection.txt
      • Do you have a view?: Yes → drag and drop qualification_startingPoint_nitrate.xview
    • Click Run

ODV launch.

  • When ODV is ready, a red dot appears on Interactive Tools
  • Click it → open the ODV panel → click the expand symbol
  • ODV opens in a new window with your selected view

ODV interactive tool. ODV nitrate view.

After some inactivity, the ODV window may close. Just refresh the browser tab.

Open the clipboard (3 dots on the left panel) to copy-paste text. Clipboard in ODV.

Qualify the dataset with ODV

Use ODV features to apply QC flags and edits as needed. Refer to the ODV guide.

Export history

Hands On: Export history
  1. Export to your Galaxy history
    • In ODV: Export > History
    • Save as default name history_from_odv_collection.txt in working/Documents/ODV/galaxy/outputs
    • Do not change the default filename
    • Close ODV (File > Exit)

Outputs:

  • Galaxy collection ODV all outputs (zip, history, odv_collection)
  • File ODV history extracted

(optional) Report QC & Data changes

Run ODV history manager to report QC flags and/or data changes into the harmonized dataset before calibrating the nitrate sensor.

This tool extracts QC changes (EDITFLAGS) and data edits (EDITDATA) from the ODV history and reports them into the harmonized file(s).

It manages 2 types of ODV history files:

  • synthetic (manual export)
  • extended (automatic export, not yet available in Galaxy)

Default parameters:

  • QC convention = ARGO QC flag scale
  • ODV convention changes: EDITFLAGS, EDITDATA

Outputs:

  • ###_harm_qced.nc (extended NetCDF files including changes)
  • ###_harm_history.csv (summary of user changes)
  • log file (YYYY-MM-DDTHHMM_galaxy_odv-history-manager_history.log)
Hands On: Run ODV history manager
  1. Find your tool
    • Click on Tools just below Upload on the left panel
    • Search ODV history manager in the search bar
    • Select the tool
    • Its configuration page opens in the center panel
  2. ODV history manager ( Galaxy version 1.2+galaxy2) with the following parameters:

    Depending on your ODV spreadsheet file, select the appropriate inputs:

    Case 1: using ODV collection manager output

    • Input NetCDF data: harmonized file(s) ###_harm.nc created by QCV harmonizer
    • Input history text file: ODV history extracted (txt file exported from ODV after changes in Export history)
    • Input ODV file: ODV collection manager output (the ODV spreadsheet collection created by ODV collection manager)

    Case 2: using odv_collection.txt

    • Input NetCDF data: harmonized file(s) ###_harm.nc created by QCV harmonizer
    • Input history text file: history_from_odv_collection.txt (txt file inside ODV all outputs)
    • Input ODV file: odv_collection.txt (spreadsheet collection created by ODV collection manager and opened in ODV for QC changes)

    (Optional) In Tool Parameters, you may change the default configuration:

    • QC convention for the ODV output file: select one of the QC flag scales understood by ODV
    • ODV convention regarding changes performed on the ODV collection:
      • EDITFLAGS = report QC flag changes from history to the data collection
      • EDITDATA = report data edits from history to the data collection

ODV history manager parametrization.

You can unselect either EDITFLAGS or EDITDATA if you want to report only QC changes or only data changes. However, you cannot unselect both. If you do, the backend will actually apply both edit types.

Warning: Check the logs

At the end of the process, Galaxy distributes the tool outputs into 3 collections:

  • ODV history manager netcdf collection (extended harmonized NetCDFs, ###_harm_qced.nc)
  • ODV history manager csv collection (CSV summaries, ###_harm_history.csv)
  • ODV history manager log files (with YYYY-MM-DDTHHMM_galaxy_odv-history-manager_history.log)

Always check the log file (click the galaxy-eye icon) to ensure that changes were correctly reported. If issues persist, make sure filenames match the step Export history conventions.

All these outputs will appear in your History (green status). They are now ready for the next step: Biogeochemical Calibration.

Calibrate your data

Nitrate

Hands On: (optional) Choose Reference dataset
  • Upload WOA nitrate climatology (NetCDF) from your computer
  • Harmonize it with QCV harmonizer as in the step Export history

Currently, the tool Biogeochemical calibration is operational for nitrate.

Hands On: Run Biogeochemical calibration
  1. Find your tool
    • Click on Tools → search Biogeochemical calibration
  2. BioGeoChemical calibration ( Galaxy version 2.1+galaxy2) with the following parameters:
    • Input harmonized netcdf data: ###_harm_qced.nc (created by QCV harmonizer + ODV history manager)
    • Input reference data (optional): WOA harmonized NetCDF file if you want to estimate reference nitrate from climatology
    • (Optional) Change default QC flag configuration (ARGO QC flag scale by default)
    • (Optional) Add calibration parametrizations:
      • Single regression
      • CanyonB (default)
      • Climatology / WOA
      • Custom P value with user-defined breakpoints
    • Run the tool

Calibration tool.

By default, the tool calibrates nitrate using:

  • Automatic evaluation of reference pressure
  • CanyonB as reference method
  • Linear regression for correction
  • ARGO QC flag convention

Default QC flag mapping:

  • Bad Data QC = 4
  • Raw QC value to replace = 3
  • Replacement QC value = 1 (adjusted value)

Possible configurations:

  • User-defined reference pressure
  • Canyon-med or WOA as reference method
  • Multi-linear regression with automatic or user-defined breakpoints

Outputs per platform/configuration:

  • NetCDF file(s): ###_C{param}-nitrate.nc
  • Diagnostic figures (up to 4):
    • ###_..._LinearModel.png
    • ###_..._Rsquared.png
    • ###_..._Zvar_ref_detailed.png
    • ###_..._Zvar_ref_results.png
  • Log file: YYYY-MM-DDTHHMM_galaxy_odv-calibration-methods_nitrate_c.log
  • Reproduce default method: set Multiple linear regressionSingle linear regression
  • Add canyon B automatic: leave default
  • Add custom method:
    • Bypass: user defines P → enter 1000 dbar
    • Multiple linear regression: N = number of breakpoints
    • Reference method: Climatology / WOA / Annual (Ensure WOA harmonized climatology is included in inputs)

If activated, the tool also updates original Argo NetCDF files with DM information and produces BD Argo files.

  • Requires:
    • Original raw NetCDF data (dataset collection 4903881)
    • CSV file from ODV history manager
    • Institution, BGC contact point, DM operator info
  • If info is missing or inconsistent with original files, BD files are not delivered.

Outputs:

  • ###_C{param}-nitrate.tar.gz (all BD files ready for DAC)

Validate your data

Compare adjusted values with ODV:

Hands On: Validation with ODV
  1. Re-run ODV collection manager including all calibrated NetCDFs from the step (optional) Report QC & Data changes

    1. Expand one of the output datasets of the tool (by clicking on it)
    2. Click re-run galaxy-refresh the tool

    This is useful if you want to run the tool again but with slightly different paramters, or if you just want to check which parameter setting you used.

  2. Launch ODV interactive tool again for visualization
  3. Inspect calibrated vs. raw/reference values

You may validate in two ways:

Case 1: without external datasets

  • Create ODV collection with calibrated NetCDFs
  • Launch ODV without a view
  • ODV opens with new collection ready for validation

Case 2: with external datasets

  • Add an external dataset (ODV spreadsheet, ODV collection, or NetCDF supported by ODV)
  • Create ODV collection with calibrated + external data
  • Launch ODV without automatic load and without view
  • Import datasets manually via File > New
  • Map metadata and variables when prompted
  • Repeat for each external dataset
  • ODV is now ready for side-by-side comparison

Import first the ODV spreadsheet collection created by ODV collection manager. Then import external datasets such as webODV Glodap.

Extra information

Coming up soon even more tutorials on and other Earth-System related trainings. Keep an galaxy-eye open if you are interested!