Visualise amount of objects in Museum Collection

digital-humanities-open-refine-tutorial/visualise-amount-of-objects-in-museum-collection

Author(s)
Daniela Schneider
version Version
1
last_modification Last updated
Sep 24, 2025
license License
CC-BY-4.0
galaxy-tags Tags
openrefine
culturedata
4culture
visualisation
tabulardata
textcleaning

Features
Tutorial
hands_on OpenRefine Tutorial for researching cultural data

Workflow Testing
Tests: ✅
Results: Not yet automated
FAIRness purl PURL
https://gxy.io/GTN:
RO-Crate logo with flask Download Workflow RO-Crate
Launch in Tutorial Mode question
galaxy-download Download
flowchart TD
  0["ℹ️ Input Dataset\nInput"];
  style 0 stroke:#2c3143,stroke-width:4px;
  1["ℹ️ Input Dataset\nstop_words_english"];
  style 1 stroke:#2c3143,stroke-width:4px;
  2["Cut"];
  0 -->|output| 2;
  3["Filter Tabular"];
  2 -->|out_file1| 3;
  4["Column Regex Find And Replace"];
  3 -->|output| 4;
  5["Sort"];
  4 -->|out_file1| 5;
  6["Remove beginning"];
  5 -->|outfile| 6;
  7["Datamash"];
  6 -->|out_file1| 7;
  8["Sort"];
  7 -->|out_file| 8;
  9["Bar chart"];
  8 -->|outfile| 9;
  204d5ba9-e4d0-4bfd-a421-43b71f361bc9["Output\nout_file1"];
  9 --> 204d5ba9-e4d0-4bfd-a421-43b71f361bc9;
  style 204d5ba9-e4d0-4bfd-a421-43b71f361bc9 stroke:#2c3143,stroke-width:4px;
  10["Sort"];
  8 -->|outfile| 10;
  11["Select first"];
  10 -->|outfile| 11;
  12["Cut"];
  11 -->|outfile| 12;
  13["Parse parameter value"];
  12 -->|out_file1| 13;
  14["Compose text parameter value"];
  13 -->|text_param| 14;
  15["Search in textfiles"];
  0 -->|output| 15;
  14 -->|out1| 15;
  16["Cut"];
  15 -->|output| 16;
  17["Generate a word cloud"];
  1 -->|output| 17;
  16 -->|out_file1| 17;
  db4e9713-11f9-4f7b-b2a6-f0661e1f46f8["Output\noutput"];
  17 --> db4e9713-11f9-4f7b-b2a6-f0661e1f46f8;
  style db4e9713-11f9-4f7b-b2a6-f0661e1f46f8 stroke:#2c3143,stroke-width:4px;

Inputs

Input Label
Input dataset Input
Input dataset stop_words_english

Outputs

From Output Label
barchart_gnuplot Bar chart
toolshed.g2.bx.psu.edu/repos/bgruening/wordcloud/wordcloud/1.9.4+galaxy2 Generate a word cloud

Tools

Tool Links
Cut1
Remove beginning1
barchart_gnuplot
param_value_from_file
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.5+galaxy2
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_head_tool/9.5+galaxy2
toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_sort_header_tool/9.5+galaxy2
toolshed.g2.bx.psu.edu/repos/bgruening/wordcloud/wordcloud/1.9.4+galaxy2
toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regexColumn1/1.0.3 View in ToolShed
toolshed.g2.bx.psu.edu/repos/iuc/compose_text_param/compose_text_param/0.1.1 View in ToolShed
toolshed.g2.bx.psu.edu/repos/iuc/datamash_ops/datamash_ops/1.9+galaxy0
toolshed.g2.bx.psu.edu/repos/iuc/filter_tabular/filter_tabular/3.3.1 View in ToolShed

To use these workflows in Galaxy you can either click the links to download the workflows, or you can right-click and copy the link to the workflow which can be used in the Galaxy form to import workflows.

Importing into Galaxy

Below are the instructions for importing these workflows directly into your Galaxy server of choice to start using them!
Hands On: Importing a workflow
  1. Click on galaxy-workflows-activity Workflows in the Galaxy activity bar (on the left side of the screen, or in the top menu bar of older Galaxy instances). You will see a list of all your workflows
  2. Click on galaxy-upload Import at the top-right of the screen
  3. Provide your workflow
    • Option 1: Paste the URL of the workflow into the box labelled “Archived Workflow URL”
    • Option 2: Upload the workflow file in the box labelled “Archived Workflow File”
  4. Click the Import workflow button

Below is a short video demonstrating how to import a workflow from GitHub using this procedure:

Video: Importing a workflow from URL

Version History

Version Commit Time Comments
1 7d6b8c507 2025-09-19 09:25:40 Added workflow files with a test. Updated the answer history.

For Admins

Installing the workflow tools

wget https://training.galaxyproject.org/training-material/topics/digital-humanities/tutorials/open-refine-tutorial/workflows/Visualise-amount-of-objects-in-Museum-Collection.ga -O workflow.ga
workflow-to-tools -w workflow.ga -o tools.yaml
shed-tools install -g GALAXY -a API_KEY -t tools.yaml
workflow-install -g GALAXY -a API_KEY -w workflow.ga --publish-workflows