Analyse HeLa fluorescence siRNA screen

Overview
Creative Commons License: CC-BY Questions:
  • How do I analyze a HeLa fluorescence siRNA screen?

  • How do I segment cell nuclei?

  • How do I extract features from segmentations?

  • How do I filter segmentations by morphological features?

  • How do I apply a feature extraction workflow to a screen?

  • How do I visualize feature extraction results?

Objectives:
  • How to segment cell nuclei in Galaxy.

  • How to extract features from segmentations in Galaxy.

  • How to filter segmentations by morphological features in Galaxy.

  • How to extract features from an imaging screen in Galaxy.

  • How to analyse extracted features from an imaging screen in Galaxy.

Requirements:
Time estimation: 1 hour
Level: Intermediate Intermediate
Supporting Materials:
Published: Aug 13, 2019
Last modification: Nov 7, 2024
License: Tutorial Content is licensed under Creative Commons Attribution 4.0 International License. The GTN Framework is licensed under MIT
purl PURL: https://gxy.io/GTN:T00180
rating Rating: 5.0 (0 recent ratings, 2 all time)
version Revision: 12

This tutorial shows how to segment and extract features from cell nuclei Galaxy for image analysis. As example use case, this tutorial shows you how to compare the phenotypes of PLK1 threated cells in comparison to a control. The data used in this tutorial is available at Zenodo.

RNA interference (RNAi) is used in the example use case for silencing genes by way of mRNA degradation. Gene knockdown by this method is achieved by introducing small double-stranded interfering RNAs (siRNA) into the cytoplasm. Small interfering RNAs can originate from inside the cell or can be exogenously introduced into the cell. Once introduced into the cell, exogenous siRNAs are processed by the RNA-induced silencing complex (RISC).The siRNA is complementary to the target mRNA to be silenced, and the RISC uses the siRNA as a template for locating the target mRNA. After the RISC localizes to the target mRNA, the RNA is cleaved by a ribonuclease. RNAi is widely used as a laboratory technique for genetic functional analysis. RNAi in organisms such as C. elegans and Drosophila melanogaster provides a quick and inexpensive means of investigating gene function. Insights gained from experimental RNAi use may be useful in identifying potential therapeutic targets, drug development, or other applications. RNA interference is a very useful research tool, allowing investigators to carry out large genetic screens in an effort to identify targets for further research related to a particular pathway, drug, or phenotype.

The example used in this tutorial deals with PLK1 knocked down cells. PLK1 is an early trigger for G2/M transition. PLK1 supports the functional maturation of the centrosome in late G2/early prophase and establishment of the bipolar spindle. PLK1 is being studied as a target for cancer drugs. Many colon and lung cancers are caused by K-RAS mutations. These cancers are dependent on PLK1.

Agenda

In this tutorial, we will deal with:

  1. Getting data
  2. Create feature extraction workflow
  3. Apply workflow to screen
  4. Plot feature extraction results
  5. Conclusion

Getting data

The dataset required for this tutorial contains a screen of DAPI stained HeLa nuclei (more information). We will use a sample image from this dataset for training basic image processing skills in Galaxy.

Hands-on: Data upload
  1. If you are logged in, create a new history for this tutorial

    To create a new history simply click the new-history icon at the top of the history panel:

    UI for creating new history

  2. Import galaxy-upload the following dataset from Zenodo or from the data library (ask your instructor).
    • Important: Choose the type of data as zip.
    https://zenodo.org/record/3362976/files/B2.zip
    
    • Copy the link location
    • Click galaxy-upload Upload Data at the top of the tool panel

    • Select galaxy-wf-edit Paste/Fetch Data
    • Paste the link(s) into the text field

    • Press Start

    • Close the window

    As an alternative to uploading the data from a URL or your computer, the files may also have been made available from a shared data library:

    1. Go into Data (top panel) then Data libraries
    2. Navigate to the correct folder as indicated by your instructor.
      • On most Galaxies tutorial data will be provided in a folder named GTN - Material –> Topic Name -> Tutorial Name.
    3. Select the desired files
    4. Click on Add to History galaxy-dropdown near the top and select as Datasets from the dropdown menu
    5. In the pop-up window, choose

      • “Select history”: the history you want to import the data to (or create a new one)
    6. Click on Import

  3. Unzip ( Galaxy version 6.0+galaxy0) with the following parameters:
    • param-file “input_file”: B2.zip
    • “Extract single file”: Single file
    • “Filepath”: B2--W00026--P00001--Z00000--T00000--dapi.tif
  4. Rename galaxy-pencil the dataset to testinput.tif

    • Click on the galaxy-pencil pencil icon for the dataset to edit its attributes
    • In the central panel, change the Name field
    • Click the Save button

  5. Unzip ( Galaxy version 6.0+galaxy0) with the following parameters:
    • param-file “input_file”: B2.zip
    • “Extract single file”: All files
  6. Rename galaxy-pencil the resulting collection to control

    1. Click on the collection
    2. Click on the name of the collection at the top
    3. Change the name
    4. Press Enter

  7. Import galaxy-upload the following dataset from Zenodo or from the data library (ask your instructor).
    • Important: Choose the type of data as zip.
      https://zenodo.org/record/3362976/files/B3.zip
      
    • Copy the link location
    • Click galaxy-upload Upload Data at the top of the tool panel

    • Select galaxy-wf-edit Paste/Fetch Data
    • Paste the link(s) into the text field

    • Press Start

    • Close the window

    As an alternative to uploading the data from a URL or your computer, the files may also have been made available from a shared data library:

    1. Go into Data (top panel) then Data libraries
    2. Navigate to the correct folder as indicated by your instructor.
      • On most Galaxies tutorial data will be provided in a folder named GTN - Material –> Topic Name -> Tutorial Name.
    3. Select the desired files
    4. Click on Add to History galaxy-dropdown near the top and select as Datasets from the dropdown menu
    5. In the pop-up window, choose

      • “Select history”: the history you want to import the data to (or create a new one)
    6. Click on Import

  8. Unzip ( Galaxy version 6.0+galaxy0) to extract the zipped screen:
    • param-file “input_file”: B3.zip
    • “Extract single file”: All files
  9. Rename galaxy-pencil the collection to PLK1
  10. Upload galaxy-upload the following segmentation filter rules as a new pasted file (format: tabular):
    	area	eccentricity
    min	500	0.
    max	100000	0.5
    
    • Click galaxy-upload Upload Data at the top of the tool panel
    • Select galaxy-wf-edit Paste/Fetch Data at the bottom
    • Paste the file contents into the text field
    • Change Type from “Auto-detect” to tabular* Press Start and Close the window

  11. Rename galaxy-pencil dataset to rules

    • Click on the galaxy-pencil pencil icon for the dataset to edit its attributes
    • In the central panel, change the Name field
    • Click the Save button

Create feature extraction workflow

First, we will create and test a workflow which extracts mean DAPI intensity, area, and major axis length of cell nuclei from an image.

Hands-on: Create feature extraction workflow
  1. Filter 2-D image ( Galaxy version 1.12.0+galaxy1) with the following parameters to smooth the image:
    • param-file “Input image”: testinput.tif file
    • “Filter type”: Gaussian
    • “Sigma”: 3
  2. Threshold image ( Galaxy version 0.18.1+galaxy3) with the following parameters to segment the image:
    • param-file “Input image”: output of Filter 2-D image ( Galaxy version 1.12.0+galaxy1)
    • “Thresholding method”: Globally adaptive / Otsu
  3. Convert binary image to label map ( Galaxy version 0.5+galaxy0) with the following parameters to split touching objects:
    • param-file “Binary image”: output of Threshold image ( Galaxy version 0.18.1+galaxy3)
    • “Mode”: Watershed transform
    • “Minimum distance between two objects”: 20
  4. Extract image features ( Galaxy version 0.18.1+galaxy0) with the following parameters to extract features from the segmented objects:
    • param-file “Label map”: output of Convert binary image to label map ( Galaxy version 0.5+galaxy0)
    • “Use the intensity image to compute additional features”: No intensity image
    • “Select features to compute”: Select features
    • “Available features”:
      • param-check Label from the label map
      • param-check Area
      • param-check Eccentricity
      • param-check Major axis length
  5. Filter label map by rules ( Galaxy version 0.0.1-4) with the following parameters to filter the label map from 3. with the extracted features and a set of rules:
    • param-file “Label map”: output of Convert binary image to label map ( Galaxy version 0.5+galaxy0)
    • param-file “Features”: output of Extract image features ( Galaxy version 0.18.1+galaxy0)
    • param-file “Rules”: rules file
  6. Extract image features ( Galaxy version 0.18.1+galaxy0) with the following parameters to extract features the final readout from the segmented objects:
    • param-file “Label map”: output of Filter label map by rules ( Galaxy version 0.0.1-4)
    • “Use the intensity image to compute additional features”: Use intensity image
    • param-file “Intensity image”: testinput.tif file
    • “Select features to compute”: Select features
    • “Available features”:
      • param-check Mean Intensity (requires original image)
      • param-check Area
      • param-check Major axis length
  7. Now we can extract the workflow for batch processing:
    • Name it “feature_extraction”.
    • Don’t treat B2.zip and B3.zip as inputs (the workflow is supposed to be applied to the images directly).
    • Exclude Unzip ( Galaxy version 6.0+galaxy0) by unchecking the tool (3 times).
    1. Clean up your history: remove any failed (red) jobs from your history by clicking on the galaxy-delete button.

      This will make the creation of the workflow easier.

    2. Click on galaxy-gear (History options) at the top of your history panel and select Extract workflow.

      `Extract Workflow` entry in the history options menu

      The central panel will show the content of the history in reverse order (oldest on top), and you will be able to choose which steps to include in the workflow.

    3. Replace the Workflow name to something more descriptive.

    4. Rename each workflow input in the boxes at the top of the second column.

    5. If there are any steps that shouldn’t be included in the workflow, you can uncheck them in the first column of boxes.

    6. Click on the Create Workflow button near the top.

      You will get a message that the workflow was created.

  8. Edit the workflow you just created:
    • Select “Input dataset” from the list of tools. The step param-file 8: Input Dataset appears.
    • Change the “Label” of param-file 8: Input Dataset to input image.
    • Change the “Label” of param-file 1: rules to filter rules.
    • Connect the output of param-file 8: input image to the input of tool 2: Filter 2-D image.
    • Connect the output of param-file 8: input image to the “Intensity image” input of tool 7: Extract image features.
    • Mark the results of tool 6: Filter label map by rules and tool 7: Extract image features as the primary outputs of the workflow (by clicking on the checkboxes of the outputs).

The resulting workflow should look something like this:

feature extraction workflow. Open image in new tab

Figure 1: Feature extraction workflow.

Apply workflow to screen

Now we want to apply our extracted workflow to a series of images and merge the results. For this purpose, we create a workflow which uses the previously created workflow as a sub-workflow.

Hands-on: Create screen analysis workflow
  1. Create a new workflow in the workflow editor.

    1. Click Workflow on the top bar
    2. Click the new workflow galaxy-wf-new button
    3. Give it a clear and memorable name
    4. Clicking Save will take you directly into the workflow editor for that workflow
    5. Need more help? Please see the How to make a workflow subsection here

  2. Select “Input dataset collection” from the list of tools. The step param-collection 1: Input Dataset Collection appears in your workflow. Change the “Label” of this step to input images.
  3. Add the input dataset param-file 2: rules to your workflow (select “Input dataset” from the list of tools and set the “Label” of the newly created step to rules).
  4. Add the workflow feature_extraction workflow as a sub-workflow:
    • Expand the “Workflows” section in the list of tools and select “feature_extraction” to add it to the workflow.
    • Connect the output of param-file 1: input images to the “input image” input of workflow 3: feature_extraction.
    • Connect the output of param-file 2: rules to the “filter rules” input of workflow 3: feature_extraction.
  5. Create the step tool 4: Collapse Collection in the workflow (by choosing “Collapse Collection” from the list of tools).
    • Connect the output “output (tabular)” of workflow 3: feature_extraction to tool 4: Collapse Collection.
    • Set “Keep one header line” of tool 4: Collapse Collection: Yes
    • Set “Prepend File name” of tool 4: Collapse Collection: No
    • Mark the output of tool 4: Collapse Collection as the primary workflow output.
  6. Save your workflow and name it analyze_screen.

The resulting workflow should look something like this:

screen analysis workflow. Open image in new tab

Figure 2: Full screen analysis workflow.
Hands-on: Run screen analysis workflow
  1. Run the “analyze_screen” workflow on the control screen using the rules file.

    • Click on Workflow on the top menu bar of Galaxy. You will see a list of all your workflows.
    • Click on the workflow-run (Run workflow) button next to your workflow
    • Configure the workflow as needed
    • Click the Run Workflow button at the top-right of the screen
    • You may have to refresh your history to see the queued jobs

  2. Run the “analyze_screen” workflow on the PLK1 screen using the rules file.

Plot feature extraction results

Finally, we want to plot the results for better interpretation.

Hands-on: Plot feature extraction results
  1. Click on the Visualize galaxy-barchart icon of the tool 4: Collapse Collection results.
  2. Run Box plot (jqPlot) with the following parameters:
    • “Provide a title”: Screen features
    • “X-Axis label”:
    • “Y-Axis label”:
    • “1: Data series”:
      • “Provide a label”: Mean intensity
      • “Observations”: Column 1
    • “2: Data series”:
      • “Provide a label”: Area
      • “Observations”: Column 2
    • “3: Data series”:
      • “Provide a label”: Major axis length
      • “Observations”: Column 3
    Question

    Plot the feature distribution of PLK1 and control. What differences do you observe between the screens?

    The phenotype of PLK1 threated cells show a higher mean intensity and a shorter major axis in comparison to the control.

One of the resulting plots should look something like this:

feature extraction results box plot.

Conclusion

In this exercise you imported images into Galaxy, segmented cell nuclei, filtered segmentations by morphological features, extracted features from segmentations, scaled your workflow to a whole screen, and plotted the feature extraction results using Galaxy.