Foundations of Data Science

These lessons will help you get your feet in data science and give you tools to help you slice and dice your data into results.

Material

You can view the tutorial materials in different languages by clicking the dropdown icon next to the slides (slides) and tutorial (tutorial) buttons below.

Data Manipulation Olympics

These tutorials take the same dataset, and show you how to slice and dice it in numerous languages

Lesson Slides Hands-on Recordings Input dataset Workflows
Data Manipulation Olympics
Data visualisation Olympics - Visualization in R
Data Manipulation Olympics - SQL
Data Manipulation Olympics - JQ

Bash

These lessons focus on using the Command Line Interface (CLI) and Bash for processing data.

Lesson Slides Hands-on Recordings Input dataset Workflows
Advanced CLI in Galaxy
CLI Educational Game - Bashcrawl
CLI basics

Python

These lessons focus on coding in Python

Lesson Slides Hands-on Recordings Input dataset Workflows
Introduction to Python
Advanced Python
Plotting in Python

Python (Modular)

These lessons cover coding in Python except they attempt to be very modular, having few interdependencies, allowing you to re-arrange or insert them in your existing course curriculum as needed.

Lesson Slides Hands-on Recordings Input dataset Workflows
Python - Math
Python - Functions
Python - Basic Types & Type Conversion
Python - Lists & Strings & Dictionaries
Python - Flow Control
Python - Loops
Python - Files & CSV
Python - Try & Except
Python - Introductory Graduation
Conda Environments For Software Development
Python - Argparse
Python - Testing
Python - Type annotations
Virtual Environments For Software Development
Python - Globbing
Python - Subprocess
Python - Coding Style
Python - Multiprocessing

R

These lessons focus on coding in R

Lesson Slides Hands-on Recordings Input dataset Workflows
R basics in Galaxy
Advanced R in Galaxy
R
dplyr & tidyverse for data processing

SQL

Here you'll learn to use SQL to query databases and extract meaningful results from them.

Lesson Slides Hands-on Recordings Input dataset Workflows
Advanced SQL
Introduction to SQL
SQL Educational Game - Murder Mystery
SQL with Python
SQL with R

Scientific Workflow Management Systems

Here you can find brief tutorials on a couple different workflow engines that you might encounter in the wild.

Lesson Slides Hands-on Recordings Input dataset Workflows
Make & Snakemake

Introduction to Genomics

A data- and galaxy-driven introduction to genomics

Lesson Slides Hands-on Recordings Input dataset Workflows

Other

Assorted Tutorials

Lesson Slides Hands-on Recordings Input dataset Workflows
Basics of using Git from the Command Line
One protein along the UniProt page
Variant Calling Workflow
Version Control with Git

Frequently Asked Questions

Common questions regarding this topic have been collected on a dedicated FAQ page . Common questions related to specific tutorials can be accessed from the tutorials themselves.

Follow topic updates rss-feed with our RSS Feed

Community Resources

Community Home Maintainer Home

Editorial Board

This material is reviewed by our Editorial Board:

orcid logoHelena Rasche avatar Helena Rascheorcid logoSaskia Hiltemann avatar Saskia Hiltemannorcid logoFotis E. Psomopoulos avatar Fotis E. Psomopoulosorcid logoBérénice Batut avatar Bérénice Batutorcid logoYvan Le Bras avatar Yvan Le Bras

Contributors

This material was contributed to by:

Yongbin Li avatar Yongbin Liorcid logoFotis E. Psomopoulos avatar Fotis E. Psomopoulosorcid logoToby Hodges avatar Toby HodgesBazante Sanders avatar Bazante Sandersorcid logoNate Coraor avatar Nate Coraororcid logoHelena Rasche avatar Helena Rascheorcid logoBjörn Grüning avatar Björn Grüningorcid logoSimon Gladman avatar Simon Gladmanorcid logoSaskia Hiltemann avatar Saskia Hiltemannorcid logoPeter van Heusden avatar Peter van HeusdenDonny Vrins avatar Donny VrinsAnton Nekrutenko avatar Anton Nekrutenkoorcid logoSofoklis Keisaris avatar Sofoklis KeisarisMaria Christina Maniou avatar Maria Christina Maniouorcid logoBeatriz Serrano-Solano avatar Beatriz Serrano-Solanoorcid logoCristóbal Gallardo avatar Cristóbal GallardoNU Knight Lab avatar NU Knight LabMarius van den Beek avatar Marius van den Beekorcid logoLisanna Paladin avatar Lisanna PaladinMateusz Kuzak avatar Mateusz Kuzakorcid logoAnthony Bretaudeau avatar Anthony Bretaudeauorcid logoBérénice Batut avatar Bérénice Batutorcid logoWendi Bacon avatar Wendi Baconorcid logoMartin Čech avatar Martin ČechTeresa Müller avatar Teresa Müller

Funding

These individuals or organisations provided funding support for the development of this resource