2  Register An Account with Galaxy

2.1 Introduction

In this workshop you will be working on Galaxy, an open source scientific workflow and data integration platform (Galaxy Community (2024)). Galaxy is a framework for performing reproducible computational biology and bioinformatics, democratising the field so that scientists with no programming or other computing experience can still carry out bioinformatic analyses on their own data, in their web browser.

The Galaxy software is open source and can be downloaded and installed freely on any computer that it can run on. Each such installation is called a Galaxy instance. The Galaxy instance that you will be working on is the EU Galaxy instance at https://usegalaxy.eu/. This is the largest Galaxy instance in Europe and gives access to over 2500 scientific tools. It is free to use, and is maintained by the Galaxy team at Freiburg.

There are many other Galaxy instances, some private, and some public and free for anyone to use, such as:

Galaxy can be thought of as a library of bioinformatics tools and, just as different libraries may contain differing collections of books, each Galaxy instance may make available different sets of tools, or different versions of similar tools. As a result some analyses may be possible on one Galaxy instance, but not another.

When you use Galaxy it is important to be clear when writing up your analysis methods to be clear about which Galaxy instance is used, and which version of each tool was used in the analysis.

2.1.1 Workflows

The key concept in Galaxy is the workflow. A workflow is a series, or chain, of analysis steps that takes some input data (one or more files or datasets) through a series of actions, to generate one or more result files. Typically, the output of one step will serve as the input to the next step, but more complicated branching workflows can also be created (Figure 2.1).

Galaxy workflow editor view of a Galaxy workflow that takes input data and passes it through a series of actions. The links between datasets and actions are shown as noodles on the diagram.
Figure 2.1: Galaxy workflow editor view of a Galaxy workflow that runs the same tool (RXLR Motifs) with three different settings, and compares the results using a Venn diagram (reproduced from Cock et al. (2013)). The workflow runs from left to right. A single Input dataset is fed into the RXLR Motifs tool three times, applying a different method each time. Each run of the RXLR Motifs tool generates a different set of output sequences, which are passed as input to the Filter tool. The output from all three runs of the Filter tool is passed to the Venn Diagram tool, along with the original input dataset, to generate a Venn Diagram

Using Galaxy, you will build up a powerful bioinformatic analysis workflow to analyse your data, step-by-step, using a web browser. Galaxy allows you to save and share your workflows, to encourage reuse and reproducibility.

2.2 Registering with Galaxy

Register with usegalaxy.eu

The first action you should take in this workshop, if you do not already have an account at https://usegalaxy.eu/, is to create an account at https://usegalaxy.eu/.

  1. Go to the Galaxy login/registration page
  2. If you have a usegalaxy.eu account already, then log in using your details and move on to the workshop proper.
  3. If you do not already have a usegalaxy.eu account, then click on the Register here link.
  4. Enter your university email address in the Email address field.
  5. Choose a secure password and enter it into the Password and Confirm password fields.
  6. Choose a public name to show (other people will be able to see this) and enter it in the Public name field.
  7. Click on the Create button.
Important

You will receive a validation email at the account you register with. You must confirm your email to proceed.