Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, Barcelona)

DESCRIPTION

Course: Bioinformatics for Biomedical Research (2014). Session 2.2: Introduction to Galaxy, a web-based genome analysis platform. Statistics and Bioinformatics Unit (UEB) and High Technology Unit (UAT), Vall d'Hebron Research Institute (www.vhir.org), Barcelona.

Page 1: Introduction to Galaxy (UEB-UAT Bioinformatics Course - Session 2.2 - VHIR, Barcelona)

1

Vall d’Hebron Institut de Recerca (VHIR)

Alex Sánchez

15/05/2014

Institut d’Investigació Sanitària acreditat per l’Instituto de Salud Carlos III (ISCIII)

Introduction to Galaxy

A web-based genome analysis platform

BIOINFORMATICS FOR BIOMEDICAL RESEARCH

2

• Galaxy overview and Interface

• Getting Data in Galaxy

• Analyzing Data in Galaxy

– Quality Control

– Mapping Data

• History and workflow

• Galaxy Exercises

NGS Analysis Using Galaxy

3

What is Galaxy

• Galaxy is an open-source framework for integrating various computational tools and databases into a cohesive workspace.

But it can also be seen as

• A web-based service, integrating many popular tools and resources for comparative genomics.

And also

• A completely self-contained application for building your own Galaxy style sites.

4

http://galaxyproject.org

5

Galaxy Conceptual Framework

6

Galaxy Interface Sections

The left column contains links to the downloading, preparation and analysis tools.

The center column is where the menus and data will appear.

The right column shows you the history of your analysis steps, allows you to view data and results, and more.

Register User

7

Getting Data

Click Get Data

8

Getting Data: Table Browser

Get Table Main

9

Getting Data: UCSC Table Browser

Get Output

clade: Mammal

genome: Human

assembly: [current]

group: Genes and…

track: UCSC Genes

table: knownGene

region: position, chrX

Output format: BED, and check "Send output to Galaxy"
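
The BED output requested on this slide is a tab-separated text format whose first three columns are chromosome, start (0-based) and end (exclusive). The following is a minimal sketch of how such records could be read once they are in the history. The function name `parse_bed` and the sample rows are made up for illustration; they are not actual knownGene entries.

```python
from typing import Iterator, NamedTuple


class BedRecord(NamedTuple):
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # exclusive
    name: str


def parse_bed(lines) -> Iterator[BedRecord]:
    """Yield one record per BED data line, skipping header and blank lines."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith(("#", "track", "browser")):
            continue
        fields = line.split("\t")
        # The name column is optional in BED; use "." when it is absent.
        name = fields[3] if len(fields) > 3 else "."
        yield BedRecord(fields[0], int(fields[1]), int(fields[2]), name)


# Hypothetical records, for illustration only.
example = [
    "track name=example",
    "chrX\t100\t250\tgeneA",
    "chrX\t400\t900\tgeneB",
]
records = list(parse_bed(example))
lengths = [r.end - r.start for r in records]
print(lengths)
```

Because the end coordinate is exclusive, the length of a feature is simply end minus start.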

10

Getting Data: Upload File

Upload File

Execute

File Format

Species

Upload or paste file

11

Getting Data: Upload File

Specify multiple URLs in the "URL / Text" box

12

• Sequences and Alignment Format

• Galaxy overview and Interface

• Getting Data in Galaxy

• Analyzing Data in Galaxy

– Text Manipulation tools

– Filter and Sort

– Operate on Genomic Intervals

– Quality Control

– Mapping Data

• History and workflow

• Galaxy Exercises

NGS Analysis Using Galaxy

13

Text Manipulation Tools

14

Filter and Sort

15

Operate on Genomic Intervals

16

Fasta Manipulation

17

Analyzing Data: Next Generation Sequencing

18

Analyzing Data: Next Generation Sequencing

FASTQ file manipulation, such as format conversion, summary statistics, trimming reads and filtering reads by quality score…
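
To make the trimming and quality-filtering steps concrete, here is a minimal sketch in plain Python. It assumes Sanger-encoded FASTQ (Phred+33), strict four-line records, and an arbitrary threshold of 20; the function names and the sample reads are hypothetical and do not reproduce the behaviour of any particular Galaxy tool.

```python
def phred_scores(quality, offset=33):
    """Convert a Sanger (Phred+33) quality string into integer scores."""
    return [ord(ch) - offset for ch in quality]


def read_fastq(lines):
    """Yield (header, sequence, quality) from strict 4-line FASTQ records."""
    it = iter(lines)
    for header in it:
        seq = next(it)
        next(it)  # the '+' separator line carries no data we need
        qual = next(it)
        yield header.strip(), seq.strip(), qual.strip()


def trim_3prime(seq, qual, min_q=20):
    """Remove trailing bases whose quality score is below min_q."""
    scores = phred_scores(qual)
    end = len(scores)
    while end > 0 and scores[end - 1] < min_q:
        end -= 1
    return seq[:end], qual[:end]


def mean_quality(qual):
    scores = phred_scores(qual)
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical reads: 'I' encodes Q40 and '#' encodes Q2 in Phred+33.
example = [
    "@read1", "ACGTACGT", "+", "IIIIII##",
    "@read2", "ACGTACGT", "+", "########",
]
kept = []
for header, seq, qual in read_fastq(example):
    seq, qual = trim_3prime(seq, qual)
    # Keep only reads that are non-empty and of good average quality.
    if seq and mean_quality(qual) >= 20:
        kept.append((header, seq))
print(kept)
```

In this sketch the first read loses its two low-quality trailing bases, and the second read is discarded because nothing remains after trimming.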

19

Analyzing Data: Next Generation Sequencing

Input: sanger FASTQ

Output: SAM format
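
The SAM output of the mapping step is tab-separated text with eleven mandatory columns (QNAME, FLAG, RNAME, POS, MAPQ, CIGAR, RNEXT, PNEXT, TLEN, SEQ, QUAL); header lines start with `@`, and bit 0x4 of FLAG marks an unmapped read. The sketch below shows one way such lines could be inspected. The helper names and the sample alignments are invented for illustration.

```python
def parse_sam_line(line):
    """Split one SAM alignment line into a dict of selected mandatory fields."""
    fields = line.rstrip("\n").split("\t")
    return {
        "qname": fields[0],
        "flag": int(fields[1]),
        "rname": fields[2],
        "pos": int(fields[3]),  # 1-based leftmost mapping position
        "mapq": int(fields[4]),
        "cigar": fields[5],
        "seq": fields[9],
        "qual": fields[10],
    }


def is_mapped(record):
    """A read is mapped when FLAG bit 0x4 (segment unmapped) is not set."""
    return not (record["flag"] & 0x4)


# Hypothetical SAM content, for illustration only.
example = [
    "@HD\tVN:1.0\tSO:unsorted",
    "read1\t0\tchrX\t1000\t37\t8M\t*\t0\t0\tACGTACGT\tIIIIIIII",
    "read2\t4\t*\t0\t0\t*\t*\t0\t0\tACGTACGT\t########",
]
records = [parse_sam_line(l) for l in example if not l.startswith("@")]
mapped = [r["qname"] for r in records if is_mapped(r)]
print(mapped)
```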

20

Analyzing Data: Next Generation Sequencing

21

• Sequences and Alignment Format

• Galaxy overview and Interface

• Getting Data in Galaxy

• Analyzing Data in Galaxy

– Quality Control

– Mapping Data

• History and workflow

• Galaxy Exercises

NGS Analysis Using Galaxy

22

Copyright OpenHelix. No use or reproduction without express written consent.

History: History Options

List saved histories and shared histories.

Work on the current history, create a new one, clone, share, create a workflow, set permissions, show deleted datasets or delete the history.

23

Workflow

Creating a workflow allows the user to repeat an analysis using different datasets.
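
The idea of a reusable workflow can be sketched outside Galaxy as an ordered list of steps applied unchanged to any input. This is only a conceptual analogy in plain Python, not Galaxy's workflow format or API; the step functions and the two small datasets are hypothetical.

```python
def drop_comments(lines):
    """Step 1: remove comment lines."""
    return [l for l in lines if not l.startswith("#")]


def keep_chrx(lines):
    """Step 2: keep only intervals located on chrX."""
    return [l for l in lines if l.split("\t")[0] == "chrX"]


def sort_by_start(lines):
    """Step 3: sort intervals by their start coordinate."""
    return sorted(lines, key=lambda l: int(l.split("\t")[1]))


# The "workflow" is just the ordered list of steps.
WORKFLOW = [drop_comments, keep_chrx, sort_by_start]


def run_workflow(steps, dataset):
    """Feed the output of each step into the next one."""
    for step in steps:
        dataset = step(dataset)
    return dataset


# Two hypothetical datasets processed by the same workflow.
dataset_a = ["#comment", "chrX\t500\t600", "chr1\t10\t20", "chrX\t100\t200"]
dataset_b = ["chrX\t30\t40", "chr2\t5\t9"]
result_a = run_workflow(WORKFLOW, dataset_a)
result_b = run_workflow(WORKFLOW, dataset_b)
print(result_a)
print(result_b)
```

The steps are defined once and then reused, which is the same benefit a saved workflow gives when a new dataset arrives.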

24

• Sequences and Alignment Format

• Galaxy overview and Interface

• Getting Data in Galaxy

• Analyzing Data in Galaxy

– Quality Control

– Mapping Data

• History and workflow

• Galaxy Exercises

NGS Analysis Using Galaxy