48
‘How to prepare, cluster and sequence an NGS library’ AN OVERVIEW OF NGS IN THE GENOMICS CORE Introduction Understanding library prep Understanding clustering and sequencing Understanding instruments NGS QC NGS applications

How to cluster and sequence an ngs library (james hadfield160416)

Embed Size (px)

Citation preview

Page 1: How to cluster and sequence an ngs library (james hadfield160416)

‘How to prepare, cluster and sequence an NGS library’

AN OVERVIEW OF NGS IN THE GENOMICS CORE– Introduction– Understanding library prep– Understanding clustering and sequencing– Understanding instruments– NGS QC– NGS applications

Page 2: How to cluster and sequence an ngs library (james hadfield160416)

A potted history of Illumina sequencing

Page 3: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

Adapter ligation

End-repairAdenylation

BioAnalyserqPCRPCR

“The next ten slides are the most important I’ll show today “

Page 4: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

Page 5: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

Page 6: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

Page 7: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

Page 8: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

Page 9: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

Page 10: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

Page 11: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

Page 12: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

Page 13: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

– Text

Page 14: How to cluster and sequence an ngs library (james hadfield160416)

Illumina adapters ask for Illumina letter!

ACACTCTTTCCCTACACGACGCTCTTCCGATCT

ADAPTER

PCR PRIMER

SEQ PRIMER

AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT

CAAGCA

GAAGAC

GGCATA

CGAGCTCTTCCGATCT

Insert DNA

Index

PCR

Index SEQ

ACACTC

TTTCCC

TACACG

ACGCTCTTCCGATCT

InsertDNA A

||||||||||InsertDNAACTCGTATGCCGTCTTCTGCTTG P-GATCGGAAGAG

ACACTCTTTCCCTACACGACG CTCTTCCGATCT T

||||||||||

CTCGTATGCCGTCTTCTGCTTGP-GATCGGAAGAG

ACACTC

TTTCCC

TACACG

ACGCTCTTCCGATCT

T||||||||||

Oligonucleotide sequences © Illumina, Inc. All rights reserved.

Page 15: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep

“The next ten slides are the most important I’ll show today “

Page 16: How to cluster and sequence an ngs library (james hadfield160416)

The library prep spike

[DN

A]

Illumina Processing

Page 17: How to cluster and sequence an ngs library (james hadfield160416)

Understanding library prep – Nextera!

– Text

Page 18: How to cluster and sequence an ngs library (james hadfield160416)

Understanding cluster generation (2500 etc)

Page 19: How to cluster and sequence an ngs library (james hadfield160416)

Understanding cluster generation (2500 etc)

A) Diluted & denatured libraries are annealed to lawn oligos at their 3’ end, and a polymerase creates a covalently attached copy of the library molecule.B) The original strand is removed by denaturation with NaOH.C) In non-denaturing conditions the library molecule bends and hybridises to a lawn oligo complementary to the 5’ end, and a polymerase creates a second covalently attached molecule. This amplification is repeated to create a cluster with around 1000 copies of the original library molecule.

A B C

Page 20: How to cluster and sequence an ngs library (james hadfield160416)

Understanding cluster generation (2500 etc)

D E C G H

D) Clusters are linearized by cleavage at the 3’ end of the original library molecule, and denaturation leaves the single stranded DNA which will be sequenced. A sequencing primer is hybridised* and sequencing-by-synthesis generates the first read in your fastq file.-) For single-end indexing the the SBS template is removed by denaturation, and the index 1 sequencing primer is hybridised ready to generate index1 (i7). Dual-indexing is complicated and differs on single- or paired-end flowcells but the process is essentially the same to generate index two (i5).E-G) For paired-end sequencing the SBS template is removed by denaturation, the cluster is re-amplified for several cycles, cleaved at the 5’ end the paired-end sequencing primer hybridised ready to generate read 2.

*Beware: if you create new adapters let us know if you need a custom sequencing primer

Page 21: How to cluster and sequence an ngs library (james hadfield160416)

Understanding cluster generation (X Ten & 4000)Exclusion Amplification

The same hybridisation and solid-surface amplification occurs but in an all-in-one phase called “exclusion amplification” (ExAmp). Once a library molecule “lands” in a well it should occupy it completely.

Page 22: How to cluster and sequence an ngs library (james hadfield160416)

Understanding cluster generation (X Ten & 4000)Exclusion Amplification

Page 23: How to cluster and sequence an ngs library (james hadfield160416)

Understanding sequencing

Page 24: How to cluster and sequence an ngs library (james hadfield160416)

Understanding sequencing: Sanger-seq

Page 25: How to cluster and sequence an ngs library (james hadfield160416)

Understanding sequencing: Pyro-seq

Page 26: How to cluster and sequence an ngs library (james hadfield160416)

Understanding sequencing: Sequencing-by-synthesis

Page 27: How to cluster and sequence an ngs library (james hadfield160416)

Understanding “sequencing by synthesis”

Page 28: How to cluster and sequence an ngs library (james hadfield160416)

Understanding “sequencing by synthesis”

Instrument “colours”

HiSeq, MiSeq 4-colour SBS

NextSeq 2-colour SBS

Firefly 1-colour SBS?

Page 29: How to cluster and sequence an ngs library (james hadfield160416)

Instruments explained – HiSeq 2500 & 4000

Page 30: How to cluster and sequence an ngs library (james hadfield160416)

Different sequencing configurations

2500 Rapid150M readsSE 50bp

85%Q30PE 250bp

75%Q30PE 150 2 days

2500 High output250M readsSE 50bp

85%Q30PE 125bp

80%Q30PE 125 6 days

4000 High output312M readsSE 50bp

85%Q30PE 150bp

75%Q30PE 150 3 days

Page 31: How to cluster and sequence an ngs library (james hadfield160416)

HiSeq 4000 considerations

CLUSTERING IS VERY DIFFERENT FROM 2500– PE150 - >125 is not great*– %Q30 “passes Illumina spec”*

– ExAmp duplicates*– Need to consider how you handle duplicates

– RNA-seq is fine– Exome-seq is fine– Genomes are fine

Page 32: How to cluster and sequence an ngs library (james hadfield160416)

Instruments explained - MiSeq

~600bp fragments

+/- 50bp overlap

300bp reads

Page 33: How to cluster and sequence an ngs library (james hadfield160416)

Instruments explained - NextSeq

Page 34: How to cluster and sequence an ngs library (james hadfield160416)

NGS QC – library prepQUALITY CONTROL OF LIBRARIES IS IMPORTANT.

TITRATION FLOWCELLS AND FAILED RUNS ARE EXPENSIVE.TRY TO IDENTIFY ISSUES BEFORE RUNNING ANY LANES.QC IS SPECIFIC TO YOUR SAMPLES.

QUANTITATION OF LIBRARIES IS IMPORTANT.SOME QC CAN ONLY BE DONE ONCE YOU HAVE GENERATED DATA

Good

Bad

Bioanalyser qPCR Analysis

Page 35: How to cluster and sequence an ngs library (james hadfield160416)

NGS QC – FastQC

Page 36: How to cluster and sequence an ngs library (james hadfield160416)

NGS QC – MGA

Page 37: How to cluster and sequence an ngs library (james hadfield160416)

NGS QC – MGALIBRARY QC – CONTAMINANT DETECTION

SAMPLE 100,000 READS FROM FASTQREADS TRIMMED TO 36BPALIGN TO MULTIPLE GENOMES USING BOWTIE

LIBRARY QC – ADAPTER DETECTIONSAMPLE 100,000 READS FROM FASTQREADS CONVERTED TO FASTAALIGN TO “ADAPT-OME” USING EXONERATE

LIBRARY QC- YIELDCOUNT NUMBER OF READS (SINGLE-END ONLY)DISPLAY NUMBER ON A PRE-DEFINED SCALEDISPLAY LANES IN FLOWCELL CONFIGURATION

Page 38: How to cluster and sequence an ngs library (james hadfield160416)

NGS QC – MGA

Page 39: How to cluster and sequence an ngs library (james hadfield160416)

NGS QC – MGA

Page 40: How to cluster and sequence an ngs library (james hadfield160416)

The Genomics Core sequencing services

James Hadfield NEB March 2016

Page 41: How to cluster and sequence an ngs library (james hadfield160416)

The Genomics Core sequencing services

Page 42: How to cluster and sequence an ngs library (james hadfield160416)

The Genomics Core sequencing services

Page 43: How to cluster and sequence an ngs library (james hadfield160416)

Service metrics Jan 2016

– TAT has been 2-3 weeks (often as little as 1 week)– Most sequencing works very well, but…

Page 44: How to cluster and sequence an ngs library (james hadfield160416)

The Genomics Core sequencing services

Page 45: How to cluster and sequence an ngs library (james hadfield160416)

The Genomics Core sequencing services

Page 46: How to cluster and sequence an ngs library (james hadfield160416)

NGS methods

Page 47: How to cluster and sequence an ngs library (james hadfield160416)

A genomic case report

Page 48: How to cluster and sequence an ngs library (james hadfield160416)

A genomic case report

NFKBIA S32G

SIFT: deleterious(0)PolyPhen: probably_damaging(0.979)