33
Blueprint for NACST/Sim Blueprint for NACST/Sim 2002.10.25 신신신

Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab, Main concept Implementing Hybridization first. Then, ligation,

Embed Size (px)

Citation preview

Page 1: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

Blueprint for NACST/Sim Blueprint for NACST/Sim

2002.10.25신수용

Page 2: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Main conceptMain concept

Implementing Hybridization first. Then, ligation, PCR, and so on..

Can be implemented easily based on hybridization concept.

그외 .. Gel electrophoresis is easy. Bead seperation 은 아직 고려하지 않음

Page 3: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Main conceptMain concept

Probabilistic model!

Sequence Collision Probability Tube 에서 sequence 들이 어떻게 분포하고 있으

며 , 어떤 sequence 들이 서로 만날 것인가에 대한 모델

Sequence Hybridization Probability 서로 만난 sequence 들의 결합가능성 판단

Page 4: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Collision modelCollision model

Based on Artificial Chemistry Artificial Chemistries: A Review, P. Dittrich, J. Ziegl

er, and W. Banzhaf Multi-Agent Systems inspired by Artificial Chemistri

es: A Case Study in Automated Theorem Proving, J. Bush and W. Banzhaf

Page 5: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Collision model (AC)Collision model (AC)

현재의 DNAC 를 위해서는 초기에 입력은 전부 주어지며 , 출력도 없는 단순한 모델로 충분 .

Sequence 들은 uniform distribution 에 근거하여 확률적으로 선택됨

Hybridization 이 되어 긴 sequence 가 새로 생기면 추가적인 input 으로 가정

Page 6: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Hybridization modelHybridization model

Based on Sequence Alignment Algorithms Dynamic Programming!

Smith-Waterman Algorithm(?) 좀 더 빠른 알고리즘을 원함 .! 독일에서 만든 알고리즘은 시퀀스 2 개 주고

결과를 얻기 위해서는 대략 5-9 초 정도의 시간이 필요 .

DNASIM: 1998, Jens Niehaus

Blast 도 동일한 알고리즘을 사용

Page 7: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Hybridization model (DP)Hybridization model (DP)

Biological sequence analysis: Probabilistic models of proteins and nucleic acids, R. Dublin, S. R. Eddy, A. Krogh, and G. Mitchison

Introduction to Algorithms, T. H. Cormen, C. E. Leiserson, and R. L. Rivest

Page 8: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Hybridization model (DP)Hybridization model (DP)

Sequence matching table(?) 이 중요 .

Free enery A T G C -A X O X X XT O X X X XG X X X O XC X X O X X- X X X X X

Page 9: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor model for Nearest-neighbor model for TmTm

)4/|ln(| Tm CRS

HT

R : Boltzmann’s constant (1.987 cal/(K mol))[C]] : total molar strand concentrationT : Kelvin

3.269][7.01

][log6.16

)4/|ln(| 10

Na

Na

CRS

HT

Tm

[Na+] concentrations different from 1M

Page 10: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor model for Nearest-neighbor model for TmTm

}{

}{

stackskkinitends

stackskkinitends

SSSS

HHHH

• Self-complement : [CT]/4 [CT]/2• The strands are not in equimolar concentration, but one strand is present in gross excess over the other : [CT]/4 [CT]• mismatched base pairs : “virtual stacks” (see references)

Page 11: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: Perfect WC basepairing: Perfect WC basepairing

Page 12: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

ExampleExample

GCTAGC at 0.1mM

H = 2(-11.1) + 2(-6.1) + (-6.3) = -40.7 kcal/molS = 2(-28.4) + 2(-16.1) + (-18.5) – 5.9 –1.4 = -114.8eu

CK

molKcalmolKcal

molcalTm

6.328.305

)10ln() /(998.1) /(8.114

/407004

Page 13: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: GT mismatch: GT mismatch Thermodynamics and NMR of Internal G.T Mis

matches in DNA, H. T. Allawi and J. SantaLucia, Jr., Biochemistry 1997, 36, 10581-10594

Page 14: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: GT mismatch: GT mismatch

Page 15: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: GA mismatch: GA mismatch Nearest Neighbor Thermodynamic Parameters

for Internal GA Mismatches in DNA, H. T. Allawi, J. SantaLucia, Jr., Biochemistry 1998, 37, 2170-2179

Page 16: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: GA mismatch: GA mismatch

Page 17: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: CT mismatch: CT mismatch Thermodynamics of internal CT mismatches in

DNA, H. T. Allawi and J. SantaLucia, Jr., Nucleic Acids Research, 1998, 26(11): 2694-2701

Page 18: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: CT mismatch: CT mismatch

Page 19: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: AC mismatch: AC mismatch Nearest-Neighbor Thermodynamics of Internal

AC Mismatches in DNA: Sequence Dependence and pH Effects, H. T. Allawi and J. SantaLucia, Jr., Biochemistry, 1998, 37, 9435-9444

Page 20: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: AC mismatch: AC mismatch

Page 21: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: AA, CC, GG, and TT : AA, CC, GG, and TT mismatchmismatch Nearest-Neighbor Thermodynamics and NMR

of DNA sequences with Internal AA, CC, GG, and TT Mismatches, N. Peyret, P. A. Senevirante, H. T. Allawi, and J. SantaLucia, Jr., Biochemistry 1999, 38, 3468-3477

Page 22: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: AA, CC, GG, and TT : AA, CC, GG, and TT mismatchmismatch

Page 23: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: Single base bulge: Single base bulge The Effect of Base Sequence on the Stability o

f RNA and DNA Single Base Bulges, J. Zhu and R. M. Wartell, Biochemistry 1999, 38, 15986-15993

Page 24: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: Single base bulge: Single base bulge

Page 25: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: Hairpin: Hairpin Melting Studies of Short DNA Hairpins: Influen

ce of Loop Sequence and Adjoining Base Pair Identity on Hairpin Thermodynamic Stability, P. M. Vallone, T. M. Paner, and J. Hilario, M. J. Lane, B. D. Faldasz, and A. S. Benight, Biopolymers, 50, 425-442 (1999)

Page 26: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Page 27: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: Dangling ends: Dangling ends Thermodynamic parameters for DNA sequence

s with dangling ends, S. Bommarito, N. Peyret, and J. SantaLucia, Jr., Nucleic Acids Research, 2000, 28(9): 1929-1934

Page 28: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

Nearest-neighbor dataNearest-neighbor data: Dangling ends: Dangling ends

Page 29: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

EtcEtc

Ionic condition… Differences between DNA Base Pair Stacking Energ

ies Are Conserved over a Wide Range of Ionic Conditions, T. Johnson, J, Zhu, and R. M. Wartell, Biochemistry 1998, 37, 12343-12350

Page 30: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

For implementationFor implementation

We need to know HOW MANY MOLECULES ARE NECESSARY? HOW MUCH TIME IS NECESSARY?

Page 31: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

HOW MANY MOLECULES ARE HOW MANY MOLECULES ARE NECESSARY? (V) NECESSARY? (V) i.e. v1, v2, v3, v4, e1, e2, e3

e1 : v1->v2, e2 : v2->v3, e3 : v3->v4

1. v1, v2, e1 -> p1

2. p1, v3, e2 -> p2

3)(

1

)()()( EVEVEVEV NNNNV

V

NNV

V

NNV

V

3

2

)2)((2)(2)(2)(

1

EVEVEVEV NNV

V

NNV

V

NNV

V

NNV

Page 32: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

HOW MANY MOLECULES ARE HOW MANY MOLECULES ARE NECESSARY? (V)NECESSARY? (V)3. p2, v4, e3 -> p3

3

2

)4)((4)(4)(4)(

1

EVEVEVEV NNV

V

NNV

V

NNV

V

NNV

0)2)(()(

1 1

1

2

3

EN

i EVEV iNNV

V

NNP

적정한 양 ?

Page 33: Blueprint for NACST/Sim 2002.10.25 신수용. © 2002, SNU BioIntelligence Lab,  Main concept Implementing Hybridization first. Then, ligation,

© 2002, SNU BioIntelligence Lab, http://bi.snu.ac.kr/

HOW MUCH TIME IS HOW MUCH TIME IS NECESSARY? (x)NECESSARY? (x)

0)2)(()(

1 1

1

2

3

EN

i EVEV ixNNV

V

NNP