Markov Chain Monte Carlo (MCMC) for model and parameter identification
N. Pedroni, nicolapedroni@gmail.com
Multidisciplinary Course: Monte Carlo Simulation Methods for the Quantitative Analysis of Stochastic and Uncertain Systems
Problem statement: statistical inference

Real system (e.g., a mechanical component) → System model (e.g., Exponential), with parameters θ = {θ1, θ2, …, θn} (e.g., the failure rate).
Experimental data: x = {x1, x2, …, xN} (e.g., failure times).

On the basis of the experimental data x, estimate the (unknown) parameters θ of the (known) model, obtaining the estimate θ̂.
Statistical inference – Example: component with constant failure rate

SYSTEM MODEL (hypothesis): Exponential distribution for the component failure time, f(t) = λe^(−λt) = θe^(−θt).

EXPERIMENTAL DATA (from real components): x = {x1, x2, …, xN} = t = {t1, t2, …, tN} = {1.71 h, 48.8 h, …, 0.39 h} (N = 100 failure times from N = 100 'identical' components).

MODEL PARAMETER θ? (= FAILURE RATE λ?)
Statistical inference: classical (frequentist) approach

• Only the (random) data are used to estimate the parameter, e.g. by MAXIMUM LIKELIHOOD ESTIMATION:

$$\hat{\theta} = \hat{\lambda} = \frac{N}{\sum_{i=1}^{N} t_i} = \frac{\text{Number of collected failures}}{\text{Total observed time}} = \frac{100}{3.05 \cdot 10^{3}\ \text{h}} = 3.28 \cdot 10^{-2}\ \text{h}^{-1}$$

x = {x1, x2, …, xN} = t = {t1, t2, …, tN} (N = 100 failure times)

(NB: confidence intervals can also be computed, reflecting the variability in the random data)
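The maximum-likelihood estimate above takes only a few lines to reproduce. A minimal sketch in Python, using a synthetic dataset in place of the slides' actual failure times (the true rate 0.01 1/h is an assumption for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for the slides' dataset: N = 100 failure times from an
# Exponential distribution with an assumed true rate of 0.01 1/h.
times = rng.exponential(scale=1.0 / 0.01, size=100)

# MLE of the failure rate: number of collected failures / total observed time
lambda_hat = len(times) / times.sum()
```

With more data the estimate concentrates around the true rate; the confidence interval mentioned above shrinks accordingly.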
Statistical inference: the Bayesian approach

• In addition to the (random) data, it uses the analyst's degree of belief (state of knowledge) about the true value of the parameter (Bayesian interpretation of probability).
• The analyst's state of knowledge about the parameter before obtaining the data is represented by a (prior) probability distribution, p(θ) = p(λ).

x = t = {t1, t2, …, tN} (N = 100 failure times) + expert "prior" knowledge
[Figure: prior probability density of λ, peaked around λ = 0.01 h⁻¹]

COMBINATION TO OBTAIN THE "POSTERIOR" DISTRIBUTION p(θ|x) = p(λ|t)
Bayesian statistical inference: definitions

Definition of 'and': p(θ and x) = p(θ|x)p(x) = p(x|θ)p(θ)

Bayes' theorem:

$$p(\boldsymbol{\theta}\,|\,\mathbf{x}) = \frac{p(\mathbf{x}\,|\,\boldsymbol{\theta})\,p(\boldsymbol{\theta})}{p(\mathbf{x})} = \frac{p(\mathbf{x}\,|\,\boldsymbol{\theta})\,p(\boldsymbol{\theta})}{\int p(\mathbf{x}\,|\,\boldsymbol{\theta})\,p(\boldsymbol{\theta})\,d\boldsymbol{\theta}}$$

p(x|θ) = L(θ|x): likelihood of the parameters given the dataset (dependent on the functional form of the chosen model)
p(θ): prior distribution of the parameters (dependent on the a priori knowledge of the values of the parameters)
p(x): probability of the experimental dataset ("normalization" factor: requires the evaluation of a complex integral)
p(θ|x) = π(θ|x): posterior distribution of the parameters given the data (objective of the analysis)
Bayesian statistical inference – Example: component with constant failure rate

Exponential model for the failure times, f(t) = λe^(−λt) → "Exponential likelihood":

$$L(\theta\,|\,\mathbf{x}) = L(\lambda\,|\,\mathbf{t}) = \left[\lambda\exp(-\lambda t_1)\right]\cdot\left[\lambda\exp(-\lambda t_2)\right]\cdots\left[\lambda\exp(-\lambda t_N)\right] = \lambda^N \exp\!\left(-\lambda\sum_{i=1}^{N} t_i\right)$$

x = t = {t1, t2, …, tN} (N = 100 failure times) + expert "prior" knowledge, p(θ) = p(λ)
[Figure: prior density (peaked near λ = 0.01 h⁻¹), likelihood (peaked near λ = 0.03 h⁻¹), and resulting posterior π(λ|t)]
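Since λ is one-dimensional here, the prior–likelihood combination can also be evaluated directly on a grid. A minimal sketch, assuming synthetic data and a flat prior (the slides' actual prior is an informative expert prior):

```python
import numpy as np

rng = np.random.default_rng(1)
# Synthetic stand-in for the data: 100 failure times, assumed true rate 0.01 1/h
t = rng.exponential(scale=100.0, size=100)

# Exponential log-likelihood on a grid: log L(lam|t) = N log(lam) - lam * sum(t)
lam = np.linspace(1e-4, 0.05, 2000)
log_lik = len(t) * np.log(lam) - lam * t.sum()

# Posterior ∝ likelihood × prior; a flat prior is used here for simplicity,
# so the posterior is just the normalized likelihood
post = np.exp(log_lik - log_lik.max())
post /= post.sum() * (lam[1] - lam[0])   # numerical normalization (the p(x) factor)

lam_map = lam[np.argmax(post)]           # posterior mode (= MLE under a flat prior)
```

With the informative prior of the slides, one would add its log-density to `log_lik` before normalizing; the grid approach works only because the parameter space is one-dimensional, which is exactly why MCMC is introduced next.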
Bayesian statistical inference: Markov Chain Monte Carlo (MCMC) simulation approach

MCMC generates samples according to any desired probability distribution:
θ0 (randomly chosen value) + MCMC rules + data x → {θ1, θ2, …, θj, …, θNs}
If Ns is large enough, then {θ1, θ2, …, θj, …, θNs} ~ π(θ|x), independently of the initial point θ0.
MCMC simulation: the Metropolis-Hastings algorithm (1)

- Randomly choose an initial value θ0 = {θ1^0, θ2^0, …, θi^0, …, θn^0} for the Markov chain
- For j = 0, 1, …, Ns (chain length):
  1. Sample a proposal value θ' = {θ1', θ2', …, θi', …, θn'} from a proposal distribution K(θ'|θ^j)

SYMMETRIC PROPOSAL FOR CONVENIENCE: K(θi'|θi^j) = K(θi^j|θi') for each variable θi, i = 1, 2, …, n
[Figure: three examples of symmetric proposal densities centered at θi^j and at θi']
MCMC simulation: the Metropolis-Hastings algorithm (2)

  2. Calculate the acceptance probability α for the proposal value θ':

$$\alpha = \min\left\{1,\ \frac{\pi(\boldsymbol{\theta}'\,|\,\mathbf{x})\,K(\boldsymbol{\theta}^j\,|\,\boldsymbol{\theta}')}{\pi(\boldsymbol{\theta}^j\,|\,\mathbf{x})\,K(\boldsymbol{\theta}'\,|\,\boldsymbol{\theta}^j)}\right\}$$

Since $\pi(\boldsymbol{\theta}\,|\,\mathbf{x}) = \dfrac{L(\boldsymbol{\theta}\,|\,\mathbf{x})\,p(\boldsymbol{\theta})}{p(\mathbf{x})}$:

$$\alpha = \min\left\{1,\ \frac{L(\boldsymbol{\theta}'\,|\,\mathbf{x})\,p(\boldsymbol{\theta}')\,K(\boldsymbol{\theta}^j\,|\,\boldsymbol{\theta}')}{L(\boldsymbol{\theta}^j\,|\,\mathbf{x})\,p(\boldsymbol{\theta}^j)\,K(\boldsymbol{\theta}'\,|\,\boldsymbol{\theta}^j)}\right\}$$

NO NEED FOR NORMALIZATION FACTOR CALCULATION: p(x) cancels in the ratio.

  3. Accept/reject the proposal value θ':
     - Draw a uniform random number r ~ U[0, 1)
     - If r ≤ α, accept the proposal value θ', i.e. set θ^(j+1) = θ'; else reject it, i.e. set θ^(j+1) = θ^j
- End For (j = 0, 1, …, Ns)
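The loop above can be sketched as a generic random-walk Metropolis-Hastings sampler. This is a minimal illustration, not the slides' code; the Gaussian step size `step` is an assumed tuning parameter:

```python
import numpy as np

def metropolis_hastings(log_target, theta0, n_samples, step=0.5, rng=None):
    """Sample from an unnormalized target with a symmetric Gaussian proposal.

    log_target(theta) returns log[L(theta|x) p(theta)] up to a constant: with a
    symmetric K the proposal densities cancel, so alpha reduces to the ratio of
    the (unnormalized) targets -- no normalization factor is ever computed.
    """
    rng = np.random.default_rng() if rng is None else rng
    theta = np.atleast_1d(np.asarray(theta0, dtype=float))
    chain = np.empty((n_samples, theta.size))
    log_p = log_target(theta)
    for j in range(n_samples):
        proposal = theta + step * rng.standard_normal(theta.size)  # K(theta'|theta_j)
        log_p_new = log_target(proposal)
        alpha = np.exp(min(0.0, log_p_new - log_p))  # min{1, ratio}, in log space
        if rng.uniform() <= alpha:                   # r <= alpha: accept
            theta, log_p = proposal, log_p_new
        chain[j] = theta
    return chain
```

For example, `metropolis_hastings(lambda th: float(-0.5 * th @ th), [3.0], 20000)` targets a standard normal; after discarding a burn-in, the samples have mean near 0 and standard deviation near 1 regardless of the initial point.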
Application: component with constant failure rate

Exponential distribution for the component failure time, f(t) = λe^(−λt), with failure rate λ = constant.
Experimental dataset: x = {x1, x2, …, xN} = t = {t1, t2, …, tN} (N = 100 failure times)

Likelihood function (given by the chosen model):

$$L(\theta\,|\,\mathbf{x}) = L(\lambda\,|\,\mathbf{t}) = \left[\lambda\exp(-\lambda t_1)\right]\cdot\left[\lambda\exp(-\lambda t_2)\right]\cdots\left[\lambda\exp(-\lambda t_N)\right] = \lambda^N \exp\!\left(-\lambda\sum_{i=1}^{N} t_i\right)$$

Prior distribution ("uninformative"): p(θ) = p(λ) = U[0, λmax] = 1/λmax

Proposal distribution ("symmetric"): K(λ'|λ) = N(λ, σ); K(λ|λ') = N(λ', σ)

Acceptance probability:

$$\alpha = \min\left\{1,\ \frac{L(\lambda'\,|\,\mathbf{t})\,p(\lambda')\,K(\lambda\,|\,\lambda')}{L(\lambda\,|\,\mathbf{t})\,p(\lambda)\,K(\lambda'\,|\,\lambda)}\right\} = \min\left\{1,\ \frac{L(\lambda'\,|\,\mathbf{t})}{L(\lambda\,|\,\mathbf{t})}\right\} = \min\left\{1,\ \frac{\lambda'^N \exp\!\big(-\lambda'\sum_{i=1}^{N} t_i\big)}{\lambda^N \exp\!\big(-\lambda\sum_{i=1}^{N} t_i\big)}\right\}$$
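The acceptance probability above is best evaluated in log space, since λ^N exp(−λΣtᵢ) under- or overflows for realistic N. A sketch, assuming λmax = 0.1 as an illustrative prior bound (not a value from the slides):

```python
import numpy as np

def acceptance_prob(lam_new, lam_old, times, lam_max=0.1):
    """min{1, L(lam'|t)/L(lam|t)} for the exponential likelihood.

    The uniform prior and the symmetric Gaussian proposal cancel in the ratio;
    the prior only acts by rejecting proposals outside [0, lam_max].
    """
    if not 0.0 < lam_new < lam_max:
        return 0.0  # p(lam') = 0 outside the uniform prior's support
    n, total = len(times), float(np.sum(times))
    # log[ lam'^N exp(-lam' T) / (lam^N exp(-lam T)) ], T = total observed time
    log_ratio = n * (np.log(lam_new) - np.log(lam_old)) - (lam_new - lam_old) * total
    if log_ratio >= 0.0:
        return 1.0
    return float(np.exp(log_ratio))
```

Plugging this into the Metropolis-Hastings loop of the previous slide (accept when r ≤ α) yields samples from the posterior π(λ|t).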
Application: results

[Figures: posterior distribution of λ; convergence of the chain, showing the burn-in period]
GOOD AGREEMENT WITH "TRUE" VALUE, λ = 0.01
Problem statement

Presence of changepoints in model parameter θ. Assumption: step changes in model parameter θ.
[Figure: θ(t) piecewise constant, taking the values θ0, θ1, θ2, …, θk−1, θk on the intervals delimited by the changepoints s1, s2, …, sk, with smin = s0 and smax = sk+1]
Objectives of the Bayesian inference

• Location(s) of the changepoints: π(si|x), i = 1, 2, …, k
• Values of the parameter(s) before and after the transitions: π(θi|x), i = 0, 1, …, k
• Number k of changepoints: π(k|x)

THE NUMBER OF CHANGEPOINTS IS NOT KNOWN → IT IS AN UNCERTAIN VARIABLE!
Reversible-jump MCMC: prior distributions

Prior distribution p(k) for the number of changepoints k: (truncated) Poisson distribution,

$$p(k) = \frac{e^{-\lambda}\lambda^{k}}{k!}$$

Given k, prior distribution for the step positions s1, s2, …, sk: even-numbered order statistics of (2k + 1) points uniform in [s0, sk+1], with smin = s0 and smax = sk+1.

Given the step positions s1, s2, …, sk, prior distributions for the parameters θ0, θ1, …, θk: Gamma distribution,

$$p(\theta_i) = \frac{\beta^{\alpha}}{\Gamma(\alpha)}\,\theta_i^{\alpha-1}\,e^{-\beta\theta_i}$$
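The three priors above can be sampled directly. A sketch using the hyperparameters of the worked example as reconstructed here (λ = 1, kmax = 3, α = 3, β = 2, [smin, smax] = [0, 100]); treat these values as assumptions:

```python
import math
import numpy as np

def sample_priors(rng, lam=1.0, k_max=3, alpha=3.0, beta=2.0,
                  s_min=0.0, s_max=100.0):
    # Truncated Poisson prior on the number of changepoints k
    ks = np.arange(k_max + 1)
    pk = np.array([lam**int(j) / math.factorial(int(j)) for j in ks])
    k = int(rng.choice(ks, p=pk / pk.sum()))
    # Step positions: even-numbered order statistics of 2k+1 uniform points
    pts = np.sort(rng.uniform(s_min, s_max, size=2 * k + 1))
    s = pts[1::2]                     # order statistics 2, 4, ..., 2k
    # Gamma(alpha, rate beta) prior on the k+1 segment values theta_0 .. theta_k
    theta = rng.gamma(shape=alpha, scale=1.0 / beta, size=k + 1)
    return k, s, theta
```

The even-numbered order statistics keep the k positions sorted and discourage very short segments, which is the reason for sampling 2k + 1 points rather than k.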
Reversible-jump MCMC: possible random moves

Four possible random moves to explore the uncertain parameter space:
1. The parameter value θ is varied at a random location si
2. A randomly chosen step location si is moved
3. A new step location is created at random in the interval [s0, sk+1] (birth move)
4. A randomly chosen location si is eliminated (death move)
Reversible-jump MCMC - Random move 1: variation of the parameter value

A location is selected at random; θi is the "old" value of the parameter on [si, si+1] and θi' the proposed value.

Proposed value: $\log(\theta_i'/\theta_i) \sim U[-0.5,\,0.5] \;\Rightarrow\; \theta_i' = \theta_i \cdot e^{U}$

Acceptance probability:

$$\alpha_\eta = \min\left\{1,\ l\cdot\left(\frac{\theta_i'}{\theta_i}\right)^{\alpha} e^{-\beta(\theta_i'-\theta_i)}\right\}, \qquad l = \frac{L(\theta_i'\,|\,\mathbf{x})}{L(\theta_i\,|\,\mathbf{x})} = \text{likelihood ratio (dependent on the problem)}$$
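Move 1 in code, with the acceptance probability written exactly as above. The likelihood ratio l is passed in because it depends on the problem; α = 3, β = 2 are the Gamma-prior hyperparameters assumed from the worked example:

```python
import numpy as np

def propose_theta(theta_i, rng):
    """Multiplicative proposal: log(theta_i'/theta_i) ~ U[-0.5, 0.5]."""
    return theta_i * np.exp(rng.uniform(-0.5, 0.5))

def accept_prob_move1(theta_new, theta_old, lik_ratio, alpha=3.0, beta=2.0):
    """alpha_eta = min{1, l * (theta'/theta)^alpha * exp(-beta (theta' - theta))}."""
    ratio = (lik_ratio * (theta_new / theta_old) ** alpha
             * np.exp(-beta * (theta_new - theta_old)))
    return float(min(1.0, ratio))
```

The (θ'/θ)^α e^(−β(θ'−θ)) factor is the Gamma prior ratio combined with the Jacobian of the multiplicative proposal, which is why the exponent is α rather than α − 1.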
Reversible-jump MCMC - Random move 2: variation of a step location

A step location si (between si−1 and si+1, with parameter values θi−1 and θi on the adjacent intervals) is selected at random.

Proposed value: $s_i' \sim U[s_{i-1},\,s_{i+1}]$

Acceptance probability:

$$\alpha_\chi = \min\left\{1,\ l\cdot\frac{(s_{i+1}-s_i')(s_i'-s_{i-1})}{(s_{i+1}-s_i)(s_i-s_{i-1})}\right\}, \qquad l = \frac{L(s_i'\,|\,\mathbf{x})}{L(s_i\,|\,\mathbf{x})} = \text{likelihood ratio (dependent on the problem)}$$
Reversible-jump MCMC - Random move 3: "birth" of a new location

A new location s* is created at random; it splits the interval [si, si+1], replacing the old value θi with the two proposed values θi' and θi+1'.

Proposed values:
• $s^* \sim U[s_0,\,s_{k+1}]$ (and identify the interval [si, si+1] where it lies)
• $\dfrac{\theta_{i+1}'}{\theta_i'} = \dfrac{1-r}{r}, \quad r \sim U[0,1]$ (imposed)
• $(s^*-s_i)\,\theta_i' + (s_{i+1}-s^*)\,\theta_{i+1}' = (s_{i+1}-s_i)\,\theta_i$ (perturbation of θi)

Acceptance probability αb: very complex structure! → see (Green, 1995)
Reversible-jump MCMC - Random move 4: "death" of a location

A location si is selected at random and eliminated, "reversing" the "birth" move: the two adjacent intervals [si−1, si] and [si, si+1] (values θi−1 and θi) merge into [si−1, si+1] with the proposed value θi−1'.

Proposed value: $(s_i-s_{i-1})\,\theta_{i-1} + (s_{i+1}-s_i)\,\theta_i = (s_{i+1}-s_{i-1})\,\theta_{i-1}'$

Acceptance probability αd: very complex structure! → see (Green, 1995)
Reversible-jump MCMC: sequential steps of the algorithm (1)

1. Select prior values of the Markov chain:
   • Prior value for k (number of changepoints), from $p(k) = e^{-\lambda}\lambda^k/k!$
   • Prior positions of the changepoints: s1, s2, …, sk
   • Prior values of the parameter θ0, θ1, …, θk, from $p(\theta_i) = \frac{\beta^{\alpha}}{\Gamma(\alpha)}\theta_i^{\alpha-1}e^{-\beta\theta_i}$

2. Calculate the probabilities for each possible move:
   • Normalization: $\eta_k + \chi_k + b_k + d_k = 1$
   • "Physical" considerations: $\chi_0 = b_{k_{max}} = d_0 = 0$
   • $\eta_k = \chi_k$ if k ≠ 0, suggested by (Green, 1995) to obtain "good" chains (i.e., fast convergence and exploration of the parameter space)
   • $b_k = c\cdot\min\{1,\,p(k+1)/p(k)\}$, $d_k = c\cdot\min\{1,\,p(k-1)/p(k)\}$, with $b_k + d_k \le 0.9$
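Step 2 can be sketched as follows; c = 0.675 is the constant that makes bk + dk hit the 0.9 bound for the Poisson(λ = 1) prior of the worked example (both values reconstructed from the example's numbers, so treat them as assumptions):

```python
import math

def move_probabilities(k, lam=1.0, k_max=3, c=0.675):
    """Return (eta_k, chi_k, b_k, d_k) for a truncated Poisson(lam) prior p(k)."""
    def p(j):
        return lam**j / math.factorial(j)   # the e^{-lam} factor cancels in ratios
    b = c * min(1.0, p(k + 1) / p(k)) if k < k_max else 0.0   # b_{k_max} = 0
    d = c * min(1.0, p(k - 1) / p(k)) if k > 0 else 0.0       # d_0 = 0
    if k > 0:
        eta = chi = (1.0 - b - d) / 2.0                       # eta_k = chi_k
    else:
        eta, chi = 1.0 - b, 0.0                               # chi_0 = 0
    return eta, chi, b, d
```

For k = 2 this reproduces the probabilities used in the worked example below.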
Reversible-jump MCMC: sequential steps of the algorithm (2)

3. Sample a uniform random number r in [0, 1) and select one of the possible moves according to the probabilities computed in step 2 (the interval [0, 1) is partitioned into sub-intervals of lengths ηk, χk, bk and dk).

4. Generate proposal values according to the selected move:
   If move 1: variation of parameter value (θi')
   If move 2: variation of step location (si')
   If move 3: "birth" of a new location (s*, θi' and θi+1')
   If move 4: "death" of a location (si removed and θi−1' proposed)

5. Calculate the acceptance probability for the move(s) proposed in step 4 (i.e., αη, αχ, αb or αd).
Reversible-jump MCMC: sequential steps of the algorithm (3)

6. Sample a uniform random number u in [0, 1) and accept/reject the proposed move(s) according to the acceptance probability computed in step 5 (e.g., accept the "birth" if u < αb, reject it otherwise).

7. Update the values of the uncertain parameters in the chain: new number k of changepoints; new locations si, i = 1, 2, …, k, of the changepoints; new values θi, i = 0, 1, …, k, of the parameter in each interval.

8. Return to step 2 (i.e., update the values of the move probabilities ηk, χk, bk, dk and so on …).

9. Stop the algorithm when the number of iterations (i.e., the number of samples in the chain) reaches a predefined (large) value.
Reversible-jump MCMC: example (1)

- Iteration 0 (i.e., sample the prior values), with λ = 1 and kmax = 3:
  • From $p(k) = e^{-\lambda}\lambda^k/k!$ → k0 = 2 (prior number of changepoints)
  • Uniform random sampling of 2k0 + 1 = 5 values in [smin, smax] = [0, 100]: {61, 35, 10, 43, 87} → (sort) → {10, 35, 43, 61, 87} → even-numbered order statistics → s1^0 = 35, s2^0 = 61
  • From the Gamma prior $p(\theta_i) = \frac{\beta^{\alpha}}{\Gamma(\alpha)}\theta_i^{\alpha-1}e^{-\beta\theta_i}$ (α = 3, β = 2):
    θ1^0 = 4.04 if s ∈ [smin, s1^0]; θ2^0 = 1.51 if s ∈ [s1^0, s2^0]; θ3^0 = 5.75 if s ∈ [s2^0, smax]

Sample 0:  k0 = 2,  s1^0 = 35,  s2^0 = 61,  θ1^0 = 4.04,  θ2^0 = 1.51,  θ3^0 = 5.75
Reversible-jump MCMC: example (2)

- Iteration 1:
  • Calculate the probabilities for each possible move given k = 2 (using $\eta_k+\chi_k+b_k+d_k=1$, $\eta_k=\chi_k$, $b_k = c\min\{1, p(k+1)/p(k)\}$, $d_k = c\min\{1, p(k-1)/p(k)\}$, $b_k+d_k \le 0.9$):
    η2 = 0.0500,  χ2 = 0.0500,  b2 = 0.2250,  d2 = 0.6750
  • Sample one of the possible moves: r = 0.085 falls in the χ2 sub-interval of [0, 1) → VARIATION OF A STEP LOCATION
Reversible-jump MCMC: example (3)

  • Select at random one of the changepoints → s1^0 = 35
  • Sample a new proposal value s1' from a uniform distribution: $s_1' \sim U[s_{min},\,s_2^0) = U[0,\,61)$ → s1' = 51
  • Accept/reject the candidate according to αχ → e.g., accept: s1^1 = s1' = 51

Sample 1:  k1 = 2,  s1^1 = 51,  s2^1 = 61,  θ1^1 = 4.04,  θ2^1 = 1.51,  θ3^1 = 5.75
Reversible-jump MCMC: example (4)

- Iteration 2:
  • Calculate the probabilities for each possible move: since k is still 2, the probabilities are the same as before:
    η2 = 0.0500,  χ2 = 0.0500,  b2 = 0.2250,  d2 = 0.6750
  • Sample one of the possible moves: r = 0.65 falls in the d2 sub-interval of [0, 1) → "DEATH" OF A LOCATION
Reversible-jump MCMC: example (5)

  • Select at random one of the changepoints → s2^1 = 61 (adjacent values: θ2^1 = 1.51 on [s1^1, s2^1] = [51, 61] and θ3^1 = 5.75 on [s2^1, smax] = [61, 100])
  • Proposed value: $(s_2^1 - s_1^1)\,\theta_2^1 + (s_{max} - s_2^1)\,\theta_3^1 = (s_{max} - s_1^1)\,\theta_2'$ → θ2' = 4.88
  • Accept/reject the candidate according to αd → e.g., accept: θ2^2 = θ2' = 4.88

Sample 2:  k2 = 1,  s1^2 = 51,  θ1^2 = 4.04,  θ2^2 = 4.88
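The weighted-average merge used in this death move is easy to check numerically; the sketch below reproduces the example's θ2' ≈ 4.88:

```python
def death_proposal(theta_left, theta_right, s_left, s_mid, s_right):
    """Merge two adjacent segments, preserving the weighted average:
    (s_mid - s_left) theta_left + (s_right - s_mid) theta_right
        = (s_right - s_left) theta_new
    """
    return ((s_mid - s_left) * theta_left
            + (s_right - s_mid) * theta_right) / (s_right - s_left)
```

With the example's numbers, `death_proposal(1.51, 5.75, 51, 61, 100)` gives approximately 4.88: the merged value is dominated by θ3 = 5.75 because its segment [61, 100] is much longer than [51, 61].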
Reversible-jump MCMC: example (6)

Sample 0:    k = 2,  s = (35, 61),  θ = (4.04, 1.51, 5.75)
Sample 1:    k = 2,  s = (51, 61),  θ = (4.04, 1.51, 5.75)
Sample 2:    k = 1,  s = (51),      θ = (4.04, 4.88)
…
Sample u:    k = 3,  s = (s1^u, s2^u, s3^u),  θ = (θ1^u, θ2^u, θ3^u, θ4^u)
Sample u+1:  k = 3,  s = (s1^(u+1), s2^(u+1), s3^(u+1)),  θ = (θ1^(u+1), θ2^(u+1), θ3^(u+1), θ4^(u+1))
…
Sample Ns:   k = 1,  s = (s1^Ns),   θ = (θ1^Ns, θ2^Ns)

The histogram of the k values collected in the chain gives the posterior distribution π(k|x) of the number of changepoints.
Reversible-jump MCMC: example (7)

From the same table of samples, the changepoint locations collected for each value of k give the conditional posteriors π(s|x, k = 1), π(s|x, k = 2), π(s|x, k = 3), which are combined by marginalization:

$$\pi(s\,|\,\mathbf{x}) = \sum_{k}\pi(s\,|\,\mathbf{x},k)\,\pi(k\,|\,\mathbf{x})$$
Application 1: inhomogeneous Poisson process

Inhomogeneous Poisson process: λ = λ(t).
Synthetic dataset (true values): λ1 = 1, λ2 = 2, λ3 = 4; changepoints s1 = 84, s2 = 120.
Experimental dataset: x = {x1, x2, …, xN} = t = {t1, t2, …, tN} (N = 240 failure times)

Likelihood function:

$$L(\theta\,|\,\mathbf{x}) = L(\lambda(t)\,|\,\mathbf{t}) = \left[\prod_{i=1}^{N}\lambda(t_i)\right]\exp\left(-\int_{s_0}^{s_{k+1}}\lambda(t)\,dt\right)$$
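For a piecewise-constant rate the likelihood above reduces to sums, shown here in log form (a sketch; the observation window [s0, sk+1] is passed explicitly as `s_min`, `s_max`):

```python
import numpy as np

def log_lik(event_times, s, lam, s_min, s_max):
    """log L = sum_i log(lam(t_i)) - integral of lam over [s_min, s_max]
    for a piecewise-constant rate: `s` holds the k changepoints and `lam`
    the k+1 segment rates."""
    edges = np.concatenate(([s_min], np.asarray(s, dtype=float), [s_max]))
    lam = np.asarray(lam, dtype=float)
    t = np.asarray(event_times, dtype=float)
    # index of the segment each event falls in
    seg = np.clip(np.searchsorted(edges, t, side="right") - 1, 0, lam.size - 1)
    return float(np.sum(np.log(lam[seg])) - np.sum(lam * np.diff(edges)))
```

This is the likelihood-ratio building block the reversible-jump moves need: each move changes `s` or `lam` and compares the two log-likelihoods.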
Application 1 - Results: posterior distributions

[Figures: posterior π(k|t) of the number of changepoints; posterior π(s|t) of the changepoint locations si; posterior π(λ|t) of the failure rates λi - with the true values λ1 = 1, λ2 = 2, λ3 = 4, s1 = 84, s2 = 120 marked for reference]
Application 2: crack growth due to fatigue

Paris-Erdogan law:

$$\frac{dZ_t}{dt} = a_0\,Z_t^{b}\,X_t, \qquad a_0, b = \text{constant}, \quad Z_t = \text{crack size}, \quad X_t = \text{multiplicative noise} \sim \exp[N(\mu,\sigma)]$$

Synthetic dataset (true values): s1 = 20, s2 = 60; µ1 = log(1.2), µ2 = log(1.6); σ1 = 0.9, σ2 = 0.4.
Experimental dataset: x = {x1, x2, …, xN} = Z = {Z1, Z2, …, ZN} (N = 90 observations)

Likelihood function. Setting

$$Y_t = \log(X_t) = \log\!\left(\frac{dZ_t/dt}{a_0\,Z_t^{b}}\right) \approx N(\mu,\sigma),$$

then

$$L(\mu,\sigma\,|\,\mathbf{Z}) \propto \frac{1}{\sigma^{N}}\exp\left(-\sum_{i=1}^{N}\frac{(Y_i-\mu)^2}{2\sigma^2}\right)$$
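The resulting Gaussian likelihood in log form, acting on the recovered log-noise values Yᵢ (a sketch; recovering Y from Z requires differentiating the crack-size record, which is not shown here):

```python
import numpy as np

def log_lik_gauss(Y, mu, sigma):
    """log of L(mu, sigma | Z) ∝ sigma^{-N} exp(-sum (Y_i - mu)^2 / (2 sigma^2)),
    up to the additive constant dropped by the proportionality."""
    Y = np.asarray(Y, dtype=float)
    return float(-Y.size * np.log(sigma) - np.sum((Y - mu) ** 2) / (2.0 * sigma**2))
```

As expected for a Gaussian model, this log-likelihood is maximized over µ at the sample mean of Y; the changepoint inference then looks for shifts in (µ, σ) along the record.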