Pat Hagan Markovian IR Models

Embed Size (px)

DESCRIPTION

hagan paper

Citation preview

  • Markov interest rate modelsPATRICK S . HAGAN1 and DIANA E. WOODWARD2

    1NumeriX, 546 Fifth Avenue, 17th Floor, New York, NY 10036.2The Bank of Tokyo-Mitsubishi, Ltd., 1251 Avenue of the Americas, New York, NY 10020.

    Received October 1997. Revised March 1998. Accepted October 1999.

    A general procedure for creating Markovian interest rate models is presented. The models created by thisprocedure automatically t within the HJM framework and t the initial term structure exactly. Therefore theyare arbitrage free. Because the models created by this procedure have only one state variable per factor, two-and even three-factor models can be computed efciently, without resorting to Monte Carlo techniques. Thiscomputational efciency makes calibration of the new models to market prices straightforward. Extended HullWhite, extended CIR, BlackKarasinski, Jamshidians Brownian path independent models, and Flesaker andHughstons rational log normal models are one-state variable models which t naturally within this theoreticalframework. The separable n-factor models of Cheyette and Li, Ritchken, and Sankarasubramanian whichrequire n(n 3)=2 state variables are degenerate members of the new class of models with n(n 3)=2factors. The procedure is used to create a new class of one-factor models, the b g models. These models canmatch the implied volatility smiles of swaptions and caplets, and thus enable one to eliminate smile error. Theb g models are also exactly solvable in that their transition densities can be written explicitly. For these modelsaccurate but not exact formulas are presented for caplet and swaption prices, and it is indicated how theseclosed form expressions can be used to efciently calibrate the models to market prices.

    1. Introduction

    Heath, Jarrow and Morton (1990, 1992) created a broad framework for developing arbitrage-freeterm structure models. In general, HJM models are non-Markovian and require extensive MonteCarlo simulation to calibrate model parameters to market prices, to value contingent claims, andto determine hedges. This computational burden can be greatly reduced by using special cases ofthe HJM models which are Markovian. Notable among these special cases are the separable n-factor models of Cheyette (1992) and Li et al. (1995) and Ritchken and Sankarasubramanian(1995). Although the separable n factor models are Markovian, they require n(n 3)=2 statevariables, which still imposes a stiff computational burden. Moreover, to date there has been nosystematic procedure for nding HJM models which are Markovian.Here we use the method of the undetermined numeraire to develop a general procedure for

    creating Markovian term structure models. The resulting models automatically t within the HJMframework and match the initial discount curve. Thus they are arbitrage free. Extended HullWhite(1990a, b), extended CIR (Cox et al., 1985), Black-Karasinski (1991), Jamshidians (1991) Brownianpath independent models, and Flesaker and Hughstons rational log normal models (1996) all tnaturally within our theoretical framework.Unlike the separable models, the new models require only n state variables for an n-factor

    Applied Mathematical Finance 6, 233260 (1999)

    Applied Mathematical Finance ISSN 1350-486X print/ISSN 1466-4313 online # 1999 Taylor & Francis Ltdhttp://www.tandf.co.uk/journals

  • model. This greatly reduces their computational complexity, allowing two- and even three-factormodels to be used without requiring Monte Carlo simulations. Although we can re-create theseparable models within our framework as degenerate models with n(n 3)=2 factors and statevariables, we can also create models which are direct analogues of the separable models and requireonly n factors and state variables. These analogues are asymptotically equivalent to their separablecounterparts in the limit of small volatilities, and, at normal market volatilities, the two sets ofmodels should give essentially identical results.We use our procedure to create a new class of one factor models, the b g models. Unlike other

    one-factor models, these models can match the implied volatility smiles of swaptions and caplets,and thus enable us to largely eliminate smile error from our books. The b g models are also exactlysolvable in that their transition densities (Greens functions) can be written exactly. For these modelswe present accurate but not exact formulas for caplet and swaption prices, and indicate howthese closed form expressions can be used to efciently calibrate the models to exact market prices.

    2. Derivation

    Consider a continuous trading economy on the interval [0, Tmax]. To x notation, let f0(T ) betodays instantaneous forward rate for date T for this economy. Then the current value of a zerocoupon bond which pays $1 at maturity T is

    D(0, T ) e - T0

    f0(T 9 ) dT 9 (2:1)

    and the current discount factor from time t to T is D(t, T ) D(0, T )=D(0, t).To model this economy, recall that arbitrage-free term structure models have three essential

    elements. First is a set of stochastic processes which drive the evolution of interest rates. Second isa numeraire, a tradable instrument with positive value and no cash ows. Commonly the moneymarket account or a specic zero coupon bond is used as the numeraire. The nal element is avaluation formula which states that, in the absence of any cash ows, the value of any tradableinstrument1 is a Martingale when expressed in units of the numeraire (Harrison and Pliska, 1981;Harrison and Kreps, 1979).In a Markovian model, the entire term structure at any time t must be determined solely by the

    values X 1, X 2, . . . , X n of a nite set of random state variables X1(t), X2(t), . . . , X n(t). We assumethat these state variables evolve according to the Ito processes

    dX i(t) l i(t, X )dt r i(t, X )d bW i(t), X i(0) 0, i 1, 2, . . . , n (2:2a)Here bW1(t), bW2(t), . . . , bW n(t) represent n Brownian motions, which can be correlated

    d bW i(t)d bW j(t) rij(t)dt (2:2b)To obtain a Markovian model, we need to choose a numeraire whose value N depends only on the

    values x1, x2, . . . , xn of the state variables: N eH( t,x). We stress that at this point, the function

    1 The term tradable instruments is used to refer to both fundamental instruments that are actively traded and all derivativesand synthetic instrument that can be created.

    234 Hagan and Woodward

  • H (t, x) is arbitrary and we do not need to know which nancial instrument is represented by thenumeraire. Simply requiring N (t, x) to be a market instrument (with no cash ows) will enable usto derive all interest rates.To express the valuation formula, consider a tradable instrument which has the value V (T , X (T ))

    at some date T and pays a cash ow C(t, X (t)). The value of the instrument expressed in units ofthe numeraire is V (t, x)=N (t, x) V (t, x) e - H( t,x). Requiring V (t, x)=N (t, x) to be a Martingale inthe absence of any cash ows then yields

    V (t, x) eH ( t,x)bE V (T , X (T )) e - H (T ,X(T )) TtC(t 9 , X (t 9 )) e - H( t 9 ,X( t 9 )) dt 9 X (t) x

    ( ),

    (2:3)

    where x X (t) is the state of the economy at time t. Equation 2.3 gives the value of theinstrument at any time t , T . Note that bE is the expected value under the probability measure2determined by Equation 2.2.Without loss of generality, we re-write the numeraire as

    N (t, x) 1D(0, t)

    eh( t,x)A( t) (2:4a)

    where

    h(t, 0) 0, A(0) 0 (2:4b)Then the valuation formula becomes

    V (t, x) eh( t,x)A( t)bE V (T , X (T ))D(t, T ) e - h(T ,X(T )) - A(T )TtC(t 9 , X (t 9 ))D(t, t 9 ) e - h(t 9 ,X( t 9 )) - A( t 9 ) dt 9 X (t) x (2:5)

    The valuation formula (2.5), with the state variables X (t) determined by the stochastic process(2.2), denes a term structure model. As we shall see, if this model is consistent with the initialdiscount curve then it is arbitrage free. In particular, this model automatically satises the crucialforward rate drift restriction of HJM (1992) theory.3 We stress that apart from some mathematicalregularity conditions, the drift rates l i(t, x), the volatilities r i(t, x), and the function h(t, x) arearbitrary. The remaining function A(t) will be determined by requiring the valuation formula (2.5) tobe consistent with the initial discount curve.To impose consistency, dene Z(t, x; T ) to be the value at t, x of a zero coupon bond which pays

    $1 at maturity T. From (2.5),

    2 Equation 2.2 thus denes the risk neutral probability measure induced by the numeraire N (t, x), and does not represent realworld probabilities.3 This is not surprising: (2.2) and (2.5) self-consistently dene the probability measure of the risk neutral-world under aspecic (if unknown) numeraire N( t, x); this then uniquely denes the risk neutral probability measure under every othernumeraire, including the money market numeraire; and HJM (1992) showed that for any self-consistent risk-neutral world, theinstantaneous forward rates must satisfy the forward drift restriction under the probability measure induced by the moneymarket numeraire.

    Markov interest rate models 235

  • Z(t, x; T ) D(t, T ) eh( t,x)A( t) - A(T )S(t, x; T ) (2:6a)where

    S(t, x; T ) bEfe - h(T ,X (T )) j X (t) xg (2:6b)As we are taking X (0) 0, consistency with the initial discount curve requires Z(0, 0; T )D(0, T ). Since h(t, 0) A(0) 0, this requirement becomes

    A(T ) log S(0, 0; T ) log bEfe - h(T ,X (T )) j X (0) 0g for all T . 0 (2:7)Thus consistency with the initial discount curve requires choosing A(T ) so that the expectedvalue of e - h(T ,X (T )) - A(T ) is 1 for all T . 0.Note that the current discount factors D(0, T ) do not appear in (2.7); choosing A(T) so that the

    model is consistent with the initial term structure is independent of the actual initial term structure.Once A(T ) has been selected according to (2.7), then if the initial term structure changes, all weneed do is replace the discount factors D(t, T ) with the new discount factors, and the model will beconsistent with the new initial term structure. This is useful during calibration since it separates thevolatility functions r i(t, X ), l i(t, X ), and h(t, X ) endogenous to the model from the currentdiscount factors D(t, T ), which are exogenous.With the economy in state X (t) x at date t, the instantaneous forward rates f (t, x; T ) are

    dened implicitly by Z(t, x; T ) expf - Tt f (t, x; T 9 )dT 9 g. From (2.6),Ttf (t, x; T 9 )dT 9

    Ttf0(T 9 )dT 9 - h(t, x) - A(t) A(T ) - log S(t, x; T ) (2:8)

    Hence the forward rate curve at t, x is

    f (t, x; T ) f0(T ) A9 (T ) - ST (t, x; T )=S(t, x; T ) (2:9)where we are using subscripts to denote partial derivatives. The short rate r(t, x) is dened asf (t, x; t), so we have

    r(t, x) f0(t) A 9 (t) - ST (t, x; t)=S(t, x; t) (2:10a)In Appendix A it is shown that this expression for the short rate is equivalent to

    r(t, x) f0(t) A9 (t) h t(t, x)Xi

    l i(t, x)hxi (t, x)

    12

    Xij

    rij(t)r i(t, x)r j(t, x)[hxi x j (t, x) - hxi (t, x)hx j (t, x)] (2:10b)

    With A(t) dened by (2.7), the valuation formula (2.5) and random processes (2.2) uniquelydene a term structure model which is consistent with the initial discount curve. The instantaneousforward interest rates and short rate for this model are given by (2.9) and (2.10), respectively. At thispoint it would be natural to use Martingale theory (Harrison and Pliska, 1981; Harrison and Kreps,1979) to show that this model is arbitrage free. Instead, in Appendix A we show that this model tswithin the HJM framework. There it is found that using the money market as a numeraire, theprocess for the instantaneous forward rate, F(t, T ) f (t, X (t); T ), satises

    236 Hagan and Woodward

  • dF(t, T )Xij

    rij(t)aiT (t, X ; T )aj(t, X ; T )dt -

    Xi

    aiT (t, X ; T )deW i(t) (2:11a)in the risk-neutral probability world, with

    a i(t, x; T ) r i(t, x) Sxi( t, x; T )

    S(t, x; T )-Sxi (t, x; t)S(t, x; t)

    i 1, 2, . . . , n: (2:11b)

    Inspection shows that the drift terms in (2.11a) satisfy the forward drift restriction of HJM(1992) theory. Consequently, any term structure model given by (2.5), (2.2), and (2.7) ts withinthe HJM framework, and provided it satises innocuous regularity conditions, the model isarbitrage free.4

    Equation 2.5 can be used to directly value any contingent claim whose payoff V (T , X(T )) andcash ow C(t, X (t)) can be expressed as explicit functions of the state variables. Since (2.9)expresses the entire forward rate curve f (t, x; T ) explicitly, this includes all instruments whosepayoff and cash ow depend only on the then current yield curve. This includes European,Bermudan, and American swaptions, caps, and most yield curve options. However, there is noguarantee that instruments whose payoff or cash ow are path dependent (e.g. instruments withpayments determined by the average short rate over the preceding period) can be evaluated withoutadding extra state variables or resorting to path dependent valuation techniques.

    3. One factor models

    We now use the general framework for arbitrage-free Markovian models developed in Section 2to create a special class of one factor models. In Section 4 we will further specialize this classof models to obtain the b g models.Although one-factor models are usually expressed in terms of the money market numeraire using

    the short rate r as the state variable, any such short rate model can be re-cast within theframework of Section 2 by using a zero coupon bond as the numeraire. Conversely, any one-factormodel within the framework of Section 2 can be transformed into a short rate model. Thesetransformations are carried out in Appendix B.Within the framework of Section 2, a one-factor model is dened by the stochastic process for the

    state variable,

    dX (t) l (t, X )dt r (t, X )d bW (t) X (0) 0 (3:1)and the numeraire

    N (t, x) 1D(0, t)

    eh( t,x)A( t) (3:2)

    which creates the link between the state variable and nancial markets.

    4We do not restate these conditions here; we refer the reader to HJM (1992). Although these conditions are innocuousmathematically, each points out a more stringent business condition. For example, the mathematical requirement marketcompleteness points out that risks must be hedgeable with liquid instruments with minimal transaction costs.

    Markov interest rate models 237

  • We argue that with our current knowledge about interest rates, we are not justied in using both acomplicated function h(t, x) in the numeraire and a complicated stochastic process for X (t). Insteadwe should choose either a relatively simple function h(t, x) or a relatively simple stochastic process.We arbitrarily choose to use a simple numeraire

    N (t, x) 1D(0, t)

    ek ( t)xA( t) (3:3)

    for our model. The consistency condition (2.7) then becomes

    A(t) log bEfe - k ( t)X ( t) j X (0) 0g (3:4)Now consider the drift term, l (t, X )dt. Suppose the model had, say, a positive drift so that X (t)

    increased on average. Then A(t) would be negative and would effectively cancel out the averageeffect of the drift term in the numeraire. Similarly, if the drift term was negative, A(t) would bepositive, again canceling out most of the drift. Indeed (3.4) can be written asbEfe - k ( t)X (t) - A( t) j X (0) 0g 1 (3:5)Since A(t) cancels out the main effects of the drift term, any residual inuence of the drift terml (t, X )dt on the model would be quite subtle. Therefore, in the interests of simplicity, we takeour model to be driftless.So consider the special class of one factor models

    dX (t) r (t, X )d bW (t) X (0) 0 (3:6)with the numeraire (3.3). As shown in Appendix C, this class of models includes the extendedHullWhite, the extended CIR, and BlackKarasinski like models. However, the BlackKarasinski model itself cannot be included without adding a drift term to (3.6).With the numeraire (3.3), the valuation formula becomes

    V (t, x) e k ( t)xA( t)bE V (T , X (T ))D(t, T ) e - k (T )X (T ) - A(T )TtC(t 9 , X (t 9 ))D(t, t 9 ) e - k (t 9 )X ( t9 ) - A( t9 ) dt 9 X (t) x (3:7)

    so the value of a zero coupon bond becomes

    Z(t, x; T ) D(t, T )e - [k (T ) - k (t)]x - [A(T ) - A(t)]M( t,x;T ) (3:8)Here we have dened

    M (t, x; T ) log bEfe - k (T )[X (T ) - x] j X (t) xg k (T )x log S(t, x, T ) (3:9a)and the consistency condition is now

    A(T ) M (0, 0; T ) for all T (3:9b)It is instructive to examine the forward rate curve. Since Z(t, x; T ) expf - Tt f (t, x; T 9 )dT 9 g,

    Equation 3.8 shows that if the economy is in state X (t) at time t, then the instantaneous forwardrate for date T would be

    238 Hagan and Woodward

  • f (t, X (t); T ) f0(T ) k 9 (T )X A9 (T ) - M T (t, X ; T ) (3:10a)The corresponding short rate would be

    r(t, X (t)) f0(t) k 9 (t)X A9 (t) - M T (t, X ; t) (3:10b)For the moment, let us neglect the last two terms in (3.10a) and (3.10b). Then the forward rate

    curve is just the curve k 9 (T )X added to the original forward curve f0(T ). Note that k 9 (T ) is agearing factor: if the state variable X (t) is shifted by an amount D , then the short rate would shiftby d r(t) k 9 (t)D and the forward rates f (t, X ; T ) would shift by d f k 9 (T )D k 9 (T )d r=k 9 (t).Since we expect shocks to affect short maturities more than longer maturities, we expect k 9 (T ) to bea decreasing function of T .

    Note that Equation 3.6 implies that X (t) is a Martingale, so for all times T . t, the expectedvalue of X (T ) is just X (t). Thus, if X (t) was shifted by an amount D , then for all later times T theexpected value of X (T ) would also shift by D . Consequently, if a shock shifted the short rate byd r(t) k 9 (t)D , then for all later times T the expected value of the short rate r(T , X (T )) would shiftby k 9 (T )d r= k 9 (t). As T increases, the expected value of the short rate returns to the initial forwardcurve f0(T ) at a rate determined by how rapidly k 9 (T ) decreases. So mean reversion of interest ratesis directly determined by the function k (T ).With f (t, X ; T ) f0(T ) k 9 (T )X . . . , the distribution of forward rates is determined by the

    distribution of X (t). The width of this distribution is determined mainly by the overall magnitude ofr (t, x), and the shape of this distribution (i.e. the deviation from Gaussian or log normal behavior)is determined mainly by the functional form of the dependence of r (t, x) on x. As noted above,mean reversion of interest rates is determined mainly by the function k (T ). Since skews and smiles(the changes in an options implied volatility as the strike changes) depend mainly on the shape ofthe distribution, and since the change in the implied volatility as the duration of the underlyinginstrument changes depends mainly on mean reversion, this separation makes it easy to calibratethese models to both at-the-money and off-market instruments of differing durations. This makesthese models very useful for pricing, hedging, and understanding instruments in the presence ofimplied volatility skews and smiles.The third term A9 (T ) in (3.10) embodies the consistency requirement, and is much smaller than

    the rst two terms. Even though (3.6) implies that X (t) has mean zero, the convex relation betweenbond prices and interest rates would cause bond prices to drift, on average, as the distribution ofX (t) spreads out. The term A 9 (T ) cancels the expected value of this convexity effect. This is whythe consistency term A(T ) depends on the functions k (T ) and r (t, x), but not on the initial termstructure f0(T ). The last term M T (t, x; T ) arises because as X (t) evolves away from X (0) 0, theexpected value of the convexity effect changes.We can clarify the relation between the distribution of X (T ) and the convexity terms A(T ) and

    M(t, x; T ) by expanding (3.9) in powers of k . Dene the moments

    mk (t, x; T ) bEf[X (T ) - x]k j X (t) xg k 1, 2, . . . (3:11)and note that m1(t, x; T ) 0. Expanding (3.9) yields

    M(t, x; T ) 12k 2(T )m2(t, x; T ) - 16k 3(T )m3(t, x; T ) 124k 4(T )[m4(t, x; T ) - 3m22(t, x; T )] . . .(3:12a)

    Markov interest rate models 239

  • A(T ) 12k 2(T )m2(0, 0; T ) - 16k 3(T )m3(0, 0; T ) 124k 4(T )[m4(0, 0; T ) - 3m22(0, 0; T )] . . .(3:12b)

    Our experience calibrating one-factor models to US swaption and caplet markets shows that themagnitude of the quadratic term 12k

    2(T )m2(t, x; T ) is roughly (T - t)T 2 3 10 - 4, where time ismeasured in years. The cubic and quartic terms, which arise from the skewness and kurtosis of thedistribution, are much smaller and have magnitudes of roughly (T - t)2T 3 3 10 - 8 and(T - t)2T 4 3 10 - 9, respectively.5 We conclude that it is safe to truncate after the quadratic termexcept for unusually sensitive instruments, unusually long durations, or unusually competitivemarkets.Truncating after the quadratic terms yields the forward rates

    f (t, x; T ) f0(T ) k 9 (T )x 12@

    @Tf k 2(T )[m2(0, 0; T ) - m2(t, x; T )]g . . . (3:13)

    Consequently, the expected forward rate isbEf f (t, X (t); T ) j X (0) 0g f0(T ) k 9 (T )k (T )m2(0, 0; t) . . . (3:14a)Note that m2(0, 0; t) is the variance of X (t), not X (T ). The covariance between forward rates atdifferent maturities is

    Covf f (t, X (t); T1), f (t, X (t); T2) j X (0) 0g k 9 (T1)k 9 (T2)m2(0, 0; t) . . . (3:14b)showing that the two rates are perfectly correlated through this order.Before continuing, let us briey address the unknown numeraire in (3.2). By construction,

    N (t, x) is always a valid numeraire.6 Suppose we arbitrarily set k (T ref ) 0 for some date T ref . Then(3.9) would imply that A(T ref ) 0 and M (t, x; T ref ) 0, so the value of the zero coupon bond ofmaturity T ref would be D(t, T ref )e k (t)A( t) (see (3.8)). So if we made this choice, then the unknownnumeraire N (t, x) would represent a zero coupon bond paying 1=D(0, T ref ) dollars at maturity T ref at least for times t < T ref . Although N (t, x) would remain a valid numeraire for times t . T ref , itwould no longer represent any simple security. In the same spirit, we could restrict the k (t) curve sothat k (t)!0 as t!1; then the numeraire would correspond to Flesaker and Hughstons (1996)absolute terminal measure.

    4. The b g models

    We now specialize to volatilities of the form r (t, x) a (t)[1 b X (t)]g , where b and g areconstant, and dene the b g models by

    dX (t) a (t)[1 b X (t)]g d bW (t) X (0) 0 (4:1a)5 This is equivalent to the change in interest rates over a period D t having a standard deviation of sizeO(100 bps [ D t=yrs]1=2), variance of size O(10- 4 D t=yrs), kurtosis of size O(1), and skewness of size O(10 - 2[ D t=yrs]1=2).6 SubstitutingV ( t, x) N( t, x) and C(t, x) 0 into the valuation formula (2.5) shows that for any Tfinal . 0, N (t, x) is thevalue of the interest rate option which has no cash ow and has the payoff N(Tfinal, X(T final )) at T final.

    240 Hagan and Woodward

  • with the numeraire

    N (t, x) 1D(0, t)

    ek ( t)xA( t) (4:1b)

    Without loss of generality, we normalize k (t) so that k 9 (0) 1.A key advantage of b g models is that there is a closed form solution for the transition density

    p(t, x; t , x )dx Probf x , X ( t ) , x dx j X (t) xg (4:2)The transition density p satises the backward equation

    pt 12a 2(t)j1 b xj2g pxx 0 t , t (4:3a)p! d (x - x ) as t! t (4:3b)

    In terms of the new variables

    s (t) t0a 2(t 9 )dt 9 y(x) j1 b xj

    1- g

    b (1 - g )(4:4)

    this becomes

    ps 12( pyy -2m - 1

    ypy ) 0 s , s (4:5a)

    p![ b (1 - g ) y ] - 2m 1 d (y - y ) as s ! s (4:5b)with m 1=(2 - 2g ). In particular, s (t) is roughly the variance of X (t), which is around 10- 4 t inUS dollar markets, where t is measured in years.Equation 4.5 can be solved by a combination of Laplace transform and Greens function

    techniques. For exponents 0 , g , 1=2 this yields

    p(t, x; t , x ) (y= y )m ys - s

    f12I - m (y y=( s - s )) 12I m (y y=( s - s ))g e - ( y2 y2)=2( s - s ) d y

    dxfor x . - 1= b (4:6a)

    p(t, x; t , x ) sin p mp

    (y= y )my

    s - sK m (y y=( s - s )) e -

    ( y 2 y2)=2( s - s ) d ydx

    for x , - 1=b

    (4:6b)

    Here

    d ydx

    [(1 - g )b y ] - 2m 1 (4:7)

    and K m and I m are the modied Bessel functions.The b g models have an articial barrier at x - 1= b , where the volatility j1 b xjg goes to

    zero. For exponents g , 1=2, the volatility does not decay fast enough as y!0 to prevent y frombecoming negative. If we wished to avoid negative values of y, we could put a reecting boundarycondition at y 0; this would eliminate (4.6b) and change the Bessel functions in (4.6a) from12(I - m I m ) to I - m .The probability of y approaching zero is extremely small, of order e - 1= b

    2(1- g )2 s . This is much too

    Markov interest rate models 241

  • small to affect the pricing of realistic products. Since b and g can be chosen to reproduce thevolatility smile reasonably well, indicating that the model is reproducing the right shape of the risk-neutral probability distribution near the distributions centre, we argue that the benet of increasedaccuracy near the distributions centre far outweighs the liability of having an articial barrier in theextreme tails of the distribution. In fact, we do not expect any model to be valid in the extreme tailsof the distribution where there is no relevant market experience.For exponents 1=2 < g , 1 we have

    p(t, x; t , x ) (y= y )m ys - s

    I m (y y=( s - s ))e - ( y2 y2)=2( s - s ) d y

    dx C (m , y

    2=2( s - s ))C (m )

    d (x 1= b )(4:8)

    where C (m , h ) is the incomplete gamma function. For these exponents the volatility declines fastenough to prevent y from becoming negative, but it doesnt prevent y from reaching the origin.This causes a very small, but nite, probability of y being exactly zero. As before, thisprobability is much too small to affect the pricing of realistic deals.For both sets of exponents, the transition probability can be expanded as

    p(t, x; t , x ) (y= y )m - 1=2

    [2p ( s - s )]1=2e - ( y - y)

    2=2( s - s ) d ydx

    1 -4m 2 - 18y y

    ( s - s ) . . .( )

    (4:9)

    for small values of the variance s - s .Finally, for exponent g 1 we need to re-dene y as

    y(x) b - 1 log (1 b x) (4:10a)Then the transition density is simply

    p(t, x; t , x ) e- b y

    [2p ( s - s )]1=2e - ( y - y b ( s - s )=2)

    2=2( s - s ) (4:10b)

    With the transition density in hand, M(t, x; T ) can be found by integration,

    M (t, x; T ) logp(t, x; T , x )e - k (T )( x - x) dx (4:11a)

    Close inspection of (4.5) and (4.11a) reveals that M(t, x; T ) is of the form

    M (t, x; T ) M (S, H ) (4:11b)

    where

    S b 2( s - s )=(1 b x)2(1- g ) H k (T )(1 b x)= b (4:11c)This is useful since for each exponent g , the function M (S, H ) can be calculated and tabulatedonce, eliminating most of the computational burden of the b g model. Once these tables havebeen calculated, the values Z(t, x; T ) of zero coupons bonds, the forward rate curve f (t, x; T ),and the short rate r(t, x) can be obtained directly from (3.8)(3.10).

    Three exponents merit special attention for their simplicity: g 0, g 1=2, and g 1. InAppendix C we analyse the b g model for these exponents, allowing b to depend on time:

    242 Hagan and Woodward

  • dX (t) a (t)[1 b (t)X (t)]g dbW (t) (4:12)Here we quote only the results with b constant.When g 0, the b g model becomes the extended HullWhite model. In this case A(T ) and

    M(t, x; T ) are given by

    A(t) 12k 2(T ) s M(t, x; T ) 12k 2(T )( s - s ) (4:13)where

    s T0a 2(t 9 )dt 9 s

    t0a 2(t 9 )dt 9 (4:14)

    Equations 3.83.10 now yield explicit expressions for the value of zero coupon bonds, theforward rate curve, and the short rate.Similarly, A(T ) and M (t, x; T ) can be written explicitly when g 1=2,

    A(T ) k2(T ) s

    2 b k (T ) s M (t, x; T ) (1 b x)k 2(T )( s - s )2 b k (T )( s - s ) (4:15)

    where s and s are given by (4.14). This again enables Z(t, x; T ), f (t, x; T ) and r(t, x) to beobtained from (3.8)(3.10). Besides deriving the corresponding formulas when b b (t), inAppendix C we nd that b (t) can be chosen so that this model is exactly the CIR model.Alternatively, one can use the extra freedom of choosing b (t) to calibrate the model to a seriesof off-market instruments as well as a series of at-the-money instruments, and thus reproduce thevolatility smile.The exponent g 1 (BlackKarasinski like models) is more difcult. Although we have been

    unable to derive explicit expressions for A(T ) and M(t, x; T ), the moments mk (t, x; T ) can bewritten explicitly. When b is constant,

    m2(t, x; T ) [(1 b x)= b ]2[e b 2( s - s ) - 1] (4:16a)m3(t, x; T ) [(1 b x)= b ]3[e3b 2( s - s ) - 3e b 2( s - s ) 2] (4:16b)m4(t, x; T ) [(1 b x)= b ]4[e6b 2( s - s ) - 4e3b 2( s - s ) 6e b 2( s - s ) - 3] (4:16c)

    We can use the rst few terms in expansion (3.12) for M (t, x; T ) and A(T ), and substitute theseexpansions into (3.8) and (3.10) to obtain Z(t, x; T ), f (t, x; T ), and r(t, x). Even if only thesecond moment is used, this should be accurate enough to price all but the most sensitiveinstruments.

    4.1. European option prices

    A vanilla receiver swaption is a European option which gives the holder the right to receive apredetermined series of cash payments C i on dates t i, i 1, 2, . . . , n, in return for essentially7paying K on the settlement date ts. A payor swaption is a European option which gives the

    7We are assuming that the oating leg re-values to K at the settlement date.

    Markov interest rate models 243

  • holder the right to receive the strike K at settlement in return for making a predetermined seriesof payments. Similarly, a caplet or oorlet is a European option which gives the holder the rightto exchange specic payments with the issuer.Consider a receiver swaption with exercise date te. Since N (0, 0) 1, the value of the option

    today is

    V (0, 0) D(0, te )p(0, 0; te, xe )Q(te, xe )e - k ( te )xe - A( te ) dxe (4:17a)

    Here

    Q(te, xe )Xi

    C i Z(te, xe ; t i) - KZ(te, xe ; ts)

    ( )(4:17b)

    is the value of the payoff if the economy is in state X (te ) xe at expiry. Substituting (3.8) forthe zero coupon bonds gives

    V (0, 0)p(0, 0; te, xe )

    Xi

    C iD(0, t i) e - k ixe - AiM(te,xe ;t i )

    (

    - KD(0, ts) e - k s xe - AsM( te ,xe ; ts )

    dxe (4:18)

    where k i k (t i), k s k (ts), etc. Since the transition densities p(0, 0; te, xe ) are known and thefunction M (t, x; T ) is pre-tabulated, these options can be valued exactly with a single integration.Let F0 be todays forward value of the cash ow at settlement,

    F0 Xi

    C iD(ts, t i ) (4:19)

    The implied price vol of the option8 is dened as the volatility r B for which Blacks formula,

    V0 D(0, ts)[F0N (d1) - KN (d2)] (4:20a)

    d1,2 log F0=K 12r

    2B te

    r B

    t

    pe

    (4:20b)

    yields the correct price V0 given by (4.18).In Hagan and Woodward (in preparation) singular perturbation techniques are used to obtain

    accurate approximations to the option price (4.18), with the results stated in terms of implied pricevols. There it is found that to leading order the option price is

    r B 1te

    te0a 2(t 9 )dt 9

    1=2

    K 1(1 b x0)g . . . (4:21a)

    where x0 is dened implicitly by

    8 The price vol dened here should not be confused with the more commonly quoted rate vol.

    244 Hagan and Woodward

  • 12(F0 K )

    Xi

    C iD(ts, t i) e -( k i - k s )x0 (4:21b)

    Here

    K 1

    PiC iD(ts, t i)( k i - k s) e - k i x0P

    iC iD(ts, t i) e - k ix0(4:21c)

    A much more accurate formula is quoted in Appendix F.Equation 4.21a illustrates the roles played by local volatility, mean reversion, and skew in

    pricing. The local volatility a (t) only enters (4.21a), which shows that the implied price vol r B isproportional to the mean-square-average of the local volatility between today and the expirationdate.Recall that k 9 (t) is a decreasing function, and that mean reversion refers to how rapidly k 9 (t)

    decreases (see (3.10) et seq). As mean reversion increases, k (t i) - k (ts ) k i - k s decreases,which decreases the value of K 1. Thus, as mean reversion increases, the price vol r B of theswaption decreases, as expected. Increasing the mean reversion also increases the importance ofthe earlier payments C i relative to the later payments.Equations 4.19 and 4.21b imply that x0 0 when the option is struck at-the-money (K F0), and

    that x0 is a decreasing function of the strike K . Thus, the price of an at-the-money swaption isindependent of b and g , at least to within the approximations used to obtain (4.21). If b or g is zero,the price vol is independent of the strike K ; as b and g increase from zero, the implied price volbecomes a decreasing function of the strike K .Besides giving insight into how the model parameters affect option prices, Equation 4.21 can

    be used effectively in model calibration. Calibration is the process of choosing the modelparameters a (t) and k (t) to minimize or eliminate the errors between the models predictedprices and the actual prices of a set of standard market instruments. This is typically aniterative process, with most of the computational time spent calculating derivatives of thepredicted prices with respect to variations in the model parameters. By using the approximateformula (4.21) to obtain a shrewd initial guess for the model parameters, and then using thederivatives of (4.21) in place of the actual derivatives, one can greatly speed up the calibrationprocess.

    5. Conclusions.

    We used HJM theory to develop a framework for creating arbitrage-free Markovian term structuremodels. This framework makes designing arbitrage-free interest rate models which match theinitial discount curve a straightforward task, freeing the designer to concentrate on imbuing themodel with less fundamental attributes. Not only are the resulting models Markovian, but theyhave no more state variables than are needed to handle the random factors. This parsimony opensup new methods for evaluating and using multi-factor models: trees, explicit and implicit nitedifference schemes, and Greens function techniques can be used as well as Monte Carlomethods.All arbitrage-free Markovian term structure models we are aware of t within this framework.

    Markov interest rate models 245

  • Appendices B and C explicitly demonstrate that the extended HullWhite (1990a, b), extended CIR(Cox et al. 1985), BlackKarasinski (1991), and Jamshidians (1991) Brownian path independentmodels t within this framework, and Appendix D shows that Flesaker and Hughstons (1996)rational interest rate models also t within the framework. The separable n-factor models are moredifcult since they require n(n 3)=2 state variables. Appendix E shows that these models can beconstructed as degenerate n(n 3)=2 factor models. There we also create n-factor models which areanalogues of the separable models and are asymptotically equivalent to their separable counterpartsin the limit of small volatilities.In Section 4 we used this framework to create the b g models, a class of one-factor models

    dened by the process

    dX (t) a (t)[1 b X (t)]g d bW (t) X (0) 0 (5:1)and the numeraire

    N (t, x) 1D(0, t)

    ek ( t)xA( t) (5:2)

    These models enable us to account for local volatility, mean reversion, and skew, and allow bothat-the-money and off-market instruments of varying maturities to be priced simultaneously. Theiranalytic tractability is an added benet.

    Describing local volatility, mean reversion, and skew is as much as can be expected from a one-factor model. However, some products are sensitive to risks which can not be described by a one-factor model. The two most common risks are sensitivity to rate decorrelation/curve exing andforward/stochastic volatilities. Decorrelation risk occurs because forward rates become progressivelyless correlated as the difference between the maturity dates (and start dates) increases; thisdecorrelation tends to make the forward rate curve ex. In contrast, all forward rates in a one-factor model are essentially perfectly correlated. Forward volatility risk occurs in deals whichcontain an option on an option. Examples are captions (options on caps), and Bermudan andAmerican swaptions. In each case exercising the option involves either receiving or giving up anoption, so the exercise decision must balance the intrinsic value of the payoff against the value ofthe option received or lost, which is determined by the volatility environment that pertains at theexercise date.The theoretical framework makes it easy to extend popular one-factor models to include these

    risks. For example, we can convert the b g model to a stochastic volatility model by changing theequation for the state variable to

    dX (t) a (t)[1 b (t)X (t)]g Y (t)dbW1(t) X (0) 0 (5:3a)dY (t) c (t)Y (t)d bW2(t) Y (0) 1 (5:3b)

    and using the same numeraire as before,

    N (t, x) 1D(0, t)

    ek ( t)xA( t) (5:3c)

    Finally, note that the stochastic differential equations for the state variables play only a peripheral

    246 Hagan and Woodward

  • role in our theoretical framework; one could choose to directly model the risk neutral transitiondensities p(t, x; T ; X ), dispensing with these equations entirely.

    Acknowledgements

    We gratefully acknowledge the assistance of A. Berner, Olivier Van Eyseren, our colleagues onParibas New York Swaps desk and Paribas London FIRST team. The views presented in thisreport do not necessarily reect the views of NumeriX, The Bank of Tokyo-Mitsubishi, or any oftheir afliates.

    References

    Black, F. and Karasinski, P. (1991) Bond and option pricing when short rates are lognormal, FinancialAnalysts Journal, 46 5259.

    Cheyette, O. (1992) Term structure dynamics and mortgage valuation, Journal of Fixed Income 2841.Cox, J.C., Ingersoll, J.E. and Ross, S.A. (1985) A theory of the term structure of interest rates, Economics 53

    385407.Flesaker, B. and Hughston, L.P. (1996) Positive interest, in Vasicek and Beyond, Lane Hughston (ed.), (Risk

    Publications, London).Hagan, P.S. and Woodward, D.E. Swaption and caplet prices for one factor models, in preparation.Harrison, J.M. and Kreps, D. (1979) Martingales and arbitrage in multiperiod securities markets, Journal of

    Economic Stochastic Processes and Theory, 20 381408.Harrison, J.M. and Pliska, S.R. (1981) Martingales and stochastic integrals in the theory of continuous trading,

    Stochastic Processes and Applications, 11 215260.Heath, D., Jarrow. R. and Morton, A. (1990) Bond pricing and the term structure of interest rates: A discrete

    time approximation, Journal of Finance and Quantitative Analysis, 25 419440.Heath, D., Jarrow. R. and Morton, A. (1992) Bond pricing and the term structure of interest rates: A new

    methodology, Econometrica, 60 77105.Ho, T.S.Y. and Lee, S.B. (1986) Term structure movements and pricing interest rate contingent claims, Journal

    of Finance, 41 10111028.Hull, J.C. (1997) Options, Futures, and Other Derivatives (Prentice-Hall, Upper Saddle River).Hull, J. and White, A. (1990a) Pricing interest rate derivative securities, Revue of Financial Studies, 3 573592.Hull, J. and White, A. (1990b) Efcient procedures for valuing European and American path-dependent

    securities, Journal of Finance and Quantitative Analysis, 25 87100.Jamshidian. F. (1991) Forward induction and constructin of yield curve diffusion models, Journal of Fixed

    Income, 6274.Li, A., Ritchken, P. and Sankarasubramanian, L. (1995) Lattice models for pricing American interest rate

    claims, Journal of Finance, 50 719737.Longstaff, F.A. and Schwartz, E.S. (1992) Interest rate volatility and the term structure: a two-factor general

    equilibrium model, Journal of Finance, 47 12591282.Rebonato, R. (1996) Interest-Rate Option Models (Wiley, New York).Ritchken, P. and Sankarasubramanian, L. (1995) Volatility structures of forward rates and the dynamics of the

    term structure, Mathematical Finance, 5 5572.

    Markov interest rate models 247

  • Appendix

    A Connection with HJM

    Here we show that the random process followed by the forward rate

    F(t ; T ) f (t, X (t); T ) f0(T ) A 9 (T ) - ST (t, X (t); T )=S(t, X (t); T ) (A:1)satises the forward rate drift restriction of HJM.The backward Kolmogorov equation for (2.6b) implies that S(t, x; T ) satises the PDE

    S t Xi

    l i(t, x)Sxi 12

    Xij

    rij(t)r i(t, x)r j(t, x)Sxi x j 0 0 , t , T (A:2a)

    S(T , x; T ) e - h(T ,x) (A:2b)Applying Itos lemma to (A.1), and using (A.2) to simplify the result, then yields

    dF(t; T ) 12

    Xij

    rij(t)r i(t, X )r j(t, X )(Sxi Sx j=S2)T dt -

    Xi

    r i(t, X )(Sxi=S )T dbW i(t) (A:3)Equation A.3 appears to violate the forward drift restriction of HJM (1992) theory. However, this

    restriction is phrased in terms of the risk-neutral probability measure in the money marketnumeraire. To switch to the money market numeraire, we apply the backwards Kolmogorov equationto the valuation formula

    V (t, x) eh( t,x)A( t)bE V (T , X (T ))D(t, T ) e - h(T ,X(T )) - A(T )TtC(t 9 , X (t 9 ))D(t, t 9 ) e - h(t 9 ,X( t 9 )) - A( t 9 ) dt 9 X (t) x (A:4)

    This shows that the value of a marketable instrument expressed in units of the numeraire,

    bV (t, x) V (t, x) e - H( t,x) V (t, x) e - t0 f0(T 9 ) dT 9 - h( t,x) - A( t) (A:5a)satises the partial differential equation

    bV t Xi

    l i(t, x)bVxi 12Xij rij(t)r i(t, x)r j(t, x)bVxix j - C(t, x)e - t0f0(T 9 ) dT 9 - h( t,x) - A( t)

    (A:5b)Consequently, V (t, x) satises the partial differential equation

    V t Xi

    ~l i(t, x)Vxi 12

    Xij

    rij(t)r i(t, x)r j(t, x)Vxix j - r(t, x)V - C(t, x) (A:6a)

    where

    ~l i(t, x) l i(t, x) - r i(t, x)Xj

    rij(t)r j(t, x)hx j(t, x) (A:6b)

    248 Hagan and Woodward

  • and

    r(t, x) f0(t) A9 (t) h t(t, x)Xi

    l i(t, x)hxi (t, x)

    12

    Xij

    rij(t)r i(t, x)r j(t, x)[hxix j (t, x) - hxi (t, x)hx j (t, x)]: (A:6c)

    Expression (A.6c) for the short rate appears to be different from (2.10a). We can show that thesetwo expressions are equivalent by applying Itos lemma to e - h(T ,X (T )) in (2.6b), taking the expectedvalue, and letting T ! t. This yields

    -ST (t, x; t)S(t, x; t)

    h t Xi

    l i(t, x)hxi 12

    Xij

    rij(t)r i(t, x)r j(t, x)[hxi x j - hxi hx j ] (A:7a)

    as is needed for (A.6c) to agree with (2.10a). For future reference note that

    -Sxi (t, x; t)S(t, x; t)

    hxi (t, x) (A:7b)

    Dene the money market numeraire B(t) by

    dB(t)B(t)

    r(t, X (t)) dt (A:8)

    Inspection of (A.6a) reveals that it is the backwards Kolmogorov equation for

    V (t, x) eE B(t)B(T )

    V (T , X (T ))Tt

    B(t)B(t 9 )

    C(t 9 , X (t 9 )) dt 9 X (t) x( )

    (A:9)

    where eE refers to the expected value in a probability measure for whichdX i(t) ~l i(t, X (t)) dt r i(t, X (t)) deW i(t) i 1, 2, . . . , (A:10a)

    Here eW1(t), eW2(t), . . . , eW n(t) are Brownian motions with the same correlation structure asbefore:

    d eW i(t)d eW j(t) rij(t)dt (A:10b)Clearly, (A.9) is the valuation formula using the money market as the numeraire.Comparing (A.10) with (2.2) shows that changing from N (t, x) to the money market numeraire is

    equivalent to using Girsanovs theorem to change the probability measure so that

    d eW i(t) dbW i(t)Xj

    rij(t)r j( t, X )hx j (t, X )dt i 1, 2, . . . , n, (A:11)

    are n Brownian motions with the same correlation structure as before. Substituting (A.11) intothe forward rate process (A.3) now yields

    dF(t, T )Xij

    rij(t)aiT (t, X ; T )aj(t, X ; T )dt -

    Xi

    aiT (t, X ; T )deW i(t) (A:12a)

    Markov interest rate models 249

  • with

    a i(t, x; T ) r i(t, x) Sxi(t, x; T )S(t, x; T ) -Sxi (t, x; t)S(t, x; t)

    (A:12b)

    where we have used (A.7b) to replace hxi (t, x). Inspection of the drift term in (A.12a) shows thatthe forward rate process satises the HJM restriction. See Equation 18 of HJM (1992).

    B Short rate models

    Arbitrage free short rate models are expressed in the money market numeraire,

    V (t, r) eE e - Tt R(t 9 ) dt9 V (T , R(T )) TtC(t 9 , R(t 9 )) e -

    t 9tR( t 0 ) dt 0 dt 9 R(t) r

    ( )(B:1a)

    and assume that the risk neutral process for the short rate R(t) is of the form

    dR(t) [ h (t) l (t, R)] dt s(t, R)deW (t) (B:1b)under this numeraire. Here the function h (t) must be selected to match the initial discount curve.With some ingenuity, h (t) can be found analytically for some models; otherwise a forwardinduction scheme can be used (Jamshidian, 1991).

    B.1. Recasting short rate models

    We rst recast these models in the framework of Section 2 by choosing a zero coupon bond asthe numeraire. Equations B.1a and B.1b imply that V (t, r) satises

    V t [ h (t) l (t, r)]V r 12s2(t, r)V rr - rV - C(t, r) (B:2)Let us select a maturity T ref , and dene the zero coupon bond

    Z(t, r; T ref ) eEfe - Treft R( t 9 ) dt9 j R(t) rg (B:3)Then Z solves the partial differential equation

    Z t [h (t) l (t, r)]Z r 12s2(t, r)Z rr - rZ 0 t , T ref (B:4a)Z(T ref , r; T ref ) 1 (B:4b)

    Suppose we denominate the value V (t, r) of tradable instruments in units of Z(t, r; T ref ),

    bV (t, r) V (t, r)Z(t, r; T ref )

    (B:5)

    Substituting (B.5) into (B.2) then yields

    bV t [ h (t) ^l (t, r)]bV r 12s2(t, r)bV rr - C(t, r)Z(t, r; T ref ) (B:6awhere

    250 Hagan and Woodward

  • ^l (t, r) l (t, r) s2(t, r) Z r(t, r; T ref )Z(t, r; T ref )

    (B:6b)

    This is equivalent to

    V (t, r) Z(t, r; T ref )bE V (T , R(T ))Z(T , R(T ); T ref )Tt

    C(t 9 , R(t 9 ))Z(t 9 , R(t 9 ); T ref )

    dt 9 R(t) r( )

    (B:7a)

    where R(t) evolves according to

    dR(t) [ h (t) ^l (t, r)] dt s(t, r)d bW (t) (B:7b)as can be seen by applying the backwards Kolmogorov equation to (B.7a). Note that thisargument represents an extension of Jamshidians (1991) Brownian path independent models.

    B.2 Recasting one-factor models as short rate models

    We now consider the general class of one factor models,

    dX (t) l (t, X (t)) dt r (t, X (t)) d bW (t) (B:8a)under the numeraire

    N (t, x) 1D(0, t)

    eh( t,x)A( t) (B:8b)

    To match the initial term-structure, A(t) must be chosen as

    A(t) log bEfe - h( t,X (t)) j X (0) 0g (B:8c)Once this is done, the short rate is given by

    r(t, x) f0(t) A9 (t) h t(t, x) l (t, x)hx(t, x) 12r 2(t, x)[hxx(t, x) - h2x(t, x)] (B:9)See (A.6c).To re-write this as a short-rate model requires switching to the money market numeraire. Using

    (A.11), the risk-neutral process for X (t) is

    dX (t) [ l (t, X ) - r 2(t, X )hx(t, X )] dt r (t, X )d eW (t): (B:10)under the money market numeraire. Applying Itos lemma to (B.9) now shows that the processfor the short rate, R(t) r(t, X (t)), is

    dR(t) [rt ( l - r 2hx )rx 12r 2rxx ]dt r rx d eW (t): (B:11)We see that expressing an arbitrary one-factor model (B.8a)(B.8c) as a short rate model requires

    inverting the relationship R r(t, x) given in (B.9) to obtain x x(t, R), and then substituting thisinto (B.11) to obtain the drift and volatility terms as functions of t and R. This is a tedious (andneedless) process, but it simplies signicantly for several classes of models. For example, considerthe class of models examined in Section 3. For these models l (t, x) 0 and h(t, x) k (t)x, so thecorresponding short rate models are

    Markov interest rate models 251

  • dR(t) [rt - k (t)r 2(t, x)rx 12r 2(t, x)rxx] dt r (t, x)rx d eW (t) (B:12a)where

    r(t, x) f0(t) A9 (t) k 9 (t)x - 12k 2(t)r 2(t, x) (B:12b)

    C Special exponents

    The b g models are dened by the numeraire

    N (t, x) 1D(0, t)

    ek ( t)xA( t) (C:1a)

    and the process

    dX (t) a (t)[1 b (t)X (t)]g dbW (t) (C:1b)where A(t) M (0, 0; T ). Here k (t), a (t), and b (t) are arbitrary model parameters that can beused to calibrate the model to the prices of market instruments. We now consider the exponentsg 0, g 1=2, and g 1 and solve for

    M (t, x; T ) log bEfe - k (T )[X (T ) - x] j X (t) xg (C:2)The value Z(t, x; T ) of zero coupon bonds, the instantaneous forward rates f (t, x; T ), and theshort rate r(t, x) can then be read off from (3.8) and (3.10).

    C.1. Extended Hull-White ( g 0)When g 0, the transition density p(t, x; t , x ) is Gaussian,

    p(t, x; t , x ) e- ( x - x)2=2( s - s )

    [2p ( s - s )]1=2(C:3)

    where

    s t0a 2(t 9 )dt 9 s

    t0a 2(t 9 )dt 9 (C:4)

    Integrating to compute the expected value in (C.2) yields

    M (t, x; T ) 12k 2(T )( s - s ) A(T ) 12k 2(T ) s (C:5)To show that this is identical to the extended HullWhite model, note that (3.10) yields

    r(t, x) f0(t) k 9 (t)x k 9 (t)k (t)s (t) (C:6)Equation B.12 now shows that the equivalent short rate model is

    dR(t) [ h (t) - l (t)R(t)] dt s(t)d eW (t) (C:7a)under the money market numeraire, where

    252 Hagan and Woodward

  • l (t) - k 0 (t)= k 9 (t) s(t) k 9 (t)a (t) h (t) f 90(t) l (t) f0(t) [ k 9 (t)]2 s (t) (C:7b)The extended HullWhite model is usually expressed as the short rate process (C.7a), where l (t)

    and s(t) are arbitrary model parameters and h (t) is chosen as

    h (t) f 90(t) l (t) f0(t) t0s2(t 9 ) e - 2

    tt 9l ( t 0 ) dt 0 dt 9 (C:8)

    to match the initial discount curve. Note that mean reversion is expressed through the parameterl (t) in the extended HullWhite model, while it is expressed through the gearing factor k (t) inthe equivalent b g model.

    C.2. CIR-like models ( g 12 )When g 12 the state variable obeys

    dX (t) a (t)[1 b (t)X (t)]1=2 d bW (t) (C:9)Consequently, the transition density p(t, x; t , x ) satises the forward Kolmogorov equation

    p t 12a 2( t )([1 b ( t )x ]p) x x t . t (C:10a)p! d (x - x) as t! t (C:10b)

    To obtain M (t, x; T ) we take the two-sided Laplace transform

    P(t, x; t , k )1- 1

    e - k x p(t, x; t , x )dx (C:11)

    which yields

    P t 12a 2( t )k 2fP - b ( t )Pk g t . t (C:12a)P e - k x at t t (C:12b)

    This is a rst-order hyperbolic equation which can be solved by the method of characteristics.We obtain

    log P(t, x; t , k ) - k x k 2 x[B( t ) - B(t)]2 k [B( t ) - B(t)]

    tt

    2a 2(t 9 )dt 9(2 k [B( t ) - B(t 9 )])2

    ( )(C:13a)

    where

    B(t) t0a 2(t 9 )b (t 9 )dt 9 (C:13b)

    Consequently, M (t, x; T ) and A(T ) are

    M (t, x; T ) k 2(T ) x[B(T ) - B(t)]2 k (T )[B(T ) - B(t)]

    Tt

    2a 2(t 9 )dt 9(2 k (T )[B(T ) - B(t 9 )])2

    ( )(C:14a)

    Markov interest rate models 253

  • A(T ) k 2(T )T0

    2a 2(t 9 )dt 9(2 k (T )[B(T ) - B(t 9 )])2 (C:14:b)

    The values of zero coupon bonds, the forward rate curve, and the short rate can now be obtaineddirectly from (3.8) and (3.10). In particular, the short rate is

    r(t, x) f0(t) K(t)[x C(t)] (C:15a)with

    K(t) k 9 (t) - 12k 2(t)a 2(t)b (t) C(t) k (t) t0

    a 2(t 9 )dt 9(1 k (t)[B(t) - B(t 9 )]=2)3 (C:15b)

    Let us cast this model as the equivalent short rate model. Using (B.12) shows that this model is

    dR(t) [ h (t) - l (t)R(t)] dt s(t)

    d(t) R(t)

    pdeW (t) (C:16a)

    under the money market numeraire, where

    l (t) - [K 9 (t) - k (t)a 2(t)b (t)]=K(t) (C:16:b)h (t) f 90(t) l (t) f0(t) K(t)C 9 (t) - k (t)a 2(t)K(t)[1 - b (t)C(t)] (C:16c)s(t) a (t)

    K(t)b (t)

    pd(t) K(t)= b (t) - f0(t) - K(t)C(t) (C:16d)

    The extended CIR model is usually expressed as

    dR(t) [ h (t) - l (t)R(t)] dt s(t)

    R(t)

    pd eW (t) (C:17)

    Here l (t) and s(t) are arbitrary model parameters, and h (t) must be chosen to t the initial termstructure. To state this requirement explicitly, recall that the value of a zero coupon bond is

    Z(t, r; T ) eEfe - Tt R( t9 ) dt 9 j R(t) rg (C:18)Equivalently, Z(t, r; T ) is the solution of the backwards equation

    Z t [h (t) - l (t)r]Z r 12s2(t)rZ rr - rZ 0 t , T (C:19a)with

    Z(T , r; T ) 1 (C:19b)The solution of (C.19) is

    Z(t, r; T ) e - B(t,T )r - A( t,T ) (C:20a)where B(t, T ) satises the Ricatti equation

    B t l (t)B 12s2(t)B2 - 1 t , T (C:20b)with the boundary condition B(T , T ) 0, and

    A(t, T ) Tth (t 9 )B(t 9 , T )dt 9 (C:20c)

    254 Hagan and Woodward

  • Matching the initial discount curve requires Z(0, 0; T ) D(0, T ) e - T0

    f0(T 9 ) dT 9 , which requiresh (t) to be chosen so thatT

    0h (t 9 )B(t 9 , T )dt 9

    T0f0(T 9 )dT 9 - B(0, T ) f0(0) (C:21)

    We see that matching the extended CIR model to the initial discount curve requires solving aRicatti equation to obtain the zero coupon bond values. This can be done efciently, but it isdifcult to understand the qualitative impact of yield curve shifts on pricing.Comparing (C.16) and (C.17) shows that the extended CIR model is a special case of the b g

    model with g 1=2 and with b (t) chosen so that d(t) 0. Unsurprisingly, making this choice ofb (t) also requires solving a Ricatti equation.A simpler alternative is to use the g 1=2 model without requiring d(t) 0. The g 1=2 model

    then gives the zero coupon bond values directly, without solving a Ricatti equation. In addition, b (t)can be used to match the implied volatility smile by calibrating the model to the prices of off-market instruments. Although the CIR model is more aesthetically appealing, with its natural barrieroccuring at R(t) 0 and its simple relation between the volatility and the short rate, we stress thatneither model can be expected to be valid near R(t) 0. Since the barrier in the g 1=2 model isirrelevant for pricing commonly traded US instruments, it seems better to use b (t) to match theimplied volatility smile.

    C.3. BlackKarasinski-like models (g 1)When g 1 we have been unable to nd the Laplace transform of the transition density, andthus M (t, x; T ). However, we can nd the moments mk (t, x; T ) explicitly, and then use theexpansions

    M (t, x; T ) 12k 2(T )m2(t, x; T ) - 16k 3(T )m3(t, x; T ) . . . (C:22a)A(T ) 12k 2(T )m2(0, 0; T ) - 16k 3(T )m3(0, 0; T ) . . . (C:22b)

    Equations 3.8 and 3.10 then determine the zero coupon values, the forward rate curve, and theshort rate as before. Although these expressions are only approximate, using the rst termsufces to price all but the most sensitive instruments.With the exponent g 1,

    dX (T ) a (T )[1 b (T )X (T )] d bW (T ) (C:23)and applying Itos lemma yields

    d[X (T ) - x]k 12k(k - 1)a 2(T )[1 b (T )X ]2[X - x]k - 2 dT

    k a (T )[1 b (T )X ][X - x]k - 1 d bW (T ) (C:24)Taking the expected value then yields equations for the moments mk (t, x; T ):

    dmkdT

    12k(k - 1)a 2(T )f(1 b x)2m k - 2 2b (1 b x)mk - 1 b 2mkg (C:25)

    Markov interest rate models 255

  • See (3.11). Clearly at T t we have m0(t, x; t) 1 and mk (t, x; t) 0 for all k . 1. Solvingthese equations successively yields

    m0(t, x; T ) 1 m1(t, x; T ) 0 (C:26a)

    m2(t, x; T ) Tta 2(t 9 )[1 b (t 9 )x]2exp

    Tt 9a 2(T 9 )b 2(T 9 )dT 9

    ( )dt 9 (C:26b)

    m3(t, x; T ) 6Tt

    t 9ta 2(t 9 )a 2(t 0 )b (t 9 )[1 b (t 9 )x][1 b (t 0 )x]2

    . exp 3Tt 9a 2(T 9 )b 2(T 9 )dT 9

    t 9t 0a 2(T 9 )b 2(T 9 ) dT 9

    ( )dt 0 dt 9 (C:26c)

    As earlier, Equation B.12 can be used to obtain the equivalent short rate model. Even though thedistribution of X (t) is roughly lognormal, the short rate model is not the BlackKarasinski modelregardless of how b (t) is chosen. The BlackKarasinski model requires adding a drift term to thestochastic differential equation for X (t).

    D Rational interest rate models

    Consider the interest rate model dened by the valuation formula

    V (t, x) N (t, x)bE V (T , X (T ))N (T , X (T ))

    Tt

    C(t 9 , X(t 9 ))N (t 9 , X (t 9 ))

    dt 9 X (t) x( )

    (D:1a)

    with the numeraire

    N (t, x) 1D(0, t) b(t) . x (D:1b)

    and assume that the state variables X (t) evolve according to

    dX i(t) r i(t, X )d bW i(t) X i(0) 0 i 1, 2, . . . , n (D:1c)under this numeraire in the risk-neutral world. Noting that X (t) is a Martingale, we nd that thezero coupon bond prices are

    Z(t, x; T ) bE D(0, T ) b(T ) . X (T )D(0, t) b(t) . x X (t) x

    ( ) D(0, T ) b(T )

    . xD(0, t) b(t) . x (D:2)

    Consequently, the instantaneous forward rate curve is

    f (t, x; T ) - DT (0, T ) b9 (T ). x

    D(0, T ) b(T ) . x (D:3)

    Flesaker and Hughstons (1996) rational lognormal model is the special caser i(t, x) a i(t)(1 xi). These models have the advantage that all forward rates f (t, x; T ) areguaranteed to remain positive provided that b 9i(T ) < 0 for each i and DT (0, T ) ,

    Pb 9i(T ) for all T.

    256 Hagan and Woodward

  • Moreover, the one-factor version of this model has closed-form prices for swaptions and caplets(Flesaker and Hughston, 1996).

    E Separable models

    The general n-factor HJM model for the forward rate is

    dF(t, x (t); T )Xij

    rij(t)aiT (t, x (t); T )aj(t, x (t); T )dt -

    Xi

    a iT (t, x (t); T )dfW i(t) (E:1)in the risk-neutral world under the money market numeraire:

    V (t) eE e - Tt R( t 0 ,x 0 ) dt 0 V (T , x (T )) TtC(t 9 , x (t 9 )) e -

    t 9tR( t 0 ,x 0 ) dt 0 dt 9 F t

    ( )(E:2)

    Here x (t) indicates that the volatilities aiT , payoffs, and cash ows can depend on all measurableevents that can be resolved by time t.The separable models (Li et al., 1995; Ritchken and Sankarasubramanian, 1995; Cheyette, 1992)

    are derived by presuming that the volatilities have the form

    a iT (t, x (t); T ) k 9i(T )r i(t, x (t)) i 1, 2, . . . , n (E:3a)so that

    a i(t, x (t); T ) [ k i(T ) - k i(t)]r i(t, x (t)) i 1, 2 . . . , n (E:3b)Then the stochastic process for the forward rates becomes

    dF(t, x (t); T )Xij

    k 9i(T )[k j(T ) - k j(t)]rij(t)r i(t, x (t))r j(t, x (t)) dt

    -Xi

    k 9i(T )r i(t, x (t)) deW i(t) (E:4)Integrating over t now yields

    F(t, x ; T ) f0(T )Xi

    k 9i(T )X i(t, x )Xij

    k 9i(T )k j(T )Yij(t, x ) (E:5)

    where

    X i(t, x ) - t0r i(t 9 , x 9 )d eW i(t 9 ) - X

    j

    t0k j(t 9 )rij(t 9 )r i(t 9 , x 9 )r j(t 9 , x 9 )dt 9 i 1, . . . , n

    (E:6a)

    Yij(t, x ) t0rij(t 9 )r i(t 9 , x 9 )r j(t 9 , x 9 )dt 9 i, j 1, . . . , n (E:6b)

    One now assumes that the coefcients r i(t, x (t)) depend on the path taken by the Brownian

    Markov interest rate models 257

  • motion only through the value of the state variables X (t), Y (t) at time t. This is realistic since (E.5)expresses the entire term structure in terms of these variables. With this assumption, (E.6) becomes

    dX i(t) - r i(t, X , Y ) d eW i(t)Xj

    k j(t)rij(t)r j(t, X , Y )dt( )

    X i(0) 0 i 1, . . . , n

    (E:7a)

    dYij(t) rij(t)r i(t, X , Y )r j(t, X , Y )dt Yij(0) 0 i, j 1, . . . , n (E:7b)and the instantaneous forward rate curve is

    f (t, x, y; T ) f0(T )Xi

    k 9i(T )xi Xij

    k 9i(T )k j(T )yij (E:8)

    where xi X i(t) and yij Yij(t) are the values of the state variables at time t.Under the money market numeraire, the general separable model is dened by the forward rate

    curve (E.8) and the stochastic processes (E.7). We can put this model into the context of Section 2by switching to the numeraire

    N (t, x, y) 1D(0, t)

    expXi

    k i(t)xi 12Xij

    k i(t)k j(t)yij

    ( )(E:9)

    Under this numeraire the value of a tradable instrument is

    V (t, x, y)

    N (t, x, y)bE V (T , X (T ), Y (T ))N (T , X (T ), Y (T ))

    Tt

    C(t 9 , X (t 9 ), Y (t 9 ))N (t 9 , X (t 9 ), Y (t 9 ))

    dt 9 X (t) x, Y (t) y( )

    (E:10)

    Reversing the steps in Appendix A, we nd that the state variables evolve according to

    dX i(t) - r i(t, X , Y )dbW i(t) X i(0) 0 i 1, . . . , n (E:11a)dYij(t) rij(t)r i(t, X , Y )r j(t, X , Y )dt Yij(0) 0 i, j 1, . . . , n (E:11b)

    under the new numeraire. Finally, integrating the forward rate curve (E.8) with respect to thematurity T yields the zero coupon bond prices:

    Z(t, x, y; T ) D(t, T )exp -Xi

    [k i(T ) - k i(t)]xi -12

    Xij

    [ k i(T )k j(T ) - k i(t)k j(t)]yij

    ( )(E:12)

    Note that Z(0, 0, 0; T ) D(0, T ), so the separable models are automatically consistent with theinitial term structure.Since Y ij(t) Y ji(t), the n-factor separable model generally requires n(n 3)=2 state variables.

    As the random processes Yij(t) are not being driven directly by Brownian motions, we can only viewthe separable models within the framework of Section 2 by considering (E.11) as a highlydegenerate n(n 3)=2 factor model.The zero coupon bond prices in (E.12) show that the extra state variables yij are needed to

    258 Hagan and Woodward

  • account for convexity effects in an arbitrage-free manner. Since convexity effects are quite small,and since (E.11b) shows that the Yij(t) are only driven indirectly by the stochastic processes dcW i(t),the distributions of Yij(t) are highly peaked about their means, which are near zero. Specically, asthe volatilities r i decrease, the means of the variables Yij(t) scale like r 2 t and the variances ofYij(t) scale like r 6t3. This suggests that the convexity effects can be accounted for by replacing thedistributions of Yij(t) by d -functions, selecting the position of the d -functions to make the theoryarbitrage free. Carrying this idea through leads to the arbitrage-free n-state variable model denedby the valuation formula

    V (t, x) N (t, x)bE V (T , X (T ))N (T , X (T ))

    Tt

    C(t 9 , X(t 9 ))N (t 9 , X (t 9 ))

    dt 9 X (t) x( )

    the numeraire

    N (t, x) 1D(0, t)

    ek ( t).xA( t)

    and the risk neutral processes

    dX i(t) - r i(t, X )d bW i(t) X i(0) 0 i 1, . . . , nunder this numeraire. Here A(T ) is dened by

    A(T ) log bEfe - k(T ).X(T )jX (0) 0gThis model is the multi-factor generalization of the one-factor models in Section 3. Following the

    analysis there shows that the zero coupon bond prices are

    Z(t, x; T ) D(t, T )e - [k(T ) - k( t)].x- A(T )A( t)M( t,x;T )

    where

    M (t, x; T ) log bEfe - k(T ).[X(T ) - x] j X (t) xgConsequently the instantaneous forward rates are

    f (t, x; T ) f0(T ) k9 (T ) . x A 9 (T ) - M T (t, x; T )Typically k 9i(T )r i is around 10 - 2 in US dollar markets. At these volatilities the n-state variable

    model (E.13) is virtually identical to the analogous separable model in (E.9)(E.11). Although then-state variable model has the disadvantage of requiring computation of M(t, x; T ) to obtain thezero coupon bond prices, this is usually outweighed by having many fewer state variables than theseparable models.

    F Approximate prices of European options

    As in Section 4.1, let te be the expiration date of a European option to receive the cash owsC i on dates t i, i 1, 2, . . . , n in return for paying the strike K on the settlement date ts. Alsodene

    Markov interest rate models 259

  • F0 Xi

    C iD(ts, t i ) s e

    te0a 2(t 9 )dt 9 (F:1)

    as before. To express the option prices succinctly, dene t0 by the relation t00a 2(t 9 )dt 9 s e=2 (F:2)

    dene x by the implicit relation

    12(F0 K )

    Xi

    C iD(ts, t i) e - [ k i - k s]x - AiAsM(t0,x ;t i) - M( t0,x ;t s ) (F:3)

    and dene K k by

    K k P

    iC iD(ts, t i)[k i - k s - M x(t0, x ; t i) M x(t0, x ; ts )]k e - k i x - A iM( t0,x ; t i )PiC iD(ts, t i) e - k i x - AiM( t0,x ;t i)

    (F:4)

    Then the implied price vol of the option is (Hagan and Woodward, in preparation)

    r B K 1( s e= te )1=2(1 b x )g 1 124(1 b x )2g s e[ K 21 -g (2 - g )b 2

    (1 b x )2 2K 3

    K 1- 3K 22K 21

    ]

    (

    (F0 - K )2

    6K 21(F0 K )2[2K 21 -

    g (1 g )b 2(1 b x )2

    3g b K 2(1 b x )K 1

    K 3K 1- 3K 22

    K 21] . . . (F:5)

    This expression yields prices that are usually accurate to within a tenth of a basis point, althoughthe error can be several times larger in unusual situations.

    260 Hagan and Woodward