20
E S D T RB Tracy A. Heath University of Kansas University of California, Berkeley Iowa State University (in 2015) @trayc7 Sebastian Höhna University of California, Davis iEvoBio 2014 – Raleigh, NC

Estimating Species Divergence Times in RevBayes – iEvoBio 2014

Embed Size (px)

DESCRIPTION

Phylogenetic analyses of macroevolutionary processes require estimates of species divergence times. Critically, this requires a framework for modeling lineage-specific substitution rates and speciation times while accounting for uncertainty in the tree topology. Bayesian inference methods are well suited to such analyses. However, implementations of these methods have historically been limited by the available models and priors in each program. RevBayes is a new statistical programming environment that provides a flexible framework for phylogenetic inference. We have implemented phylogenetic inference under a diverse set of relaxed-clock and branching-process models in RevBayes. The user specifies the model and analysis details in Rev -- an interpreted programming language based on R. I will present the theory behind the implementation of phylogenetic models in RevBayes that gives the software its flexibility and show the results of empirical analyses.

Citation preview

Page 1: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

E S D T

RB

Tracy A. HeathUniversity of Kansas

University of California, BerkeleyIowa State University (in 2015)

@trayc7

Sebastian HöhnaUniversity of California, Davis

iEvoBio 2014 – Raleigh, NC

Page 2: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

C S P

Prior options in MrBayes v3.2

Page 3: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

C S P

Prior options in MrBayes v3.2

Page 4: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

M B P S

Several software packages inphylogenetics are moving toward amore modular framework

• reuse code

• easier to extend existing modelsand implement new modelsthrough a rich, language-basedinterface

• provides a unified framework foranalyses under complex models

RevBayes

Bali-Phy

BEAST2

Page 5: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

RB

Fully integrative Bayesian inference ofphylogenetic parameters

http://sourceforge.net/projects/revbayes/

Development team

Höhna

Landis Lartillot Ronquist

Boussau Heath Huelsenbeck

& others...

Page 6: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

G M RB

Graphical models provide tools forvisually & computationally representingcomplex, parameter-rich probabilisticmodels

We can depict the conditionaldependence structure of variousparameters and other random variables

Höhna, Heath, Boussau, Landis, Ronquist, Huelsenbeck. 2014.Probabilistic Graphical Model Representation in Phylogenetics.

Systematic Biology in press. (doi: 10.1093/sysbio/syu039)

Page 7: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

G M RBModels are represented by directed acyclic graphs (DAGs)

Page 8: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

G M RBModels are represented by directed acyclic graphs (DAGs)

Page 9: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

G M RBModels are represented by directed acyclic graphs (DAGs)

Page 10: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

G M RB

A phylogenetic graphical model:GTR, birth-death process, global molecular clock

Page 11: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

C M RB

Page 12: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

C M RB

Page 13: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

C M RB

Single Rate Model (Zuckerkandl & Pauling, 1962)

Page 14: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

C M RB

Uncorrelated Lognormal Branch-Rate Model (Drummond et al. 2006)

Page 15: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

C M RB

Dirichlet Process Prior on Branch Rates (Heath, Holder, Huelsenbeck 2012)

Page 16: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

C M RB

Autocorrelated Lognormal Branch-Rate Model (Thorne, Kishino & Painter 1998; Kishino, Thorne & Bruno 2001)

Page 17: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

C M RB

Autocorrelated Lognormal Branch-Rate Model — with a Dirichlet Process Prior on the Variance

Page 18: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

T R LAn R-like language for specifying models and MCMC

# An example (partial)

# read the data

D <- readCharacterData("data/primates.nex")[1]

# lognormal prior on strict clock rate

theta <- 0.5

sigma <- 0.75

mu ∼ exponential(theta)

clock_rate ∼ lognormal(mu, sigma)

RealPos branch_rates

for (i in 1:n) {

branch_rates[i] := clock_rate

}

# Moves on model parameters

moves[1] <- mScale(mu, 0.5, true, 5.0)

moves[2] <- mScale(clock_rate, 1.0, true, 2.0)

. . .

Page 19: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

R-C I RB

Graphical-model framework allowsfor implementation of complexmodels

Test and develop new proposalmechanisms for models withcorrelated parameters

Joint estimation of allphylogenetic parameters

Simulation under any definedmodel

1.0 0.00.250.50.75

Relative time

A

B

C

D

E

F

G

H

I

J

true age

age 95% CI

DPP Relative Time Tree

Page 20: Estimating Species Divergence Times in RevBayes – iEvoBio 2014

ATrilogenetics: Likelihood Methods for Phylogenetics ofFossil Taxa: http://phylo.bio.ku.edu/fossil/

Thanks to:

• Michael Landis• Bastien Boussau• Johan Dunfalk• Chi Zhang• John Huelsenbeck

• Fredrik Ronquist• Nicolas Lartillot• Mark Holder• Tanja Stadler• Brian Moore

Support:

• NSF DEB-1256993• NIH GM-069801 & GM-086887• NESCent Academy (http://bit.ly/P11bhy)

Twitter: @trayc7 Slides: www.slideshare.net/trayc7

Thanks!