Detecting Adaptation in Protein-Coding Genes Using a Bayesian Site-Heterogeneous Mutation-Selection Codon Substitution Model

Bloom

JD.

2014

.

An experimentally informed evolutionary model improves phylogenetic fit to divergent lactamase homologs

.

Mol Biol Evol

.

31

:

2753

–

2769

.

Castresana

J.

2000

.

Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis

.

Mol Biol Evol

.

17

:

540

–

552

.

Clark

AG

Glanowski

S

Nielsen

R

Thomas

PD

Kejariwal

A

Todd

MA

Tanenbaum

DM

Civello

D

Lu

F

Murphy

B

, et al. .

2003

.

Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios

.

Science

302

:

1960

–

1963

.

Crespi

B

Summers

K.

2004

.

In defense of the cell: TRIM5alpha interception of mammalian retroviruses

.

Proc Natl Acad Sci U S A

.

101

:

10496

–

10497

.

Cutler

DJ.

2000

.

Understanding the overdispersed molecular clock

.

Genetics

154

:

1403

–

1417

.

Echave

J

Spielman

SJ

Wilke

CO.

2016

.

Causes of evolutionary rate variation among protein sites

.

Nat Rev Genet

.

17

:

109

–

121

.

Edgar

RC.

(

2004

).

MUSCLE: multiple sequence alignment with high accuracy and high throughput

.

Nucleic Acids Res

.

32

:

1792

–

1797

.

Eyre-Walker

A

Keightley

PD.

2009

.

Estimating the rate of adaptive molecular evolution in the presence of slightly deleterious mutations and population size change. Mol

Biol Evol

.

26

:

2097

–

2108

.

Galtier

N.

2016

.

Adaptive protein evolution in animals and the effective population size hypothesis

.

PLoS Genet

.

12

:

e1005774.

Goldman

N

Yang

Z.

1994

.

A codon-based model of nucleotide substitution for protein-coding DNA sequences

.

Mol Biol Evol

.

11

:

725

–

736

.

Gong

LI

Bloom

JD.

2014

.

Epistatically interacting substitutions are enriched during adaptive protein evolution

.

PLoS Genet

.

10

:

e1004328.

Guindon

S

Rodrigo

AG

Dyer

KA

Huelsenbeck

JP.

2004

.

Modeling the site-specific variation of selection patterns along lineages

.

Proc Natl Acad Sci U S A

.

101

:

12957

–

12962

.

Halligan

DL

Oliver

F

Eyre-Walker

A

Harr

B

Keightley

PD.

2010

.

Evidence for pervasive adaptive protein evolution in wild mice

.

PLoS Genet

.

6

:

e1000825.

Halpern

AL

Bruno

WJ.

1998

.

Evolutionary distances for protein-coding sequences: modeling site-specific residue frequencies

.

Mol Biol Evol

.

15

:

910

–

917

.

Holder

MT

Zwickl

DJ

Dessimoz

C.

2008

.

Evaluating the robustness of phylogenetic methods to among-site variability in substitution processes

.

Phil Trans R Soc B

363

:

4013

–

4021

.

Huelsenbeck

JP

Jain

S

Frost

SWD

Pond

SLK.

2006

.

A Dirichlet process model for detecting positive selection in protein-coding DNA sequences

.

Proc Natl Acad Sci U S A

.

103

:

6263

–

6268

.

Keightley

PD

Eyre-Walker

A.

2007

.

Joint inference of the distribution of fitness effects of deleterious mutations and population demography based on nucleotide polymorphism frequencies

.

Genetics

177

:

2251

–

2261

.

Keightley

PD

Eyre-Walker

A.

2010

.

What can we learn about the distribution of fitness effects of new mutations from DNA sequence data?

Philos Trans R Soc Lond B Biol Sci

.

365

:

1187

–

1193

.

Kimura

M.

1983

.

The neutral theory of molecular evolution

.

Cambridge

:

Cambridge University Press

.

Kosiol

C

Vinar

T

da Fonseca

RR

Hubisz

MJ

Bustamante

CD

Nielsen

R

Siepel

A.

2008

.

Patterns of positive selection in six Mammalian genomes

.

PLoS Genet

.

4

:

e1000144.

Laguette

N

Rahm

N

Sobhian

B

Chable-Bessia

J

Münch

C

Snoeck

J

Sauter

D

Switzer

WM

Heneine

W

Kirchhoff

F

, et al. .

2012

.

Evolutionary and functional analyses of the interaction between the myeloid restriction factor SAMHD1 and the lentiviral Vpx protein

.

Cell Host Microbe

11

:

205

–

217

.

Lartillot

N

Delsuc

F.

2012

.

Joint reconstruction of divergence times and life-history evolution in placental mammals using a phylogenetic covariance model

.

Evolution

66

:

1773

–

1787

.

Lartillot

N

Rodrigue

N

Stubbs

D

Richer

J.

2013

.

PhyloBayes-MPI: phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment

.

Syst Biol

.

62

:

611

–

615

.

Lee

K

KewalRamani

VN.

2004

.

In defense of the cell: TRIM5alpha interception of mammalian retroviruses

.

Proc Natl Acad Sci U S A.

101

:

10496

–

10497

.

Lunzer

M

Golding

GB

Dean

AM.

2010

.

Pervasive cryptic epistasis in molecular evolution

.

PLoS Genet

.

6

:

e1001162.

McCandlish

DM

Rajon

E

Shah

P

Ding

Y

Plotkin

JB.

2013

.

The role of epistasis in protein evolution

.

Nature

497

:E1–2, discussion E2–3.

McDonald

JH

Kreitman

M.

1991

.

Adaptive protein evolution at the Adh locus in Drosophila

.

Nature

351

:

652

–

654

.

Muse

SV

Gaut

BS.

1994

.

A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome

.

Mol Biol Evol

.

11

:

715

–

724

.

Mustonen

V

Lässig

M.

2009

.

From fitness landscapes to seascapes: non-equilibrium dynamics of selection and adaptation

.

Trends Genet

.

25

:

111

–

119

.

Nielsen

R

Yang

Z.

1998

.

Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene

.

Genetics

148

:

929

–

936

.

Pollock

DD

Thiltgen

G

Goldstein

RA.

2012

.

Amino acid coevolution induces an evolutionary Stokes shift

.

Proc Natl Acad Sci U S A

.

109

:

E1352

–

E1359

.

Pond

SLK

Murrell

B

Fourment

M

Frost

SDW

Delport

W

Scheffler

K.

2011

.

A random effects branch-site model for detecting episodic diversifying selection

.

Mol Biol Evol

.

28

:

3033

–

3043

.

Robinson

DM

Jones

DT

Kishino

H

Goldman

N

Thorne

JL.

2003

.

Protein evolution with dependence among codons due to tertiary structure

.

Mol Biol Evol

.

18

:

1692

–

1704

.

Rodrigue

N.

2008

.

Phylogenetic structural modeling of molecular evolution. Doctoral dissertation

,

Université de Montréal

,

Canada

.

Google Preview

Rodrigue

N.

2013

.

On the statistical interpretation of site-specific variables in phylogeny-based substitution models

.

Genetics

193

:

557

–

564

.

Rodrigue

N

Kleinman

CL

Philippe

H

Lartillot

N.

2009

.

Computational methods for evaluating phylogenetic models of coding sequence evolution with dependence between codon

.

Mol Biol Evol

.

26

:

1663

–

1676

.

Rodrigue

N

Lartillot

N.

2014

.

Site-heterogeneous mutation-selection models within the phylobayes-mpi package

.

Bioinformatics

30

:

1020

–

1021

.

Rodrigue

N

Philippe

H

Lartillot

N.

2010a

.

Mechanistic revisions of phenomenological modeling strategies in molecular evolution

.

Trends Genet

.

26

:

248

–

252

.

Rodrigue

N

Philippe

H

Lartillot

N.

2010b

.

Mutation-selection models of coding sequence evolution with site-heterogeneous amino acid fitness profiles

.

Proc Natl Acad Sci U S A

.

107

:

4629

–

4634

.

Sawyer

SA

Hartl

DL.

1992

.

Population genetics of polymorphism and divergence

.

Genetics

132

:

1161

–

1176

.

Sawyer

SA

Kulathinal

RJ

Bustamante

CD

Hartl

DL.

2003

.

Bayesian analysis suggests that most amino acid replacements in Drosophila are driven by positive selection

.

J Mol Evol

.

57 Suppl 1

:

S154

–

S164

.

Sawyer

SL

Emerman

M

Malik

HS.

2004

.

Ancient adaptive evolution of the primate antiviral DNA-editing enzyme APOBEC3G

.

PLoS Biol

.

2

:

E275.

Sawyer

SL

Wu

LI

Emerman

M

Malik

HS.

2005

.

Positive selection of primate TRIM5alpha identifies a critical species-specific retroviral restriction domain

.

Proc Natl Acad Sci U S A

.

102

:

2832

–

2837

.

Shah

P

McCandlish

DM

Plotkin

JB.

2015

.

Contingency and entrenchment in protein evolution under purifying selection

.

Proc Natl Acad Sci U S A

.

112

:

E3226

–

E3235

.

Spielman

SJ

Wilke

CO.

2015

.

The relationship between dN/dS and scaled selection coefficients

.

Mol Biol Evol

.

32

:

1097

–

1108

.

Tamuri

AU

Goldman

N

dos Reis

M.

2014

.

A penalized likelihood method for estimating the distribution of selection coefficients from phylogenetic data

.

Genetics

197

:

257

–

271

.

Tamuri

AU

dos Reis

M

Goldstein

RA.

2012

.

Estimating the distribution of selection coefficients from phylogenetic data using sitewise mutation-selection models

.

Genetics

190

:

1101

–

1115

.

Thorne

JL

Lartillot

N

Rodrigue

N

Choi

SC.

2012

. Codon models as a vehicle for reconciling population genetics with inter-specific sequence data. In:

Cannarozzi

GM

Schneider

A

, editors.

Codon evolution

.

Oxford

:

Oxford University Press

. p.

97

–

110

.

Weinreich

DM

Knies

JL.

2013

.

Fisher’s geometric model of adaptation meets the functional synthesis: data on pairwise epistasis for fitness yields insights into the shape and size of phenotype space

.

Evolution

67

:

2957

–

2972

.

Yang

Z.

2007

.

Paml 4: phylogenetic analysis by maximum likelihood

.

Mol Biol Evol

.

24

:

1586

–

1591

.

Yang

Z

Nielsen

R.

1998

.

Synonymous and nonsynonymous rate variation in nuclear genes of mammals

.

J Mol Evol

.

46

:

409

–

418

.

Yang

Z

Nielsen

R.

2002

.

Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. Mol

Biol Evol

.

19

:

908

–

917

.

Yang

Z

Nielsen

R.

2008

.

Mutation-selection models of codon substitution and their use to estimate selective strengths on codon usage

.

Mol Biol Evol

.

25

:

568

–

579

.

Yang

Z

Nielsen

R

Goldman

N

Pedersen

AM.

2000

.

Codon-substitution models for heterogeneous selection pressure at amino acid sites

.

Genetics

155

:

431

–

449

.