Analysis of enzyme kinetic data for mtDNA replication

doi:10.1016/j.ymeth.2010.02.019

Methods

Volume 51, Issue 4, August 2010, Pages 385-391

https://doi.org/10.1016/j.ymeth.2010.02.019 Get rights and content

Abstract

A significant amount of experimental data on the reaction kinetics for the mitochondrial DNA polymerase gamma exist, but interpreting that data is difficult due to the complex nature of the function of the polymerase. In order to model how these measured kinetics values for polymerase gamma affect the final function of the polymerase, the replication of an entire strand of mitochondrial DNA, we implement a stochastic simulation of the series of reaction events that the polymerase carries out. These reactions include the correct and incorrect polymerization events, exonuclease events which may remove both incorrectly and correctly matched base pairs, and the disassociation of the polymerase from the mitochondrial DNA template. We also describe other reactions which may be included, such as the addition of nucleoside analog tri-phosphates as substrates. The simulation analysis of the kinetics data is implemented through a standard Gillespie algorithm. We describe the methods necessary to define, code and test this algorithm, as well as describing the hardware and software options that are available.

Introduction

Mitochondrial polymerase gamma is the sole DNA polymerase active in mitochondria [1] and is responsible for the replication of mitochondrial DNA (mtDNA). Vertebrate polymerase gamma is composed of two subunits: a catalytic core, Pol γ–α, that contains the DNA polymerase and 3′–5′ exonuclease activities, and an accessory subunit, Pol γ–β, which enhances catalytic activity and serves as a processivity factor during DNA synthesis [2].

The function of polymerase gamma is far more complicated than the function of a typical enzyme which converts a substrate into a product. For that reason, the analysis of the enzyme kinetics of polymerase gamma must also be far more complicated. The function of polymerase gamma is the replication of a complete mitochondrial DNA molecule, while the enzyme kinetics data for polymerase gamma are at the level of individual reactions at the base-pair level. One tool that can be used to bridge that large gap between the enzyme kinetics data and the final product of a replicated DNA molecule is simulation. A computational simulation can reproduce each individual reaction of the polymerase activity, at the lowest level for which we have experimental data on the relevant reaction kinetics of the polymerase. The simulation can tirelessly repeat this process for the several tens of thousands of reactions necessary for the replication of a single strand of human mtDNA. Since mutations introduced through polymerase errors are of great practical interest, we need to use a stochastic simulation approach that will allow such replication errors to occur with probabilities that are calculated from the experimentally measured enzyme kinetics.

The “Gillespie algorithm” (or “Stochastic Simulation Algorithm”) is a well-known Monte Carlo simulation method for chemical reactions [3]. This algorithm takes a defined list of possible reactions and uses the reaction rates, based on the measured enzyme kinetics data and the substrate concentrations, to calculate the probability of each reaction on the list. These probabilities are then used to randomly choose the sequence of reactions which occur. The original Gillespie algorithm is particularly useful for simulating reactions involving a small number of molecules, while more elaborate and approximate versions of the basic algorithm have been created in order to handle large numbers of molecules [4].

Traditional continuous and deterministic biochemical rate equations are modeled as a set of coupled ordinary differential equations but these methods rely on bulk reactions that require the interactions of millions of molecules. In contrast, the Gillespie algorithm allows a discrete and stochastic simulation of a system with few reactants because every individual reaction is explicitly simulated. The physical basis of the Gillespie algorithm is the collision of molecules within a reaction vessel where the reaction environment is assumed to be well mixed. The general Gillespie algorithm can be summarized by the following series of steps:

Step 1, Initialization: Initialize the number of molecules in the system, the reaction kinetics constants, and the random number generators.

Step 2, Calculate reaction probabilities from reaction rates: Based on the substrate concentrations and the enzyme kinetics data, reaction rates R_i are calculated for each possible reaction, where i is an index denoting the particular reaction from the reaction list. The probability P_i for each reaction is then calculated from the list of n reaction rates through the following equation: $P_{i} = R_{i} / R_{total}$

The parameter R_total is the sum of all of the reaction rates, as follows: $R_{total} = \sum_{j = 1}^{n} R_{j}$ Note that the sum of the probabilities is 1, with this definition.

Step 3, Choose a reaction: Use a pseudo-random number generator to generate a uniform random number r_reaction in the range [1] and use this random number to choose the one reaction that occurs from the list of n possible reactions. The reaction number j is chosen if it satisfies the following condition: $\sum_{i = 1}^{j - 1} P_{i} < r_{reaction} ⩽ \sum_{i = 1}^{j} P_{i}$

Then a second uniform random number r_time, also in the range [0,1] is chosen. This random number is used to set the time τ required for this reaction by the following equation: $τ = - (1 / R_{total}) \ln (r_{time})$ It is important to note here that the sum of all of the reaction rates is used in defining the time τ, not just the rate of the one particular reaction that was chosen.

Step 4, Update the variables: Increase the time by τ. Update the molecule count based on the reaction that occurred.

Step 5, Iterate the process: Check to see if any stopping condition has been met. If not, return to step 2 to recalculate the reaction rates, which may have changed due to the reaction chosen and carried out in steps 3–4.

These five steps define the basic Gillespie method for stochastic simulation. For applications of the stochastic simulation that involve large numbers of interacting molecules this basic method is inefficient in many ways, though it is an exact solution of the relevant stochastic master equation. In those cases more advanced and efficient methods should be used [4]. However, for modeling polymerase gamma activity we are generally concerned with only a single enzyme molecule, the one molecule of polymerase gamma that is replicating a particular strand of mtDNA. In this special case, the simple Gillespie algorithm outlined above is the preferred simulation method.

Section snippets

Applying the Gillespie algorithm to polymerase gamma

The Gillespie algorithm is fundamentally a way of analyzing the outcome of a set of defined reactions which are competing with each other. The heart of this method is the definition of the reaction list. As we stated above, the function of polymerase gamma is complicated, and the reaction list could conceivably be defined in many different ways. However, our purpose is the analysis of the measured enzyme kinetics data, and for that purpose the available kinetics data are the primary determinant

Michaelis–Menten kinetics for incorporation rates after a matched or an unmatched base pair

The reaction kinetics for polymerization reactions are given in Table 1, Table 2, Table 3, Table 4. The reaction rates for the polymerization are calculated using a standard Michaelis–Menten equation, using assumed values for the four dNTP substrate concentrations. The dNTP concentrations are the major parameters for the simulation, and the results of the simulation depend greatly on the pattern of these four concentrations.

Given the complexity of polymerase gamma activity, it should not be

Details of the experiments measuring the enzyme kinetics of polymerase gamma

One must take care to consider the idealized conditions under which the experiments measuring the enzyme kinetics of polymerase gamma were conducted. Differences between these idealized conditions and the real in vivo conditions must always be kept in mind. In this section we summarize the conditions of the experiments as described in references [6], [7], [8], [9], [14].

Problems

As the previous two sections illustrate, the primary problem with using a stochastic simulation to analyze the enzyme kinetics of polymerase gamma is the incomplete reaction kinetics data. The function of the polymerase, as represented in this simulation model, depends on the competition between different reactions which have different reaction rates. The outcome of that competition depends on the complete set of reaction rates. A change in the reaction rate of any single reaction alters the

Equipment

No specialized hardware is required for carrying out these simulations. We have run the simulation code on both a relatively standard desktop PC under LINUX with a 4400 Intel CPU (dual core, 2.0 GHz with 2.0 GB of RAM) and on a central server (Intel Xeon CPU, 2.33 GHz). Typically, we simulate the replication of a single strand of mtDNA and repeat that simulate 10,000 times in order to gather statistics on rare events, such as specific point mutations. On the central server a set of 10,000

Troubleshooting

It is necessary to have quantitative and qualitative predictions against which the simulation output can be compared. There are statistical properties which the output data of the simulation must have, based on the assumptions of the algorithm. These properties should be used as checks to see if the algorithm has been properly implemented.

Using the simulation to analyze polymerase gamma enzyme kinetics data

The process of the complete replication of a single strand of mitochondrial DNA by polymerase gamma is the result of tens of thousands of individual reactions. The exact sequence of reactions that occurs, the number and pattern of the point mutations introduced through polymerase errors, the number of exonuclease events and the number of disassociation events all have strong random components. The dependence of all of these important quantities on the basic measured kinetics of polymerase gamma

Acknowledgment

This work was supported by the National Institutes of Health through Grant GM073744.

References (24)

L. Fan et al.
Journal of Molecular Biology
(2006)
D.T. Gillespie
Journal of Computational Physics
(1976)
Y. Cao et al.
Methods in Enzymology: Computer Methods
(2009)
A.A. Johnson et al.
Journal of Biological Chemistry
(2001)
A.A. Johnson et al.
Journal of Biological Chemistry
(2001)
A.A. Johnson et al.
Journal of Biological Chemistry
(2001)
J. Kang et al.
Mitochondrion
(2008)
H.R. Lee et al.
Journal of Biological Chemistry
(2006)
J.W. Hanes et al.
Journal of Biological Chemistry
(2007)
L.S. Kaguni
Annual Review of Biochemistry
(2004)

S.S. Patel et al.

Biochemistry

(1991)

A.A. Johnson et al.

Biochemistry

(2000)

Cited by (1)

Replication pauses of the Wild-Type and mutant mitochondrial DNA polymerase gamma: A simulation study
2011, PLoS Computational Biology

View full text

Analysis of enzyme kinetic data for mtDNA replication

Abstract

Introduction

Section snippets

Applying the Gillespie algorithm to polymerase gamma

Michaelis–Menten kinetics for incorporation rates after a matched or an unmatched base pair

Details of the experiments measuring the enzyme kinetics of polymerase gamma

Problems

Equipment

Troubleshooting

Using the simulation to analyze polymerase gamma enzyme kinetics data

Acknowledgment

Journal of Molecular Biology

Journal of Computational Physics

Methods in Enzymology: Computer Methods

Journal of Biological Chemistry

Journal of Biological Chemistry

Journal of Biological Chemistry

Mitochondrion

Journal of Biological Chemistry

Journal of Biological Chemistry

Annual Review of Biochemistry

Biochemistry

Biochemistry