Transient and sustained elementary flux mode networks on a catalytic string-based chemical evolution model
Graphical abstract
Introduction
Our planet begun as a “fire ball”, too hot and unstable to sustain life. Since only non-biologic substances were present after the Earth's cool down there had to be a chemical evolution, within the fundamental natural laws, for life to have emerged from those initial molecular conditions. Prebiotic chemical evolution is commonly accepted, almost by default, as the first step in the process by which life originated on Earth but, concerning the path from early molecular pools to the contemporary biological cell, the only fact known for sure is where this path has ended. The biological cell's main molecular and structural features, experimentally established in the 20th century, remain the main source of inspiration for theoretical and experimental research on chemical and molecular evolution.
Considering the prebiotic evolution models proposed until now, a main dichotomy between “genetics-first” and “metabolism-first” models is apparent (Orgel, 1998, Davies, 2001, Mann, 2013). The former are based on the remarkable properties of RNA polymers that can act as catalysts as well as information storage. Although the RNA-world works, and is a base to model further steps of chemical evolution towards a living cell, it remains to be explained how relatively complex molecules like the ribonucleotides were synthesized on the prebiotic world without the help of a previous metabolism that can produce them from the much simpler molecules available (Orgel, 2004).
The “metabolism-first” models, on the other hand, prescribe reaction networks of catalytic polymers that start with the simplest compounds as the food set to produce a variety of larger molecules. In the absence of genetic support, its pathways would have to endure and evolve without an information storage system for the catalysts. On the experimental front, there is evidence for the prominent prebiotic role of amino acids and peptides since the Miller experiment, following previous hypothesis of Oparin and Urey (Miller, 1953). The plausibility of prebiotic protometabolic peptide cycles, of activated polymerisation and degradation, was concluded in a recent review (Danger et al., 2012). A prebiotic pathway for oligopeptide synthesis from CO2 and N2/NO3−/NH3, involving metal cations crucial to biological chemistry, was also proposed (Pratt, 2011). Peptides are known to catalyse the condensation of other amino acids, peptides and peptide-nucleic acids hybrids (Gorlero et al., 2009). It was recently found that the peptide Ser-His is able to catalyse the formation of RNA-phosphodiester bonds in water–ice phases (Wieczorek et al., 2013). Non-peptidic “worlds” have also been proposed at the onset of chemical evolution. Metal ions are known for their catalytic properties and were proposed as main catalytic and redox agents for the autocatalytic synthesis of small molecular weight non-hydrolysable carbon compounds from dissolved gases in hydrothermal conditions (Wächtershäuser, 2007).
This and other data suggest a mutating prebiotic polymer world, catalysing the formation of other polymers, like peptides or nucleic acids but also of other kinds of carbon polymers. Metabolism and genetics evolutionary models are complementary because together they span evolution from the simplest to the most complex biological molecules, supporting each other, but it seems more likely that a genetic world is in need of a metabolism than the other way around (Norris et al., 2012, Bernharth, 2012).
Regarding theoretical approaches, the characterization of the living cell as a self-reproducible (autopoietic) autonomous organization is a reference point, expanded by formalizations of the biological organization (Eigen, 1971, Varela et al., 1974, Fontana and Buss, 1994, Rasmussen et al., 2001, Gánti, 2003). One of the main theoretical strategies has been to develop models that achieve sustained agent organizations upon simulation, starting from disorganised systems.
A crucial element of the cell's autopoietic organization are chemical cycles, notably self-catalysed cycles that dissipate external chemical potential by working in non-equilibrium conditions (Eigen and Schuster, 1977). Recently, Kreyssig et al. demonstrated, using chemical organization theory methods, how the existence of cycles is a necessary condition in non-trivial stationary states in reaction networks (Kreyssig et al., 2012). Autocatalysis is the property of a self-catalysed cycle, that is, a cycle that produces its own catalysts, and, naturally, it is also an important aspect of chemical evolution models (Hordijk et al., 2010). Autocatalysis is not a sufficient condition for life but it is nonetheless a necessary condition (Gánti, 2003; Ruiz-Mirazo and Moreno, 2004). It is the main mechanism by which the cell's autopoietic organization is achieved and regulated, nucleic acids included, and it is a property to look for in chemical evolution model simulations because it assures the maintenance and reproduction of the reaction set.
Chemical evolution models aim to simulate the evolution of quantity and quality of complex molecules, like polymers, but the rigorous simulation of a real small polymer on a computational environment is still very difficult, let alone a variety of complex polymers. The traditional workaround to this problem is to use an artificial chemistry (Dittrich et al., 2001, Suzuki et al., 2003). Artificial chemistries are abstractions of real chemistry where only certain key aspects of molecules, interactions, dynamics and space are retained. Their interpretation may be literal or algorithmic. A general way of representing molecules in an artificial chemistry model is to use alphanumerical strings where the properties of the agents are coded in several ways on the character sequence. Catalytic properties, for example, may be algorithmic interpretations of string sequences while reactions or template replication can work orthographically on strings, changing its character sequences (Tominaga et al., 2008).
On many artificial chemistry models, the energy involved in agent transformations is not explicitly calculated, although it may be deducible from kinetic mechanisms and parameters, if present. Some approaches to a formal definition of energy transformations use tools from physics to derive kinetic and potential energies for the artificial chemistry agents (Benkö et al., 2002, Mayer and Rasmussen, 1998, Yoshii and Kakazu, 1997). A different approach is to apply the chemical thermodynamics concept of standard formation energies to assign energetic value to agents and their reactions (Fernando and Rowe, 2007, Fernando and Rowe, 2008).
Motivated by the metabolism-first hypothesis of autocatalytic sets of proteins (or polymers in general), Kauffman, Hordijk and Steel (Kauffman et al., 1986, Hordijk and Steel, 2004, Hordijk et al., 2012), Jain (Jain and Krishna, 2000), Bagley and Farmer (Bagley and Farmer, 1991, Farmer et al., 1986) and Vasas et al. (Vasas et al., 2012), among others, developed models of catalysed reactions of string cleavage and condensation using string food sources. The species evolve in variety and concentration on an open well-stirred reactor and the network of catalysed reactions was analysed using graph theory methods. The set of differential equations used for concentration simulation regularly changes to accommodate for new reactions and string species. These non-genetic models revealed sustained metabolisms but, although quite general, their correspondence to biological metabolisms is not easy to establish. In addition, the focus is mainly on kinetics and not on energetics, which makes reasoning about energy flows more difficult. The matter flow would also be easier to follow if some molecular analogies to biological compounds were used. These analogies, if taken coherently, can add a layer of meaning to the strings, the reactions and the energetics, making the results more easy to interpret in chemical or biochemical terms.
In an attempt to develop a chemical evolution model that adheres to these guidelines, the model SSE (“steady state evolution”) was developed and is now reported. This model uses an artificial catalytic string-based chemistry, sustained by a food source, but it does not include concentrations or kinetics; the evolution is only on string quality. A detailed approach on string orthography, reaction syntax and energetics was used, expecting some new insights on catalytic systems without the need to make kinetic assumptions like rate constants, reaction mechanisms or initial concentrations, therefore increasing the generality of the model in that respect. Energetics allows to reason about reaction or pathway extension in a time-independent manner by interpreting equilibrium constant values. Although not explicit in the model, time is nevertheless implicit in the SSE algorithm for state evolution, described below. Time can be identified with the number of strings exchanged or the number of updates made, even if the precise relation between these changes and time is unspecified.
Simulations begin with a pool of strings, some permanent, the food set, and some exchangeable. The exchange of strings represents the unspecific and uncatalytic breakdown and random reconstitution of polymers driven by a virtual external energy source. This update mechanism adds random novelty to the pool and is a metaphor of an open, non-conservative, dissipative system with structural instability (Prigogine and Kondepudi, 1999). After each exchange event, the specific catalysts are identified and the catalysed reaction instances are defined, adding reaction products to the pool. After analysis of the pool state, the algorithm returns to the string exchange step.
Since the model's reactions are all thermodynamically characterized and the reaction networks have a defined food set, the transformation pathways from food set to end-products were found by subjecting the reaction set to elementary flux mode, EFM, analysis, a concept introduced and developed by Schuster and coworkers (Schuster and Hilgetag, 1994, Schuster et al., 1999, Schuster et al., 2000) and used mainly for biochemical networks analysis. A mini-review can be found in Trinh et al., 2009. Given a catalytic network of reactions, an elementary flux mode is a minimal, non-decomposable, route on that network through which a set of precursors can be transformed into a set of products in a steady state regime. The complete inhibition, or absence, of only one of the catalysts on an EFM disrupts the flux from precursor(s) to product(s). Several EFMs can normally be identified in a network, even in small ones, and they may overlap to form decomposable flux modes. The calculation of EFM sets in a network is a complex problem that grows harder fast with the size of the network and the precursor set (Acuña et al., 2010). It requires the knowledge of the stoichiometric structure of the reaction set and its thermodynamic constraints, namely reaction reversibility or irreversibility, but not its kinetic parameters. Since the evolving networks of the SSE model are completely unknown prior to simulation, EFM analysis is a particularly valuable tool (although computationally heavy for larger networks) because it offers a non-biased way to determine which strings can be materially sustained by the fixed food source at steady state, here called the network's end-products. SSE's simulation results are therefore a sequence of EFM sets, one for each pool update.
The SSE model is described below in some detail and compared with other models, followed by a report of results, illustrated by some representative simulations, and its discussion.
Section snippets
SSE model description
The code was written and simulated in MATLAB, with calls to elementary flux mode analysis software efmtool (Terzer and Stelling, 2008) through a MATLAB-Java interface. The source code for results reproduction is available by request to the author.
Results
The raw results are extensive and will be summarized below with the aid of four representative network examples from the last state of four simulations, Fig. 2, Fig. 3, Fig. 4, Fig. 5, that illustrate typical properties of results data. The main results are indicated by the following subtitles: 3.1 recursive reactions, 3.2.1 transient EFM networks, 3.3.1 robustness, 3.3.2 energy coupling, 3.3.3 material flow, 3.3.4 catalysis regulation, 3.3.5 intermediate cycling, 3.3.6 metabolic cycling, 3.4.1
SSE relation to kinetic models
Although SSE is not a kinetic model, the SSE algorithm still models time and string quantity in an implicit way; the EFM network qualifiers “transient” and “sustained” are a consequence of that. The pool updating procedure in SSE simulations models time in a discontinuous way, performing changes by leaps from state to state where string implicit quantity changes in a binary way, zero or non-zero, as in being or not being in the pool. A kinetic model, on the other hand, calculates the
Conclusion
The SSE model differs from other constructive food source catalytic models mainly because it does not use kinetics and is more detailed and explicit in its use of reaction types and reaction energy values. It also uses elementary mode analysis and string age records as its probing tools. This approach revealed some important similarities between SSE networks and biological networks, suggesting that a purely catalytic metabolism may have many biochemical features without a modern genetic system.
Acknowledgements
The author would like to thank Anake Kijjoa, Maria Strecht and Maria Rangel for all kinds of support.
References (50)
- et al.
A note on the complexity of finding and enunerating elementary modes
Biosystems
(2010) - et al.
Autocatalytic replication of polymers
Physica D
(1986) - et al.
Natural selection in chemical evolution
J. Theor. Biol.
(2007) - et al.
The origin of autonomous agents by natural selection
Biosystems
(2008) - et al.
“The arrival of the fittest”: toward a theory of biological organization
Bull. Math Biol.
(1994) Self-organization: why early life did not evolve through natural selection
J. Theor. Biol.
(2006)- et al.
Ser-His catalyses the formation of peptides and PNAs
FEBS Lett.
(2009) - et al.
Detecting autocatalytic self-sustaining sets in chemical reaction systems
J. Theor. Biol.
(2004) - et al.
Autocatalytic sets of proteins
J. Theor. Biol.
(1986) The origin of life – a review of facts and speculations
Trends Biochem. Sci.
(1998)