Research Article

Breeding novel solutions in the brain: a model of Darwinian neurodynamics

[version 1; peer review: 1 approved, 2 approved with reservations]
PUBLISHED 28 Sep 2016

Abstract

Background: The fact that surplus connections and neurons are pruned during development is well established. We complement this selectionist picture with a proof-of-principle model of evolutionary search in the brain that accounts for the generation of new variants in theory space: a model of Darwinian evolutionary search for candidate solutions in the brain.
Methods: We combine known components of the brain – recurrent neural networks (acting as attractors), the action selection loop and implicit working memory – to provide the appropriate Darwinian architecture. We employ a population of attractor networks with palimpsest memory. The action selection loop is employed with winners-share-all dynamics to select for candidate solutions that are transiently stored in implicit working memory.
Results: We document two processes: selection of stored solutions and evolutionary search for novel solutions. During the replication of candidate solutions, attractor networks occasionally produce recombinant patterns, increasing the variation on which selection can act. Combinatorial search acts on multiplying units (activity patterns) with hereditary variation, and novel variants appear due to (i) noisy recall of patterns from the attractor networks, (ii) noise during transmission of candidate solutions as messages between networks, and (iii) spontaneously generated, untrained patterns in spurious attractors.
Conclusions: Attractor dynamics of recurrent neural networks can be used to model Darwinian search. The proposed architecture can be used for fast search among stored solutions (by selection) and for evolutionary search when novel candidate solutions are generated in successive iterations. Since all the suggested components are present in advanced nervous systems, we hypothesize that the brain could implement a truly evolutionary combinatorial search system, capable of generating novel variants.

Keywords

attractor network, autoassociative neural network, evolutionary search, Darwinian dynamics, neurodynamics, learning, problem solving

Introduction

The idea that functional selection on a large set of neurons and their connections takes place in the brain during development [1–3] is now experimentally validated [4–7]. As originally portrayed, this process is only one round of variation generation and selection, even if it takes several years. Evolution by natural selection works differently: variants are generated and then selected in iterative rounds. The field of “Neural Darwinism” [1–3] fails to include the generation of variants and thus could justifiably be regarded as a misnomer, because the process that it describes is not evolutionary in the strict sense [8]. Evidence indicates that the development of the brain is more “constructivist” [9] than pictured by the original selectionist accounts: for example, repeated rounds of neuron addition and loss happen during development [10]. Structural plasticity (synaptic remodelling) is now known to be a lifelong process with implications for memory and learning (e.g. [11,12]). The addition and deletion of synapses and neurons takes several hours or days [13]. Our main goal here is to present a proof of principle that bona fide evolutionary dynamics could happen in the brain on a much faster time scale.

Maynard Smith [14] identified multiplication, inheritance and variability as necessary features of evolution. In genetic evolution the variability operators are mutation and recombination. If there are hereditary traits that affect the survival and/or the fecundity of the units, then in a population of these units evolution by natural selection can take place. While this characterization qualitatively outlines the algorithmic aspect of evolution [15], concrete realizations also require quantitative conditions: the population cannot be too small (otherwise neutral drift dominates over selection [16]) and replication accuracy cannot be too low (otherwise hereditary information is lost [17]). Note that this description says nothing about the nature of the units: they could be genes, organisms, linguistic constructions or anything else.

The proper implementation of an evolutionary process within the nervous system could have major implications for neuroscience and cognition [8,18–25]. A main benefit of neuro-evolutionary dynamics would be that it could harness the parallelism inherent in the nervous system and, at the same time, the redistribution of resources: hopeless variants are thrown away and are replaced in the “breeding space” by more promising ones [8]. Another important aspect of the process is that it is generative: it could explain where new hypotheses and new policies come from in Bayesian approaches to cognition [26,27] and in reinforcement learning [28–30], respectively. Bayesian inference and natural selection are analogous [31,32] in that candidate hypotheses in the brain (the prior distribution) represent a population of evolutionary units, which are evaluated, or selected, based on the evidence. There is a mathematical isomorphism between the discrete-time replicator equation and Bayesian update [31]: the likelihood function is analogous to the fitness function, and the posterior distribution to the selected population. Relations like this suggest that Bayesian update could be one of the pillars of “universal Darwinism” [33]. We believe that convincing models for neuro-evolution could empower Bayesian approaches by providing a mechanism to generate candidate hypotheses.

Attractor networks have been used (among others) as models of long-term memory that are able to complete partial input [34,35]. These networks consist of one layer of units whose recurrent connections feed back to the same layer. The recurrent connections can learn (store) a set of patterns with a Hebbian learning rule. Later, if these patterns or their noisy versions are used to provoke the network, it settles on the original patterns after several rounds of activation updates through the recurrent weights (recall); stored patterns thus act as attractors. Importantly, the existence of such networks has been experimentally validated in the visual cortex of awake mice by optogenetic methods [36]. Some versions of the learning rule allow for iterative learning without catastrophic forgetting and enable palimpsest memory: a network with palimpsest memory is able to learn new patterns one by one, while sequentially forgetting earlier ones.

In this paper we describe a model that implements evolution of activation patterns in the brain with the help of attractor networks. We see it as a model of problem solving that is able to generate new candidate solutions to a problem based on past experiences. Any cognitive problem of the brain is encoded by the activity pattern of neurons. We represent neurons as binary units that can continuously maintain firing in one state or the other; a group of neurons at any time therefore has a binary activation pattern. In our model, the units of evolution are these activation patterns, represented as bitstrings. Attractor neural networks can store activation patterns stably for a considerable time in the form of corresponding attractors and are able to recall them given the appropriate trigger (Figure 1A). This memory allows for heredity, which is indispensable for Darwinian dynamics (in genetic populations memory is the genotype pool). Attractor neural networks can generate new pattern variants in different ways (corresponding to mutation in a genetic system); see the Discussion. Owing to memory and pattern generation, iterated selection over a population of activation patterns becomes feasible. Our approach thus offers a more natural way to incorporate hereditary dynamics in models of cognitive problem solving, at a faster scale than could be provided by, say, structural plasticity (cf. [37]). This fast-scale dynamics is missing from Edelmanian Neural Darwinism.


Figure 1.

A) Architecture of multiple attractor networks performing Darwinian search. Boxed units are attractor networks. Each network consists of N neurons (N = 5 in the figure, represented as black dots). Each neuron receives input from the top (1) and generates output at the bottom (2). Each neuron projects recurrent collaterals to all other neurons (but not to itself), thus forming N × (N – 1) synapses. The weight matrix of the synapses is represented here as a checkerboard-like matrix, where different shades indicate different weights on the connections. Selection and replication at the population level proceed as follows: 1) Each network receives a different noisy copy of the input pattern. 2) According to its internal attractor dynamics, each network returns an output pattern. 3) All output patterns are pooled in the implicit working memory (grey box with dashed outline), where they are evaluated and a fitness wi is assigned to the ith pattern. 4) The best pattern(s) is selected based on fitness. 5) One of the networks is randomly chosen to learn the selected pattern, with additional noise (dashed arrow). 6) The selected pattern is copied back to the networks as input to provoke them to generate the next generation of output patterns. B) Life cycle of candidate-solution patterns during a cognitive task. Patterns are stored in the long-term memory as attractors of autoassociative neural networks. When provoked, networks produce output patterns, which are stored in implicit working memory. These patterns are evaluated and selected. Patterns that are a good fit to the given cognitive problem can increase their chance to appear in future generations in two possible, non-exclusive ways: 1) selected patterns are retrained to some networks (learning) and 2) selected patterns are used as inputs for the networks (provoking). The double dynamics of learning and provoking ensures that superior solutions will dominate the system. Erroneous copying of patterns back to the networks for provoking and learning, together with noisy recall, provides the sources of variation (like mutations).

The patterns represent candidate hypotheses or candidate solutions to a problem, which are evaluated based on a fitness function that measures their goodness as a solution. The best patterns are selected and copied (with variation) back to the networks, which in turn generate the next generation of patterns (Figure 1B). Stored patterns constitute the long-term memory; output patterns constitute the working memory (Figure 1B). While pattern generation is a simple recall task, which is only able to reproduce previously learnt patterns, the whole system is able to generate new variants due to noisy recall, spurious patterns (see later), noisy copying of patterns, and iterative learning, thus enabling the evolution of novel solutions.

Methods

Recurrent attractor networks. The basic units in our model are attractor networks: recurrent neural networks consisting of one layer of units that are potentially fully connected. An attractor neural network produces the same (or a highly correlated) output whenever the same input is provided (barring retraining). A learnt pattern becomes the attractor point of a new basin of attraction, i.e. it is the prototype pattern that the attractor network should return. Consequently, an attractor with a non-zero-sized basin should also return the same output for a range of different input patterns. However, the amount and type of correlation of input patterns that retrieve the same prototype, i.e., the actual structure of the basin of attraction, is hard to assess, let alone visualize. Still, it is safe to assume that most input patterns correlated with the prototype produce the same output – the prototype itself.

The Hopfield network is a recurrent artificial neural network with binary neurons at its nodes and weighted connections between nodes, excluding self-connections. According to the usual convention, the two states of a binary neuron are +1 and -1. In our model, a neuron fires (state +1) if the total sum of its incoming collaterals is greater than 0. Accordingly, the update rule has the following form:

x_i(t+1) = \mathrm{sgn}\!\left( \sum_{j=1,\, j \neq i}^{N} w_{ij}\, x_j(t) \right).

The original Hebbian (covariance) learning rule has the following form (where m is the index of the patterns):

w_{ij}^{0} = 0, \quad \forall\, i, j \in \{1, 2, \ldots, N\},

w_{ij}^{m} = w_{ij}^{m-1} + \frac{1}{N}\, \xi_i^{m} \xi_j^{m}.

The Hebb rule is both local and incremental. A rule is local if the update of a connection depends only on information available on either side of the connection (including information coming from other neurons via weighted connections). A rule is incremental if the system does not need information from previously learnt patterns when learning a new one; the update process uses only the present values of the weights and the new pattern. The above rule performs an immediate update of the connection weights (a “one-shot” process, not a limit process requiring multiple update rounds). The covariance rule has a capacity of 0.14 N [58]. However, if the system reaches its capacity during learning and further patterns are presented, catastrophic forgetting ensues: the network becomes unable to retrieve any of the previously stored patterns, forgetting all it has learnt.
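
To make these rules concrete, the update and covariance rules can be sketched in a few lines of NumPy. This is an illustrative reconstruction of the equations above, not the published code of the model (the latter is available in the repository cited under Data and software availability); in particular, the convention that a zero field maps to -1 follows the firing condition stated above.

    import numpy as np

    def train_hebbian(patterns):
        # One-shot covariance rule: w_ij += (1/N) * xi_i * xi_j, no self-connections.
        N = patterns.shape[1]
        W = np.zeros((N, N))
        for xi in patterns:
            W += np.outer(xi, xi) / N
        np.fill_diagonal(W, 0.0)
        return W

    def recall(W, x, max_sweeps=20, rng=None):
        # Asynchronous recall: update neurons one at a time in random order until
        # the state stops changing. A neuron fires (+1) iff its summed input > 0.
        if rng is None:
            rng = np.random.default_rng()
        x = x.copy()
        for _ in range(max_sweeps):
            changed = False
            for i in rng.permutation(len(x)):
                s = 1 if W[i] @ x > 0 else -1   # diagonal is zero, so j = i drops out
                if s != x[i]:
                    x[i], changed = s, True
            if not changed:
                break
        return x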

To overcome this problem while preserving the favorable properties of the covariance rule (one-shot, local and incremental updating), Storkey introduced a palimpsest learning scheme [41] as follows:

w_{ij}^{m} = w_{ij}^{m-1} + \frac{1}{N}\, \xi_i^{m} \xi_j^{m} - \frac{1}{N}\, \xi_i^{m} h_j^{m} - \frac{1}{N}\, h_i^{m} \xi_j^{m} \quad \text{if } i \neq j, \qquad w_{ij}^{m} = 0 \quad \text{if } i = j,

and

h_i^{m} = \sum_{k=1}^{N} w_{ik}^{m-1}\, \xi_k^{m}.

Using the above rule, the memory becomes palimpsest (i.e. new patterns successively replace earlier ones during learning) with a capacity of C = 0.25 N (for details and a proper definition of palimpsest capacity, see [41]).
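
The Storkey update likewise reduces to one outer-product step. A sketch under the same conventions; note that some statements of the rule exclude k = i, j from the local-field sum, whereas this sketch follows the displayed equations literally:

    def train_storkey(W, xi):
        # One-shot Storkey palimpsest update for a single +/-1 pattern xi:
        # w_ij += (1/N) * (xi_i*xi_j - xi_i*h_j - h_i*xi_j), zero diagonal,
        # where h_i = sum_k w_ik^(m-1) * xi_k uses the OLD weights.
        N = len(xi)
        h = W @ xi
        W = W + (np.outer(xi, xi) - np.outer(xi, h) - np.outer(h, xi)) / N
        np.fill_diagonal(W, 0.0)
        return W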

An interesting feature of some autoassociative neural networks is the appearance of spurious patterns: in some cases the network converges to a pattern different from any pattern learnt previously. These spurious patterns can be mixtures of an odd number of stored patterns:

\xi_i^{\mathrm{spur}} = \pm\, \mathrm{sgn}\!\left( \pm \xi_i^{m_1} \pm \xi_i^{m_2} \pm \cdots \pm \xi_i^{m_S} \right),

where S is the (odd) number of combined stored patterns [58]. This effect can be thought of as an effective implementation of a neuronal recombination operator.
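
As an illustration, a candidate mixture state can be constructed directly from stored patterns (a hypothetical helper of ours; whether such a mixture is actually a stable attractor depends on the trained weights):

    def spurious_mixture(patterns, signs):
        # Sign-combine an odd number S of stored +/-1 patterns (rows of
        # `patterns`) with +/-1 coefficients; with S odd the sum is never zero.
        return np.sign(signs @ patterns).astype(int)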

Selection. For the selection experiment, we used NA structurally identical attractor networks, each consisting of N neurons and implementing Storkey’s palimpsest learning rule. NA = 20 networks (N = 200) were initially trained with random patterns plus one special pattern each. The 20 special training patterns were as follows: the worst special pattern was the uniform -1 pattern, the best was the uniform +1 pattern, and intermediate special patterns had an increasing number of +1-s from the left. Fitness was measured as the relative Hamming similarity to the globally best target Otarget (i.e. the proportion of +1-s in the pattern). The worst special pattern was trained only to network #1, the second worst to #2, etc., while the best special pattern (which was the target pattern) was trained to network #20. In this scenario, no further training occurred (i.e., the dashed arrows of Figure 1 are absent). Assuming that the attractor basins of these patterns overlap among networks (Figure 2A), the output of one network can act as a cue that triggers one or more nearby special patterns in other networks. The special patterns ensure that there exists a search trajectory leading from the worst to the best pattern. Starting from any arbitrary initial pattern, if any of the special patterns gets triggered at any time, the system can quickly converge to the optimum.


Figure 2. Schematics of attractor networks searching for the global optimum.

A) Four time steps of selection, from top to bottom. At each step, we show only the network that produces the best output (numbered); the rest of the networks are not depicted. In each time step the networks are provoked by a new pattern that was selected from the previous generation of patterns. Different attractor networks partition the pattern space differently: blobs inside networks represent basins of attraction. At the start, the topmost network (#3) is provoked with an input pattern and returns the center of the attractor basin triggered by the input. When the output of this network is forwarded as input to the next network (#11), there is a chance that the new attractor basin has a center that is closer to the global optimum. If there is a continuity of overlapping attractor basins through the networks, from the initial pattern (top) to the global optimum (bottom), then the system can find the global optimum even without learning. B) Learning in attractor networks. Network #5, when provoked, returns an output pattern that is used to train network #9 (blue arrow). As the network learns the new pattern, the palimpsest memory discards an earlier attractor (with the gray basin), a new basin (purple shape) forms around the new prototype (purple ×) and possibly many other basins are modified (basins with dotted outlines). Black dots indicate attractor prototypes (i.e. learnt patterns). With learning, successful patterns can spread in the population of networks. Furthermore, if learning is noisy, so that a network may learn a slightly different version of the pattern, new variation is introduced into the system on top of the standing variation. This allows finding the global optimum even if it was not pre-trained to any network. The gray arrow in the background indicates the timeline of network #9.

After initial training, each network received the same random input and generated an output according to its internal attractor dynamics. The output population was evaluated and the best output Obest was selected based on its fitness. Noisy copies of Obest (mutated with per-bit probability μI) were redistributed to each network as new input for the next generation. These steps were iterated until fitness reached the theoretical optimum (i.e. the system found special pattern #20). The crucial assumption for selection to work is continuity: the output of an attractor of one network may fall into an attractor basin of another network whose prototype is closer to the global optimum than the input was (see Figure 1 and Figure 2).
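
One generation of this purely selectionist loop might look as follows, reusing recall from the sketch above; fitness is assumed to be a function returning the relative Hamming similarity to Otarget:

    def noisy_copy(pattern, mu, rng):
        # Flip each bit independently with probability mu (per-bit mutation).
        flips = rng.random(len(pattern)) < mu
        return np.where(flips, -pattern, pattern)

    def selection_generation(weights_list, o_best, fitness, mu_I, rng):
        # Provoke every network with a different noisy copy of the current best
        # pattern and return the fittest output (no learning: step 5 is absent).
        outputs = [recall(W, noisy_copy(o_best, mu_I, rng), rng=rng)
                   for W in weights_list]
        return max(outputs, key=fitness)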

Evolutionary optimization on a single-peak landscape. In contrast to the purely selective dynamics, in the evolutionary experiment networks could learn new patterns during the search process. At the start, each network was trained with a different set of random patterns. The fitness of a pattern is defined as the relative (per-bit) Hamming similarity between the given pattern and an arbitrarily set globally best target pattern Otarget. The selection of the actual best output Obest and the redistribution of its noisy copies (with μI = 0.005) as input were the same as before. Most importantly, mutated versions (with μT = 0.01) of Obest were also used for retraining NT different networks in each generation (see Figure 1): this forms the basis of the Darwinian evolutionary search over attractor networks, as it allows replication with variation of (learnt) patterns over networks (thin lines in Figure 3).
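
Adding step 5 of Figure 1A turns the loop into an evolutionary search. A sketch reusing the helpers above:

    def evolutionary_generation(weights_list, o_best, fitness,
                                mu_I, mu_T, n_T, rng):
        # Selection as before, plus retraining: noisy copies of the winner are
        # learnt (Storkey rule) by n_T randomly chosen networks.
        outputs = [recall(W, noisy_copy(o_best, mu_I, rng), rng=rng)
                   for W in weights_list]
        winner = max(outputs, key=fitness)
        for k in rng.choice(len(weights_list), size=n_T, replace=False):
            weights_list[k] = train_storkey(weights_list[k],
                                            noisy_copy(winner, mu_T, rng))
        return winner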


Figure 3. The effect of retraining on the speed of evolution.

Lines represent evolution in four different populations, in which different numbers of networks were retrained. Each population consisted of 10 networks (see the rest of the parameters under the Methods section). Thin lines: stochastic attractor dynamics; thick lines: simulated attractor dynamics (abstract networks that always return the stored attractor prototype closest to the actual input, with per-bit noise probability 0.001; capacity of Cfix = 30 patterns, μO = 0.002). Parameters: N = 200, NA = 20, μT = 0.01, μI = 0.005; elitist selection, keeping only the best output of each generation; retraining selects random networks (never the same twice in a given generation). Fitness is the relative Hamming similarity to the global optimum.

We compared the search behavior of our system of attractor networks with a simpler model in which networks were represented as abstract storage units that store exactly Cfix patterns (Cfix was set close to the actual capacity of the networks). When such a storage unit receives an input pattern, it simply returns the closest (in Hamming distance) of its stored patterns as output, with additional noise (μO = 0.001). The units simulate the almost perfect recall property of attractor networks and effectively approximate attractor behavior. We compared evolution in this simple model with evolution in the system of attractor networks (thick and thin lines in Figure 3).
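
A sketch of such a storage unit (class and method names are ours):

    class StorageUnit:
        # Idealized stand-in for an attractor network: stores at most c_fix
        # patterns and, when provoked, returns the stored pattern closest in
        # Hamming distance, with per-bit output noise mu_O.
        def __init__(self, c_fix, mu_O, rng):
            self.patterns, self.c_fix, self.mu_O, self.rng = [], c_fix, mu_O, rng

        def learn(self, xi):
            self.patterns.append(xi.copy())
            if len(self.patterns) > self.c_fix:   # palimpsest: drop the oldest
                self.patterns.pop(0)

        def provoke(self, x):
            # Maximal overlap p @ x is equivalent to minimal Hamming distance.
            nearest = max(self.patterns, key=lambda p: p @ x)
            return noisy_copy(nearest, self.mu_O, self.rng)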

Optimization in a changing environment. In order to test the effect of memory on successive search, we implemented a periodically changing selective environment, i.e., we periodically changed the fitness function. The environment alternated between E1 and E2, with a stable period length of TE = 2000 generations. Each environmental change reset the global optimum: we assumed the uniform +1 sequence as the global optimum of E1 and its inverse, the uniform -1 sequence, for E2, and used the relative Hamming similarity as the fitness measure.

In the first phase of the simulation, networks were allowed to learn in each environment for a total of Tnolearn = 12000 generations (three periods per environment). Afterwards, learning was turned off to test the effect of memory. To make sure that the optimal pattern was not simply carried over as an output pattern from the previous environment but was recalled from memory, the input patterns were reset to random patterns (instead of inheriting the previous output population) at the start of each new environmental period after Tnolearn. This ensures that the population could only maintain high fitness in an environment if the optimum was stored and could be successfully recalled (see Figure 4). In order to assess the memory of a network, we also measured the distance between the actual best output of the population and the closest of the previously learned patterns within the same network (as different networks have different training histories). A small distance indicates that the network outputs a learned pattern from memory (i.e. recalls it) rather than a spurious pattern.


Figure 4. Fitness and recall accuracy over periodically alternating environments.

Blue: average fitness; green: best fitness; purple: distance of the best output of the population from the closest pattern stored in memory (for details, see main text). Grey and white backgrounds represent the changing environment: we alternated between two global optima every 2000 generations. After the 12000th generation, we turned off learning (thick vertical line) and reset the input to random patterns after each change of the environment. Parameters: NA = 100, N = 100, NT = 40; fitness is the relative Hamming similarity to the actual optimum.

For this scenario, we introduced a different selection method (also used in the next section). Each network in the population produces an output according to its internal attractor dynamics and the input it received from the previous generation. From all output sequences one is randomly chosen and mutated (µR = 1/N per-bit mutation rate). If the mutant has a higher fitness than the worst pattern of the output pool, the worst pattern is replaced by it (elimination of the worst). Furthermore, a superior mutant is also trained to NT different networks. Lastly, the resulting output population is shuffled and fed to the networks as input in the next generation (except when the environment changes and the input is reset externally).
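
A sketch of this selection step, reusing the helpers introduced above:

    def eliminate_worst_step(outputs, weights_list, fitness, mu_R, n_T, rng):
        # Mutate one randomly chosen output; if it beats the worst of the pool,
        # it replaces the worst and is also trained into n_T random networks.
        mutant = noisy_copy(outputs[rng.integers(len(outputs))], mu_R, rng)
        worst = min(range(len(outputs)), key=lambda i: fitness(outputs[i]))
        if fitness(mutant) > fitness(outputs[worst]):
            outputs[worst] = mutant
            for k in rng.choice(len(weights_list), size=n_T, replace=False):
                weights_list[k] = train_storkey(weights_list[k], mutant)
        rng.shuffle(outputs)      # the shuffled pool becomes the next inputs
        return outputs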

Optimization on a difficult landscape. To investigate the applicability of this optimization process, we adopted a complex, deceptive landscape with scalable correlation, and also modified the selection algorithm introduced above. We used the general building-block fitness (GBBF) function of Watson and Jansen [38]. According to the GBBF function, each sequence of length N is partitioned into blocks of uniform length P, so that N = P·B (P, B ∈ ℤ+), where B is the number of blocks. For each block, L arbitrarily chosen subsequences are designated as local optima, with randomly chosen but higher-than-average subfitness values. The overall fitness F(G) of a pattern G (“genotype”) is as follows:

F(G) = \sum_{i=1}^{B} f(g_i),

f(g_i) = \sum_{j=1}^{L} c(g_i, t_j),

c(g, t_j) = \begin{cases} w_j, & \text{if } d(g, t_j) = 0 \\ \bigl(1 + d(g, t_j)\bigr)^{-1}, & \text{otherwise,} \end{cases}

where f(gi) is the fitness contribution of the ith block of the pattern, tj is the jth local optimum of length P (all L different optima are the same for each block in our experiments) with subfitness value wj > 1, and d is the Hamming distance. Consequently, this landscape has many local optima, a single global optimum and a highly structured topology. Furthermore, since blocks have no nonlocal effects, each block can be optimized independently, favoring a metapopulation search.
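
Equation 1–Equation 3 translate directly into code. A sketch, where targets and weights hold the L block optima t_j and their subfitness values w_j:

    def gbbf_fitness(G, targets, weights, P):
        # F(G) = sum over blocks g_i of f(g_i), f(g_i) = sum_j c(g_i, t_j),
        # c(g, t_j) = w_j on an exact match, else 1 / (1 + Hamming distance).
        total = 0.0
        for start in range(0, len(G), P):          # B = len(G) // P blocks
            g = G[start:start + P]
            for t, w in zip(targets, weights):
                d = int(np.sum(g != t))            # Hamming distance d(g, t)
                total += w if d == 0 else 1.0 / (1.0 + d)
        return total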

Accordingly, in this experiment we introduced multiple populations of attractor networks. Each population of NA attractor neural networks forms a deme, and ND demes are arranged on a 2D square lattice with Moore neighborhood (all eight surrounding demes are considered neighbors). Demes may accept output sequences from neighboring demes with a low probability pmigr per selection event; this slow exchange of patterns provides the extra variability necessary for recombination. These demes would correspond to groups of columns in the brain.

Networks in a deme are the same as those used in the previous experiments, but selection is modeled differently, similar to the selective dynamics outlined in [38]; we give only a brief description. Within a deme, each network produces an output according to its internal attractor dynamics and the input it received from the previous generation. Output sequences are pooled and either one or two are randomly chosen for mutation or recombination, respectively (i.e. no elitist selection). With probability prec, two-point recombination is performed on the two selected partners; with probability 1 – prec, a single selected sequence is mutated with µR = 1/N per-bit mutation rate. With probability pmigr, the recombinant partner is chosen from a neighboring deme instead of the focal one. Next, the output(s) of recombination or mutation are evaluated: if the resulting sequence (either of the two recombinants, or the mutant) has a higher fitness than the worst of the output pool, the worst is replaced by the better one (elimination of the worst). Furthermore, a superior mutant or recombinant is also trained to NT different networks within the deme. Lastly, the resulting output population is shuffled and fed to the networks as input in the next generation. Each deme is updated in turn according to the outlined method; a full update of all networks in all demes constitutes a generation (i.e. a single time step).
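
The two-point recombination used here can be sketched as follows (the cut-point convention is ours; the source code referenced below is authoritative):

    def two_point_recombination(a, b, rng):
        # Swap the segment between two random cut points of patterns a and b,
        # returning both recombinants.
        i, j = sorted(rng.choice(len(a) + 1, size=2, replace=False))
        r1, r2 = a.copy(), b.copy()
        r1[i:j], r2[i:j] = b[i:j].copy(), a[i:j].copy()
        return r1, r2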

The GBBF landscape was set up identically to the test case in [38], as follows. For each block, two target sequences of length P, T1 and T2, were appointed: T1 is the uniform plus-one sequence T1 = {+1}^P and T2 alternates between -1 and +1 (T2 = {-1, +1}^{P/2}). According to the fitness rule (Equation 5–Equation 6 in [38] and Equation 1–Equation 3 above), the best subfitness of each block in a sequence can be calculated, and the sum of all the subfitness values is the fitness of the global optimum sequence. For the sake of simplicity, we used relative fitness values, with the global optimum (the uniform +1 sequence) having maximal fitness 1. The sequence(s) with the lowest fitness always have a nonzero value.

The source code of all models and data presented in this paper is freely available as a supplement to this paper.

Results

Selection. We should distinguish between two processes: (i) search without learning among the stored patterns to find the best available solution (i.e., selection without step 5 of Figure 1A), and (ii) search with learning, retraining one or more networks with the selected and mutated patterns (Figure 1A with step 5). The first is a purely selectionist approach because it cannot generate heritable variants; the second implements Darwinian evolution because learning changes the output behavior of the networks, so that they generate new patterns. First we analyze the strictly selectionist version, then the evolutionary version of the model.

In the selectionist version we pre-trained each network with a random set of patterns (excluding the target pattern) and started by provoking each with a different random input. Each network produced an output pattern according to its own attractors and then the best pattern was selected. This pattern was used in turn to provoke the networks in the next generation, and so on. This search found, among all the available stored (pre-trained) patterns, the one with the highest fitness; it could not find the global optimum, as the networks were not pre-trained with it and there was no way for new variants to appear in this simulation.

Next, we specifically composed the sets of pre-training patterns: each network was pre-trained with random patterns as before, but also with one special pattern. This set of special patterns (whose individual patterns can be ordered according to gradually increasing fitness values) delineates a route to the optimum through overlapping basins of attraction in different networks (see Figure 2A), so that we can test whether in this simplified case the algorithm converges quickly to the optimum. The first population was initiated with the special pattern farthest from the optimum. We found that the selected output gets closer to the optimum in each generation, but the optimization process is saltatory: it skips over many intermediate neighboring special patterns (and thus networks). This is because the attractor basins of neighboring special patterns overlap strongly. For example, in Figure 2A, the stored special pattern of network #3 lies in the basins of the stored special patterns of networks #4–#11, and since the stored pattern of network #11 is closest to the optimum, networks #4–#10 were skipped. A typical sequence of networks generating the actual best output is: #3, #11, #17 and #20 (of 20 networks; for actual parameters, see Figure 2A).

Evolution. Learning new patterns as attractors (Figure 2B) allows networks to adapt to the problem and perform evolutionary search. The results of the evolutionary experiments clearly demonstrate that a population of attractor networks can implement evolutionary search in problem spaces of different complexity (i.e. different levels of correlation and deceptiveness).

Evolution on a simple fitness landscape. In this scenario, neither the global optimum nor a route toward it is assumed to pre-exist in the system, as in the selectionist experiments: networks are pre-trained only with random patterns. Even under these stringent conditions, we found that the system can converge to the global optimum, and this convergence is robust against a wide range of mutation rates. Our simplified abstract model, which always returns the stored prototype closest to an input, behaves qualitatively the same way (see Figure 3). The speed of convergence to the optimum is mainly affected by the number of retrained networks (Figure 3): as we increase the number of networks that are retrained, fitness increases faster, albeit with diminishing returns. Mutation has an optimal range in terms of the speed of evolution: if the mutation rate is too low, evolution slows down because there is not enough variation among patterns; if it is too high, it hinders evolution because the offspring is too dissimilar to the parent and cannot exploit the attractor property of the system. When the mutation rate is zero, the only sources of variation are the probabilistic input-output behavior of the networks (due to their asynchronous update) and the appearance of spurious patterns when the input is too far from the stored patterns.

While the attractor networks have memory, the monotonic, single-peak nature of the fitness landscape means there is no need to use it: the system works almost equally well if the networks store only the last trained pattern (i.e., weights are erased before each learning event). Next, we present experiments in which both the attractor property and the palimpsest memory of the networks are used.

Evolution in a changing environment. In this experiment we alternated two environments: every 2000 generations the target pattern (the optimum), against which fitness was measured, was changed. From an evolutionary point of view, this can be perceived as a changing environment; from a cognitive point of view, this procedure simulates changing task demands. Figure 4 shows that the system found and learnt the optima of the two environments separately. After generation 12000 we switched off learning and used random patterns to provoke the networks at the first generation of each new environment. The fact that the networks are nevertheless able to recall the target pattern right after the environmental change proves that they use previously stored memories. A single network that can recall the optimum from the random input is enough to produce a correct output that is amplified by selection into the next generation's input, ultimately saturating the population with optimal patterns. This experiment demonstrates that a system of attractor networks can reliably recall earlier stored solution patterns, and therefore solves the problem faster in an alternating environment than a system without long-term memory.

Evolution on a difficult fitness landscape. The previous evolutionary experiment (search on a single-peak fitness landscape with a single population of networks) is a proof of principle of the effectiveness of our evolutionary algorithm. In order to assess the capacity of population search by attractor networks, we introduced a considerably harder fitness landscape of higher dimensionality, where the deceptiveness of the problem can be tuned. The GBBF fitness landscape of [38] provides a method to easily generate scalable and complex landscapes with many deceptive local optima. The complexity of the problem requires the introduction of multiple interacting populations of networks. Though explicit spatial arrangement of the networks is not required to solve the problem, we included it in our implementation to imitate the real spatial arrangement of neurons in the brain. Locality allows the exchange of information among neighboring populations (i.e. recombination), which is essential to solve the GBBF problem (or similar deceptive problems) in a reasonable time.

We investigated the performance of search in a metapopulation with different problem sizes (pattern lengths; see Figure 5). Results indicate that despite the vastness of the search space, the metapopulation is always able to converge to the global optimum, given enough time. The most complex landscape, over 100-bit patterns, is of size 2^100, with one global optimum and a huge number of local optima. The metapopulation consists of 10^5 neurons (100 demes of 10 networks, each network with 100 neurons) and finds the single global optimum in ~10^4 time steps. The limit to further increasing the problem size was the computational capacity of our resources.


Figure 5. Convergence of the actual best fitness of the metapopulation with increasing problem size N on the general building-block fitness landscape.

Each curve is an average of 10 independent runs. ND = 10 × 10 demes, NA = 10 networks per deme, N neurons per network; patterns of length N are partitioned into blocks of size P = 10 (B = N/P blocks per pattern); prec = 0.1, µR = 1/N, pmigr = 0.004, NT = 5. Note the logarithmic x axis. Inset: a single simulation at N = 80, B = 8 (other parameters are the same). Plateaus roughly correspond to successive blocks being optimized by finding the best subsequence on the building-block fitness landscape.

Discussion

Summary of results. Attractor networks can be used to implement selection, replication and evolutionary search. As a proof of principle, we showed that attractor networks find the global optimum in a purely selectionist model (i.e. without learning) if they are pre-trained with attractors whose basins overlap and lead to the optimum. The population can effectively select over the standing variation of all stored patterns and find the trajectory to the (single) peak of the fitness landscape (see Figure 2B). Furthermore, if learning is allowed during search, the relative frequency of good patterns (those closer to the optimum) can be increased by re-training networks with such patterns, so that they are stored in long-term memory in more copies (Figure 3). Overwriting an older memory trace with a new pattern corresponds to a copying operation, potentially with mutations. A particularly interesting aspect of a population of attractor networks coupled in this way is that even if learning is error-free, the Lamarckian nature of pattern inheritance (output → memory → output; see Figure 1B) leaves room for heritable variation to emerge at other stages of the cycle, thus implementing Darwinian dynamics.

The explicit benefit of memory manifests itself in a periodically changing environment. In a single, stable environment memory is not very useful, because the attractor property acts against exploring variation, and networks might even be slower than gradient hill-climbers (i.e. searchers that generate mutant output blindly and only take a step on the landscape if the output is better than the input). However, in a periodically changing environment, attractor networks with memory are able to recall the actual global optimum if they have already encountered the given environment in the past and stored its optimum; hill-climbers, or naïve attractor networks lacking the necessary memory, would have to perform the search all over again. An attractor network can recall the appropriate pattern even if there is no initial cue telling the population which environment it is in: a network with memory can complete a partial cue and thus recall the global optimum. After learning has ceased, it is enough to have only a few networks in the population that can recall the optimum from random cues that happen to be close to the actual optimum (maximal fitness is immediately 1 in the population; see the green curve in Figure 4). Selection then simply amplifies these optimal output patterns in the population (as output is fed back as input in the next generation) until all networks receive optimal patterns as input. At that point average fitness also reaches the maximum (Figure 4, blue curve). A network without memory would have to search for the new optimum in each environment over and over again, finding the whole uphill path on the landscape.

We also showed that a metapopulation of attractor networks can successfully optimize on a complex, correlated landscape of higher dimensionality. This is a notoriously hard optimization problem (cf. [38]), as hill-climbers easily get stuck at deceptive local optima. The spatial organization of attractor networks resembles the spatial organization of neural structures in the cortex, and it allows parallel optimization of subproblems. Through this independent optimization of solution parts, local populations can exchange successful partial solutions with each other and form the recombinants that are necessary to solve such complex problems in reasonable time.

Evolutionary context. We chose attractor networks to demonstrate Darwinian neurodynamics because (i) the search for existing solutions uses the same architecture as generating, testing and storing novel ones; (ii) stored patterns help evolutionary search by readily employing related past (learnt) experience; and (iii) the particular choice of Storkey’s model naturally results in some new recombinant patterns. This is an important point because, as we know from population genetics [39], recombination speeds up the response to selection by creating new variants [40].

Our implementation of attractor networks follows Storkey’s model [41] for three important reasons: (i) it has palimpsest memory, so it can learn new patterns and forget old ones without the catastrophic forgetting that occurs in Hopfield and other networks; (ii) its attractors are roughly of equal size and are well structured according to a Hamming distance measure; and (iii) unlike most other neural networks, it is able to store correlated patterns. The downside is that these networks require full connectivity, which is neuronally unrealistic. However, their functional properties reflect well what we know of long-term memory in the brain, which is enough for a proof of principle of an evolutionary implementation of neuronal function. To our knowledge, no model exists in the literature that satisfies all the requirements above and, at the same time, works with diluted connectivity [42].

It is important to clarify that the units of evolution in our proposed Darwinian neurodynamics system are the activation patterns: they are copied (potentially with errors), selected and amplified. However, patterns live in two stages: in working memory and in long-term memory (cf. Figure 1). This implies different inheritance routes (ways to pass on information) from those expected in a purely genetic system. Changed attractor prototypes imply changed outputs, just as a mutated genotype implies a different phenotype; but in our proposed system, changes made to output patterns (by mutation) can also be “inherited” by the stored attractor prototypes via learning. Furthermore, there is another difference in the dynamics, explained next.

Darwinian evolution is often described as a parallel, distributed search on a fitness landscape [8]. The population, like an amoeba, climbs the peaks of the landscape via iterative replication and selection in successive generations. The attractors, however, impose a different mode of evolution, because they simply return the prototype pattern closest to the input, even if it is less fit than the input pattern itself. Consequently, attractor networks work against variability and slow down hill-climbing. However, attractor networks resist fitness increase only half of the time on average; the other half of the time they effectively push inferior patterns uphill on the fitness landscape, at a speed much higher than expected for the same (reduced) amount of genetic variation. Consequently, attractor networks can facilitate evolution [21].

We stress the importance of evolutionary combinatorial search. In cases where ab initio calculation of molecular functionality is impossible, artificial selection on genetic variants has proven an extremely useful method to generate molecular solutions to functional problems, as experiments on the generation of catalytic RNA molecules (ribozymes) illustrate (see [43] for a recent review). By the same token, when a brute-force numerical calculation of a combinatorial functionality problem is impossible for the brain, given the adequate architecture it could (and we suggest it does) use an evolutionary search mechanism, as shown in this paper.

Implementation in the brain. It is of primary importance that all the components of our ‘algorithmic diagram’ (Figure 1) can be implemented by mechanisms known in the brain. It is likely that the cortical hypercolumn [44] behaves like (at least) one attractor network. The reciprocal connections between the long-term and working memory networks are assumed to be like those in Figure 3 of [45]. We propose that, first, the reinforcing signal from the basal ganglia via the thalamus keeps the good candidate pattern solutions active in the rewarded auto-associative network and, second, that the latter sends a copy of the active pattern to (unconscious) working memory for eventual redistribution. When there is an increase in the quality of a solution (fitness increase), or when a local or global optimum is reached, the central executive elicits the transmission of a copy of the solution to conscious memory [46].

Our proposed mechanism relies on the transmission of patterns, with variation, between cortical groups and relevant subcortical structures, and in this way it differs from all previous models. A discussion of timescales is in order. Without learning newly generated variants, the selective mode would require about the same time as suggested by [47] for the “cognitive cycle” (based on data), but without perception at the beginning of the cycle, i.e. it would be in the 160–310 ms range. In the evolutionary mode, learning of new variants is required, which takes more time: a conservative estimate for the reliable expression of changed synaptic weights is between seconds and minutes [48,49]. The seconds scale would allow several dozen cycles (generations) per minute—a very good number by the standards of artificial selection experiments. A more accurate estimate of timing will require a fully neuronal model of our proposed mechanism.

Copying of information is necessary for evolutionary processes: this is where previous approaches [22–24,50] have been too artificial. There are three well-known instances where scholars invoke copying of information in the brain: (i) the transfer of information from short-term to long-term memory [35,51,52]; (ii) the transfer of information from specialized cortical areas to working memory [53] or to the global workspace [54,55]; and (iii) the possible copying of patterns from working memory to conscious processing [56]. Undoubtedly, all these approaches require accurate information transfer between different brain components [57].

There are three sources of variation in our system: (i) owing to the finite size of our networks and the asynchronous update of the neurons, the output patterns show some variation even if a network is repeatedly provoked by the same input pattern; (ii) acknowledging the noisiness of neuronal transmission, we introduce variation akin to mutation when patterns are transmitted among the blocks of the model; and (iii) “spurious patterns” [58] emerge as by-products of previously trained patterns and may act as (new) recombinants of learnt patterns that facilitate the evolutionary search.

The non-conscious or implicit working memory, which has received considerable attention lately [46,56], is crucial for our proposed mechanism. Irrespective of whether working memory overlaps with the conscious domain [59] or not [56] (in the latter case a ‘conscious copy’ must be sent from working memory to conscious access), the important factor is that the bound on the number of patterns that can be held in the unconscious part of working memory is larger than that of conscious working memory [59]. In other words, our mechanism requires that the total storage capacity of the unconscious network population be much higher than that of the conscious one. Crucially, there is support for this requirement: there is evidence that the central executive function of working memory is not restricted to the conscious domain either [46]. The relatively large capacity of (unconscious) working memory can hold not one but several patterns selected by the cortex–striatum–basal ganglia loop. This type of selection can be realized by a winners-share-all (WSA) mechanism [60].

The latter point requires special attention; the reader is referred to the recent review in [61] on models of action selection and reinforcement learning. We wish to make a few critical points in this regard. First, since we are considering problem solving that unfolds its capacity online, there is no reason to select a single pattern: the interim patterns are not alternative actions but only candidate solutions. They can be turned into actions at some point during, or only at the very end of, the evolutionary search. Weak lateral inhibition within the evaluation mechanism enhances value differences in selection, but a single winner is not selected [60,61]. Second, the parallelism of the evolutionary approach loses considerable power if evaluations are not done in parallel, and if poor solutions cannot be replaced by better ones in storage. (In a subsequent study we shall show that the number of parallel evaluations is allowed to be considerably smaller than the population size, but also that purely serial evaluation of candidates is a killer.) Third, it is perfectly possible that the WSA part is implemented by the cortex rather than the striatum (cf. [61]): we remain agnostic on this point for the time being. (Admittedly, that option would require a different version of the full model.) Fourth, we maintain that parallel survival of a number of candidates should happen, and the mechanism for this might have evolved under selection for complex (offline) problem solving. Fifth, it is quite possible that WSA is gradually reduced towards WTA (winner-takes-all) during evolutionary search in the brain: this would guard against premature convergence early in the search while allowing fast convergence towards its end.

To sum up, we have seen that a process analogous to natural selection can be rendered into a neuronal model employing known neurophysiological mechanisms. Now we discuss relations to some other publications and outline future work.

Related work. Several examples show that evolution combined with neurodynamics can be more powerful than either component alone. Fernando et al. [25] showed that the path evolution algorithm – which includes both elements of structural plasticity [62,63] and iterative generation of variation – is more powerful in several regards than classical genetic algorithms. Fernando et al. have also shown that replication combined with Hebbian learning is more powerful than classical natural selection in a model of mechanistic copying of binary activity [22]. De Vladar and Szathmáry showed that the synergy between selection and learning results in increased evolvability; they also pointed out that synaptic plasticity helps escape impasses and build circuits that are tailored to the problem [21]. Finally, in a recent model, Fernando and colleagues used autoencoders for the generation of “neuronal genotypes” [64]. Since autoencoders produce compressed representations of the input, we expect them to successfully replace the identity function (i.e. bit-by-bit copying, as in DNA replication). Indeed, applying this neural component within the context of a genetic algorithm turned out to be rewarding.

Unless the envisaged information transfer is accurate enough in space and time, the evolutionary dynamics breaks down. Just as in genetic evolution, where the error threshold [17] had to be raised by evolving adaptations for more accurate copying before long informative genomes could arise [65], in the neuronal context the element of accuracy was raised by Adams [18]. In his “Hebb and Darwin” paper, Adams talks about synaptic replication and synaptic mutation as important ingredients of a Darwinian view of the brain. Synaptic replication means either the strengthening of an existing synapse, or the making of a new synapse between two neurons that already share one. Adams’s insight is important: evolutionary dynamics does not need copying for selection (scoring or strengthening is enough), but it needs copying with errors to test new variants against the others. Synaptic mutation happens when a neuron grows a synapse towards a neighboring neuron with which it previously had no contact. Interestingly, these thoughts preceded the burst of interest in structural synaptic plasticity (SSP) [62,63]. Following his expansion-renormalization model of SSP [66], Kilgard observes that when SSP is used for learning something new, it can be regarded as a Darwinian mechanism [37], as it generates and tests variations in successive rounds based on what is already there (unlike the original models of “neural Darwinism”). Kilgard’s mechanism has not yet been formalized (although see [21]), but the path evolution model [25] bears some relationship to it.

We share the view of Eliasmith [53] that the cortex/basal ganglia/thalamus/cortex loop plays a crucial role not only in elementary action selection but also in symbolic reasoning. We conjecture that non-primate animals (in particular mammals and birds) employ the same (or at least an analogous) loop to retrieve old solutions and to innovate new ones, in a way similar to what we have shown with our elementary model.

Another view to which we feel strongly related is that of Bayesian models advocating “theory learning as stochastic search in a language of thought” [27]. We are reasonably confident that we have found a candidate mechanism for the search process; if so, the rugged learning landscape in Figure 3 of [27] can be directly interpreted as the fitness landscape of our neuro-evolutionary model. A task for the future is to work out the explicit relations in detail. We note again the formal link between Bayesian inference and evolutionary selection [31,32] mentioned in the Introduction. Our mechanism (Figure 1) could in principle implement, with appropriate modifications, an estimation of distribution algorithm (EDA). The population-based incremental learning (PBIL) algorithm consists of the following steps [67]: (i) generate a population from the probability vector; (ii) evaluate and rank the fitness of each member; (iii) update the probability vector based on the elite individual; (iv) mutate the probability vector; (v) repeat steps (i)–(iv) until a finishing criterion is met. EDAs can work better than copying-based algorithms, making them an interesting line to pursue.
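
For illustration, a minimal PBIL sketch over ±1 patterns (parameter values are arbitrary; the numbered comments refer to steps (i)–(iv) above):

    def pbil(fitness, N, pop_size=50, lr=0.1, p_mut=0.02, shift=0.05,
             generations=1000, rng=None):
        # Population-based incremental learning over +/-1 patterns of length N.
        if rng is None:
            rng = np.random.default_rng()
        p = np.full(N, 0.5)                                       # P(bit = +1)
        for _ in range(generations):
            pop = np.where(rng.random((pop_size, N)) < p, 1, -1)  # (i) generate
            elite = max(pop, key=fitness)                         # (ii) evaluate and rank
            p += lr * ((elite + 1) / 2 - p)                       # (iii) pull towards elite
            mask = rng.random(N) < p_mut                          # (iv) mutate the vector
            p[mask] = np.clip(p[mask] + rng.normal(0.0, shift, mask.sum()),
                              0.0, 1.0)
        return p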

Future work will be to link our recurrent model with the feedforward autoencoder model of Fernando et al.64, since the latter can generate interesting genotypes (better substrates for selection) due to the emerging compressed representations of the inputs.

As two experts aptly remark: “The Bayesian brain falls short in explaining how the brain creates new knowledge” ([68], p. 9). We suggest that neuronal evolutionary dynamics might serve as a remedy.

Data and software availability

Zenodo: IstvanZachar/Neurodynamics: Publication release, doi: 10.5281/zenodo.154113 [69].

The algorithm described in the paper is also available on GitHub at https://github.com/IstvanZachar/Neurodynamics.

How to cite this article: Szilágyi A, Zachar I, Fedor A et al. Breeding novel solutions in the brain: a model of Darwinian neurodynamics [version 1; peer review: 1 approved, 2 approved with reservations]. F1000Research 2016, 5:2416 (https://doi.org/10.12688/f1000research.9630.1)
Open Peer Review

Reviewer Report, 11 Oct 2016 – Karl Friston, Wellcome Trust Centre for Neuroimaging, University College London, London, UK. Status: Approved. “This is a thought provoking simulation study of Darwinian neurodynamics. It uses populations of attractor networks to illustrate the distinction between purely selectionist and evolutionary optimisation. This demonstration rests upon the dynamical instability of the neuronal networks considered – and …”

Reviewer Report, 25 Oct 2016 – Stuart J. Edelstein, École Normale Supérieure, Paris, France. Status: Approved with Reservations. “This article continues the important effort of the authors and their colleagues to advance a line of research capable of fleshing out the somewhat vague concept subsumed under the heading Neural Darwinism. While the general idea that subtle functions of …”

Reviewer Report, 25 Nov 2016 – László Acsády, Laboratory of Thalamus Research, Institute of Experimental Medicine, Hungarian Academy of Sciences, Budapest, Hungary. Status: Approved with Reservations. “This is a truly enlightening and thought provoking paper which utilizes Darwinian logic to explain neuronal network dynamics. The manuscript is a nice example of how approaches in a different discipline (population genetics) may yield fresh insights into age old …”

The authors (author responses of 29 Jun 2017, István Zachar, Parmenides Center for the Conceptual Foundations of Science, Munich/Pullach, Germany) replied to each report.