1 Introduction

A central pattern generator (CPG) in the spinal cord is currently the accepted model for explaining locomotor control in animals (Grillner and El Manira 2020). Early experiments showed that decerebrated cats can locomote when held on a propelled treadmill (Brown 1911). Later evidence showed a reciprocal organization of inhibitory spinal interneurons (Jankowska et al. 1967; Hultborn et al. 1971; Cohen and Harris-Warrick 1984), which could facilitate alternation between antagonistic muscles and between left and right limbs. Isolated spinal cords deprived of afferent input can generate locomotor patterns without rhythmic input (Cohen and Harris-Warrick 1984; Grillner and Wallen 1985; Grillner and El Manira 2020); such activity patterns are called fictive locomotion. All of these phenomena have been ascribed to rhythm-generating neurons (Ziskind-Conhaim et al. 2008; Brocard et al. 2013). In addition, other neurons belong to the locomotor circuits, and aspects of the connectivity between them have been described as depending on their gene expression phenotype (Danner et al. 2017). The combination of these findings has been used to generate models that are sufficient to explain the coordinated patterns of motor output thought to underlie locomotor control (Rybak et al. 2013; Molkov et al. 2015; Shevtsova et al. 2015; Danner et al. 2017; Ausborn et al. 2017).

This current understanding of the spinal cord has a number of limitations. Most rhythm-generating mechanisms proposed for the spinal cord circuitry rely on rhythm-generating neurons (Grillner and El Manira 2020). In adult animals in vivo, evoking rhythmic activity in the spinal cord in the absence of sufficient afferent input requires pharmacological manipulation (Meehan et al. 2012); without such manipulation, spinal neurons do not intrinsically generate rhythms (Spanne et al. 2014; Kohler et al. 2020). Furthermore, the normal locomotor behavior of animals in nature is far more varied, and far more flexibly adapted to the terrain, than current CPG models can explain. In most CPG models (Rybak et al. 2013; Molkov et al. 2015; Shevtsova et al. 2015; Danner et al. 2017; Ausborn et al. 2017), the connectivity between neurons is assumed to be fixed, which means that such models would not be compatible with, for example, reorganized biomechanics. Yet after tendon transfer surgery, cats are still capable of developing normal locomotor behavior (Loeb 1999). One genetically defined class of spinal interneuron is known to change its role in locomotor control during development from larva to adult (Picton et al. 2022). Moreover, afferent wiring is activity dependent (Granmo et al. 2008), spinal withdrawal reflexes depend on experience (Petersson et al. 2003), and the diversity of spinal interneuron input patterns also suggests a major impact of learning (Kohler et al. 2022). Beyond the flexibility of the spinal cord circuitry, the observation that “the skate spinal cord has the same interneuronal building blocks available to form the locomotor network as seen in mammals” (Grillner 2018) is in fact another strong indication that the spinal cord circuitry is, to a large extent, dependent on learning. If the genetic construction of the circuitry is the same across species that have very different biomechanical configurations, this must mean that the spinal cord circuitry is highly adaptable to the mechanics of the body it is connected to.

We propose a new model of biological locomotor control to address the limitations of current CPG models. Our model could explain how the spinal cord acquires the capability to contribute to locomotor control. This model uses neither neurons that intrinsically generate rhythm nor an initial network structure that would facilitate rhythm generation. Instead, we use a generic network of neurons without any assumption of a priori formed connectivity, a choice justified by recent results (Kohler et al. 2022). The network is attached to a mechanical system, which has a tendency toward intrinsic rhythmic movement but is unable to sustain such movement on its own because of damping. Moreover, we assume that the synaptic weights of the neurons can be learned according to the Bienenstock–Cooper–Munro (BCM) rule (Bienenstock et al. 1982; Intrator and Cooper 1992). This rule has been proposed to explain learning in the visual cortex and is widely considered a good model of synaptic learning in the brain. The BCM rule seeks synaptic weights such that the firing rate of the neuron has a bimodal distribution, and it implements a homeostatic mechanism that stabilizes the average firing rate. Previously, the BCM rule has been applied to learning from passively received input. In our model, the neurons learn from sensory input from the mechanical system, whose actuation is controlled solely by the output of these neurons. Hence, in our model the neurons learn from input that they themselves are responsible for creating, in a closed loop. The use of learning in a spinal cord model is justified by experimental results (Loeb 1999; Petersson et al. 2003; Granmo et al. 2008; Kohler et al. 2022; Picton et al. 2022) and recent modeling (Enander et al. 2022a, b). Our model was capable of learning to control two quite different mechanical systems: two independent pendulums, as a model of two independent limbs, and a double pendulum, as a basic model of a leg or an arm. Our results show that under these conditions and assumptions, a simplified spinal cord circuitry can adapt to different mechanical systems to generate rhythmic and alternating movement, in that respect being equivalent to current CPG models. These results are in line with recent models of learning in the spinal cord that successfully explain the formation of connectivity patterns in the spinal cord (Enander et al. 2022a, b). Even though we did not use many known features of spinal neurons or known connectivity patterns of the spinal cord, our results do not contradict other current models. Rather, they should be interpreted as additional mechanisms that could help explain the functionality of the spinal cord circuitry.

2 Methods

2.1 Neuron and network model

The neuron model was similar to a previous non-spiking model (Rongala et al. 2018, 2021). Each neuron had an output activity, a time-continuous quantity called the firing rate, which was modeled as follows. Let \(\nu _i^+ \in [0,1], i \in 1, \ldots , n\) be the firing rates of neurons and external inputs connected with positive weights \(w_i^+ \ge 0\) to the neuron and \(\nu _j^- \in [0,1], j \in 1, \ldots , m\) the firing rates of neurons connected with negative weights \(w_j^- < 0\). The dynamics of the voltage \(V\) were

$$\begin{aligned} \tau \frac{\textrm{d}V}{\textrm{d}t} = -V + (1-V) \sum w^+_i \nu ^+_i + (1+V) \sum w^-_j \nu ^-_j \hspace{5.0pt}, \end{aligned}$$

where \(\tau = {5}\) ms. The term \(-V\) modeled a leak, and the factors \(1-V\) and \(1+V\) modeled reversal potentials, which limited the maximum and minimum voltages the neuron could attain. The neuron's firing rate \(\nu = \max (0, V)\) was computed by thresholding the voltage at zero.
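
As an illustration (not the published implementation; the function and variable names are ours), a minimal C++ sketch of one integration step of this neuron model, using the backward Euler discretization described in Sect. 2.6 with the input firing rates evaluated at the current step:

    #include <algorithm>
    #include <cstddef>
    #include <vector>

    // One backward (implicit) Euler step of the neuron voltage dynamics
    //   tau dV/dt = -V + (1 - V) * sum_i(w+_i * nu+_i) + (1 + V) * sum_j(w-_j * nu-_j).
    // Solving the implicit equation for the new voltage gives a closed-form update.
    double step_voltage(double V, double dt, double tau,
                        const std::vector<double>& w_pos, const std::vector<double>& nu_pos,
                        const std::vector<double>& w_neg, const std::vector<double>& nu_neg)
    {
        double s_pos = 0.0, s_neg = 0.0;   // summed excitatory and inhibitory drive
        for (std::size_t i = 0; i < w_pos.size(); ++i) s_pos += w_pos[i] * nu_pos[i];
        for (std::size_t j = 0; j < w_neg.size(); ++j) s_neg += w_neg[j] * nu_neg[j];  // s_neg <= 0
        double a = tau / dt;
        // V_new solves a*(V_new - V) = -V_new + (1 - V_new)*s_pos + (1 + V_new)*s_neg
        return (a * V + s_pos + s_neg) / (a + 1.0 + s_pos - s_neg);
    }

    // The firing rate is the voltage thresholded at zero.
    double firing_rate(double V) { return std::max(0.0, V); }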

The network structure was defined by the connection weights between the neurons and between inputs and neurons. Connections from a neuron to itself were not present in this model. All weights were subject to learning, as described in the next section. The weights between two distinct neurons were initialized with a random sample from \({\mathcal {U}}(-0.9, 0.9)\) (uniform distribution). The connections between an external input and a neuron were initialized with a random sample from \({\mathcal {U}}(1.5, 2.9)\), to somewhat bias the neurons toward external input.

2.2 Learning

The learning model was adapted from the BCM rule (Bienenstock et al. 1982; Intrator and Cooper 1992). For each neuron with firing rate \(\nu _\text {out}\), a learning threshold \(\phi \) was maintained. The learning threshold was a low-pass filtered version of the squared firing rate, with a time constant \(\tau _{\phi } = {500}\,{\text{ ms }}\),

$$\begin{aligned} \tau _{\phi } {\dot{\phi }} = -\phi + \nu _\text {out}^2 \hspace{5.0pt}. \end{aligned}$$

The time constant \(\tau _{\phi }\) was chosen to be substantially larger than \(\tau \).

For a connection with weight \(w\), in which a neuron with firing rate \(\nu _\text {in}\) provided input to a neuron with firing rate \(\nu _\text {out}\), the change of the weight was

$$\begin{aligned} \tau _w {\dot{w}} = \nu _\text {out} \left( 0.5 \nu _\text {out} - \phi \right) \nu _\text {in} \hspace{5.0pt}, \end{aligned}$$

where \(\tau _w ={10}\) s. For the original BCM rule, an equilibrium is obtained when the learning threshold and the firing rate are both either one or zero. For the model neurons used here, a voltage of one cannot be reached. Therefore, the factor 0.5 was introduced in the above equation, which moved the nonzero equilibrium to a firing rate of 0.5.
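
For concreteness, a short C++ sketch of the modified BCM update used here (forward Euler integration as in Sect. 2.6; the function and variable names are illustrative, not taken from the published code):

    // Forward Euler update of the learning threshold phi and of one synaptic weight w:
    //   tau_phi * dphi/dt = -phi + nu_out^2
    //   tau_w   * dw/dt   =  nu_out * (0.5 * nu_out - phi) * nu_in
    // Time constants in seconds: tau_phi = 0.5 s, tau_w = 10 s; dt = 0.001 s in the simulations.
    void bcm_step(double& phi, double& w, double nu_in, double nu_out,
                  double dt, double tau_phi = 0.5, double tau_w = 10.0)
    {
        phi += dt / tau_phi * (-phi + nu_out * nu_out);
        w   += dt / tau_w   * nu_out * (0.5 * nu_out - phi) * nu_in;
    }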

2.3 Mechanical models

We used two mechanical system models: two independent pendulums (Fig. 1a) and a double pendulum (Fig. 1b). The models consisted of links connected by rotational joints, either to another link or to an immovable base. All joints had friction. Otherwise, the specific choice of models and parameters was arbitrary.

2.3.1 Independent pendulums

Let \(\theta _i\), \(i \in \{1,2\}\) be the joint angles of the two pendulums. Each joint contained a linear spring with spring constant \(\kappa = 1\,\frac{1}{{\text{ s }}^2}\) and had friction \(\beta = 0.1\, \frac{1}{{\text{ s }}}\). An external torque \(F_{i}\) could be applied to each joint. No gravity acted on the pendulums. The dynamics of pendulum \(i\) were

$$\begin{aligned} \ddot{\theta _i} = - \kappa \theta _i - \beta \dot{\theta _i} + F_{i} \hspace{5.0pt}. \end{aligned}$$
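
A minimal C++ sketch of this dynamics (stepped here with a simple semi-implicit Euler scheme for brevity; the actual simulations used a Runge–Kutta 4(5) integrator, see Sect. 2.6; names are illustrative):

    // One spring-loaded, damped pendulum joint: theta'' = -kappa*theta - beta*theta' + F
    struct Pendulum {
        double theta = 0.0;   // joint angle
        double omega = 0.0;   // joint velocity
        double kappa = 1.0;   // spring constant [1/s^2]
        double beta  = 0.1;   // friction       [1/s]

        void step(double F, double dt) {
            double acc = -kappa * theta - beta * omega + F;
            omega += dt * acc;       // semi-implicit Euler
            theta += dt * omega;
        }
    };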

2.3.2 Double pendulum

Let \(\theta _1\) and \(\theta _2\) be the joint angles of the two joints in the double pendulum. Both links had length \(l = 2\) m and mass \(m = 1\) kg, and the center of mass of each link was located at \(l_c = 1\) m from its joint. The moment of inertia of each link was \(I = 1\, {\text{ kgm}^2}\), and the friction in each joint was \(\beta = 1\, \frac{1}{{\text{ s }}}\). Gravity \(g = 9.81\, {\text{ m/s}^2}\) acted on the pendulum. The dynamics were given by

$$\begin{aligned} M(\theta _1, \theta _2) \begin{pmatrix} \ddot{\theta _1} \\ \ddot{\theta _2} \end{pmatrix} + C(\theta _1, \theta _2, \dot{\theta _1}, \dot{\theta _2}) \begin{pmatrix} \dot{\theta _1} \\ \dot{\theta _2} \end{pmatrix} + G(\theta _1, \theta _2) = \begin{pmatrix} F_1 \\ F_2 \end{pmatrix} - \beta \begin{pmatrix} \dot{\theta _1} \\ \dot{\theta _2} \end{pmatrix} \hspace{5.0pt}. \end{aligned}$$

The mass matrix was

$$\begin{aligned} M(\theta _1, \theta _2) = \begin{pmatrix} a + 2 b \cos (\theta _2) & c + b \cos (\theta _2) \\ c + b \cos (\theta _2) & c \end{pmatrix}\hspace{5.0pt}, \end{aligned}$$

where

$$\begin{aligned} a&= 2I + m l_c^2 + 2 m l^2 \\ b&= m l_c l \\ c&= I + m l_c^2 \end{aligned}$$

With \(h = -m l_c l \sin (\theta _2)\), the Coriolis matrix was

$$\begin{aligned} C(\theta _1, \theta _2, \dot{\theta _1}, \dot{\theta _2}) = \begin{pmatrix} h \dot{\theta _2} & h (\dot{\theta _1} + \dot{\theta _2}) \\ -h \dot{\theta _1} & 0 \end{pmatrix} \hspace{5.0pt}. \end{aligned}$$

The gravity vector was

$$\begin{aligned} G(\theta _1, \theta _2) = \begin{pmatrix} (m l_c + m l) g \cos (\theta _1) + m l_c g \cos (\theta _1 + \theta _2) \\ m l_c g \cos (\theta _1 + \theta _2) \end{pmatrix} \hspace{5.0pt}. \end{aligned}$$

The Cartesian coordinates of the endpoint of the double pendulum were

$$\begin{aligned} x&= l \cos (\theta _1) + l \cos (\theta _1 + \theta _2) \\ y&= l \sin (\theta _1) + l \sin (\theta _1 + \theta _2) \hspace{5.0pt}. \end{aligned}$$
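
The forward dynamics of the double pendulum can be sketched in C++ by assembling M, C and G and solving the resulting 2x2 linear system for the joint accelerations. This is an illustrative sketch with our own naming, not the published implementation (which used Eigen and boost::odeint, see Sect. 2.6):

    #include <array>
    #include <cmath>

    struct DoublePendulum {
        // Parameters from Sect. 2.3.2
        double l = 2.0, m = 1.0, lc = 1.0, I = 1.0, beta = 1.0, g = 9.81;
        // State: joint angles and velocities
        double th1 = 0.0, th2 = 0.0, w1 = 0.0, w2 = 0.0;

        // Joint accelerations for applied torques F1, F2.
        std::array<double, 2> accelerations(double F1, double F2) const {
            double a = 2*I + m*lc*lc + 2*m*l*l;
            double b = m*lc*l;
            double c = I + m*lc*lc;

            // Mass matrix M
            double M11 = a + 2*b*std::cos(th2);
            double M12 = c + b*std::cos(th2);
            double M21 = M12, M22 = c;

            // Coriolis term C(theta, thetadot) * thetadot
            double h  = -m*lc*l*std::sin(th2);
            double C1 = h*w2*w1 + h*(w1 + w2)*w2;
            double C2 = -h*w1*w1;

            // Gravity vector G
            double G1 = (m*lc + m*l)*g*std::cos(th1) + m*lc*g*std::cos(th1 + th2);
            double G2 = m*lc*g*std::cos(th1 + th2);

            // M * thetaddot = F - C*thetadot - G - beta*thetadot
            double r1 = F1 - C1 - G1 - beta*w1;
            double r2 = F2 - C2 - G2 - beta*w2;

            // Solve the 2x2 system by Cramer's rule
            double det = M11*M22 - M12*M21;
            return { (r1*M22 - M12*r2) / det, (M11*r2 - M21*r1) / det };
        }
    };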

2.4 Network and mechanics closed loop

In order for the network to control the mechanical system, the two systems were connected in a closed loop (Fig. 1c). Each neuron received as external inputs the joint angles and velocities as well as their negated versions, each thresholded at zero and limited to one. The inputs were \(\min (\max (0, x),1)\) and \(\min (\max (0, -x),1)\) for all \(x \in \{\theta _1, \theta _2, \dot{\theta _1}, \dot{\theta _2} \}\). This way, all inputs to the neurons were positive, and negative angles and velocities were presented to the neurons on different input channels than positive angles and velocities. In addition to the input from the mechanical system, each neuron received a motor command in the form of a random external input. This input was used to initiate movement in the system. It was generated for each neuron separately by random sampling from \({\mathcal {U}}(0, 0.9)\).

The torques applied at the joints were computed from the firing rates of the neurons as

$$\begin{aligned} F_1&= f (\nu _1 + \nu _2 - \nu _3 - \nu _4) \\ F_2&= f (\nu _5 + \nu _6 - \nu _7 - \nu _8) \hspace{5.0pt}, \end{aligned}$$

where \(f\) is a constant factor that scaled the firing rates of the neurons so that torques leading to reasonable movement in the mechanical system were produced. For the independent pendulums \(f = 12\), and for the double pendulum \(f = 6\). If this model is to be adapted to mechanical systems other than those presented here, this factor must be adapted as well.
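
The sensory encoding and the mapping from firing rates to joint torques can be summarized by the following C++ sketch (illustrative only; the array layout and names are our own):

    #include <algorithm>
    #include <array>

    // Rectified, saturated sensory channel: min(max(0, x), 1).
    double channel(double x) { return std::min(std::max(0.0, x), 1.0); }

    // Eight sensory inputs for a two-joint system: each joint angle and velocity
    // is represented by one channel for its positive and one for its negative part.
    std::array<double, 8> encode(double th1, double th2, double w1, double w2) {
        return { channel(th1), channel(-th1), channel(th2), channel(-th2),
                 channel(w1),  channel(-w1),  channel(w2),  channel(-w2) };
    }

    // Joint torques from the eight neuron firing rates nu[0..7], scaled by the force
    // factor f (12 for the independent pendulums, 6 for the double pendulum).
    std::array<double, 2> torques(const std::array<double, 8>& nu, double f) {
        return { f * (nu[0] + nu[1] - nu[2] - nu[3]),
                 f * (nu[4] + nu[5] - nu[6] - nu[7]) };
    }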

2.5 Learning and testing

The network and mechanics were initialized so that the neuron voltages were zero and there was no movement. Weights were initialized randomly as described above. First, learning was disabled and it was tested whether rhythmic movement could be generated. This was done by sampling a constant random external input (motor command) for each neuron independently, running the simulation for \({100}\,{\text{ s }}\) and then determining whether rhythmic movement was generated in the second half of the simulation, as described below. The test set was composed of one hundred random combinations of motor command inputs, each lasting \({100}\,{\text{ s }}\). Then learning was enabled and the simulation was run for \({2000}\,{\text{ s }}\), with a new random motor command sampled for each neuron every \({1}\,{\text{ s }}\). Finally, learning was disabled again and it was tested whether rhythmic movement could be generated, using the same set of one hundred test inputs in the same way as before the learning.
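
In outline, this protocol corresponds to the following C++-style driver. The helper functions reset_state() and run() and the command layout are placeholders standing in for the components described above; they are not part of the published code:

    #include <array>
    #include <random>
    #include <vector>

    // Placeholder stubs for the simulation of the network and mechanics.
    void reset_state() { /* set voltages, angles and velocities to zero */ }
    void run(double duration_s, bool learning, const std::array<double, 8>& motor_cmd) { /* ... */ }

    void protocol(std::mt19937& rng) {
        std::uniform_real_distribution<double> cmd(0.0, 0.9);

        // 100 fixed test commands, one constant value per neuron, reused before and after learning.
        std::vector<std::array<double, 8>> tests(100);
        for (auto& t : tests) for (auto& c : t) c = cmd(rng);

        for (const auto& t : tests) { reset_state(); run(100.0, false, t); }   // tests before learning

        reset_state();
        for (int s = 0; s < 2000; ++s) {               // 2000 s of learning,
            std::array<double, 8> c;                   // new command every second
            for (auto& ci : c) ci = cmd(rng);
            run(1.0, true, c);
        }

        for (const auto& t : tests) { reset_state(); run(100.0, false, t); }   // tests after learning
    }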

In order to determine whether there was rhythmic movement in a test, the joint angle time sequences were plotted. The plots were inspected to check whether rhythmic movement was visible and whether the amplitude did not decay in the second half of the test.

Additionally, to visualize the results of all tests, the autocorrelation of each joint angle sequence from the second half of each test was computed. The period length of the oscillation was taken as the lag of the second largest local maximum of the autocorrelation. If only one local maximum was present, the period length was estimated as zero, which indicated that there was no rhythmic movement. The amplitude of the oscillation for each joint angle in each test was computed as the difference between the largest and the smallest value of the joint angle.
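
A possible C++ implementation of these period and amplitude estimates, assuming the joint angle trace is uniformly sampled with time step dt (a sketch, not the published analysis code):

    #include <algorithm>
    #include <cstddef>
    #include <vector>

    // Period estimate: lag of the largest local maximum of the autocorrelation at lag > 0
    // (i.e., the second largest overall, the largest being at lag 0). Returns 0 when no
    // such maximum exists, indicating no rhythmic movement. O(n^2), acceptable for a sketch.
    double estimate_period(const std::vector<double>& x, double dt) {
        std::size_t n = x.size();
        if (n < 3) return 0.0;
        double mean = 0.0;
        for (double v : x) mean += v;
        mean /= static_cast<double>(n);

        std::vector<double> ac(n, 0.0);
        for (std::size_t lag = 0; lag < n; ++lag)
            for (std::size_t t = 0; t + lag < n; ++t)
                ac[lag] += (x[t] - mean) * (x[t + lag] - mean);

        double best = 0.0;
        std::size_t best_lag = 0;
        for (std::size_t lag = 1; lag + 1 < n; ++lag)
            if (ac[lag] > ac[lag - 1] && ac[lag] > ac[lag + 1] && ac[lag] > best) {
                best = ac[lag];
                best_lag = lag;
            }
        return static_cast<double>(best_lag) * dt;   // 0 when no local maximum was found
    }

    // Amplitude estimate: difference between the largest and smallest joint angle.
    double amplitude(const std::vector<double>& x) {
        auto [mn, mx] = std::minmax_element(x.begin(), x.end());
        return *mx - *mn;
    }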

2.6 Numerical methods

The neurons, the learning and the mechanical system were simulated with an integration step size of \({1}\,{\text{ ms }}\). The neuron dynamics were integrated with the backward (inverse) Euler method, the learning with the forward Euler method, and the mechanical system with a Runge–Kutta 4(5) method. The entire simulation was implemented in C++ using the Eigen (Guennebaud and Jacob 2010) and boost::odeint libraries.

3 Results

Fig. 1

Illustration of the models. a Independent pendulums. b Double pendulum. c Structure of the neural network. Inputs and outputs to and from the mechanical system (black arrows), connections between the neurons (magenta arrows) and connections from the random motor command inputs to the neurons (blue arrows). Each neuron received an independent level of external input in one synapse only, whereas all inputs (sensory and internal) reached all neurons. d Motor commands used for network training and for testing for the presence of rhythmic movement. Each test motor command lasted for \({100}\,{\text{ s }}\) and was composed of random but constant levels of input activity in each external input. Before and after the learning, the same set of 100 tests was applied. After each test, the activity of the system, the neuron firing rates and the movements of the pendulums were reset to zero. During the learning, the random input was resampled every second and a much higher number of input combinations was applied

We used two different pendulum systems to test whether rhythmic movements could be learned using the BCM rule in neurons modeled without any intrinsic active conductances or rhythm generation (Fig. 1). The pendulums were designed to hang from an attachment point, similar to a leg (Fig. 1a, b). In one case, we used two independent pendulums, similar to left and right legs, which were spring-loaded with the equilibrium in the vertical orientation. The double pendulum had no springs, but gravity acted on the masses of the two segments. Both pendulums had friction in the joints, so that active actuation was required to generate and sustain movement. The joints had sensors that reported the joint angle and joint velocity. These sensor signals were thresholded and then provided as input to a neural network (Fig. 1c). In order to signal the joint angles and velocities in both directions, the sensor information from each joint was duplicated and half of the sensor information set was inverted before the thresholding (Fig. 1c). Hence, each sensor signal was excitatory. Furthermore, each neuron received input from each of the eight sensors. The eight neurons formed a one-layer, fully connected network, in which each neuron was connected to every other neuron but not to itself. The weights of all synapses were modifiable through learning. The neurons had outputs that drove movements at the joints (joint torque input): each joint was controlled by four neurons, of which two contributed a negative torque proportional to their firing rate (in this case the thresholded voltage output of the neuron) and two contributed a positive torque. Each neuron in addition received a unique motor command input, which was a random but constant excitatory input level, through a synapse whose weight was also modifiable through learning. Motor command inputs were required to elicit movement and thereby sensory feedback information. One simulation was composed of an initial test set with 100 randomized motor command input combinations, followed by a learning phase with 2000 different motor commands (Fig. 1d) and finally another test set with the same motor command inputs as in the first test.

3.1 Generation of rhythmic movement

Fig. 2

Movement patterns before and after the learning. a, b Examples of joint angle sequences for each mechanical system from one test, before and after the learning. c Movement sequence of the double pendulum during one test after the learning. d, e Estimated oscillation period length and amplitude of each joint for each mechanical system and each test, before the learning (green points) and after the learning (red points). The points are offset along both axes by a random sample from \({\mathcal {U}}(-0.5, 0.5)\) for the period plots and \({\mathcal {U}}(-0.005, 0.005)\) for the amplitude plots in order to improve readability

The BCM rule allowed our model to learn to produce rhythmic and alternating movement in both mechanical systems (Fig. 2a–c).

After the learning, for the independent pendulums system, almost all tests showed rhythmic movement with an amplitude larger than in any test with rhythmic movement before the learning (Fig. 2d). Only a few tests before the learning showed rhythmic movement at all. The distributions of the periods and amplitudes were narrow both before and after the learning, indicating that the structure of the external input (motor command) did not greatly impact the movement pattern in either case (Fig. 2d). In all tests, the rhythmic movements of the two pendulums were alternating, despite there being no mechanical coupling between the two independent pendulums.

For the double pendulum, the system showed rhythmic movement throughout the entire test phase in all tests before the learning, but the amplitude of the movement decayed with each oscillation cycle (Fig. 2b, e). The movement of the double pendulum resembled an arm repeatedly picking up and lifting an object (Fig. 2c). The trajectory drawn out by the endpoint of the double pendulum is shown in Fig. 3c. After the learning, all tests showed rhythmic movement with non-decaying amplitude. Across all tests after the learning, the movements had similar periods and amplitudes for each joint (Fig. 2e), indicating consistent rhythmic movement.

These two experiments illustrate that the network was able to adapt to both mechanical systems and to produce rhythmic movement in both of them.

3.2 Distribution of the neuron voltages

Fig. 3

Distribution of the neuron voltages. a, b Distributions of the neuron voltages for each neuron (one neuron per row). The voltages are accumulated over the test phases and are shown separately for the phase before the learning (green) and after the learning (red). Vertical thick lines indicate the means of the distributions. Note that one distribution had a mean above 0.8 before the learning. c The activity (voltage) of each of the eight neurons is color coded and plotted against the position of the endpoint of the double pendulum during one oscillation cycle. Each neuron is plotted with an offset along the x axis

The learning changed the distribution of the output activity levels, or voltages, of each neuron. In the case of the independent pendulums, the voltages were sparsely distributed before the learning, because the mechanical system remained mostly at a constant position during the tests. After the learning, the voltages were no longer sparsely distributed (Fig. 3a). The grand mean of the distribution means before the learning was 0.54 with a standard deviation of 0.09. After the learning, the grand mean was 0.45 with a standard deviation of 0.01, which indicates that, during the learning, the neurons converged toward the target activity defined by the BCM rule. In the case of the double pendulum system, the distributions of the output activity of the neurons showed no sparsity before the learning, because the pendulum showed rhythmic movement, albeit with decaying amplitude. However, the distributions had a high spread, with a grand mean of 0.74 and a standard deviation of 0.07. After the learning, the spread narrowed substantially, yielding a grand mean of 0.47 with a standard deviation of 0.01 (Fig. 3b).

In addition, after the learning, the voltage distributions of three of the eight neurons for the independent pendulums system were clearly bimodal, in line with the expected result of the BCM learning rule. For the double pendulum system, learning instead resulted in four of the neurons having bimodal distributions (Fig. 3a, b, blue arrowheads mark bimodal distributions).

Figure 3c shows the activity of each neuron during one oscillation cycle of the double pendulum. Each neuron tended to be selectively active during a specific phase of the movement, but the degree of selectivity varied between the neurons. In particular, the rightmost four neurons, which controlled the second joint, were highly selectively active for one specific movement direction of the pendulum's endpoint.

For both mechanical systems, the learning rule found bimodal voltage distributions for a subset of the neurons and shifted the mean firing rates of all neurons close to 0.5, which was to be expected from the BCM rule.

4 Discussion

We have demonstrated that a randomly connected network can learn to generate patterns of rhythmic and alternating movement in different mechanical systems. The models of the neurons and of the synaptic learning have been proposed previously (Rongala et al. 2018, 2021; Bienenstock et al. 1982; Intrator and Cooper 1992). We used them without modification, except for a shift of the equilibrium point of the learning model to adapt it to the neuron model (see below). The model was kept very simple, to demonstrate only the basic principles. The pendulums served as crude approximations of animal legs. The monolayer of neurons was designed to resemble a simplified spinal cord circuitry in terms of its sensorimotor connectivity, without any separate classes of interneurons with respect to predefined sensory inputs.

Our model could potentially explain how the biological spinal cord learns to control locomotion. Note that our system, for example, automatically learned ‘left-right alternation’ between two mechanically independent pendulums (Fig. 2a). To map our model to an envisaged ontogenetic development of the spinal cord circuitry in an animal, neurons would initially need to form connections with each other at random, and they would need to receive random connections from the sensory afferents. Also the connectivity from the spinal interneurons to the motoneurons, each innervating a specific muscle, would be random. The activity of the spinal neurons would stimulate force generation by the muscles, which through the biomechanical configuration of the body would trigger subsets of preferred movements. In a system that is inclined toward rhythmic movement, like the systems used here, this would lead to some rhythmic movement, and the sensory input to the spinal neurons resulting from that movement would also be rhythmic. From this, the neurons can learn to have stable firing rates and to provide input to other neurons or to the muscles. This would lead to the muscles having rhythmic activation, which would result in rhythmic movements as shown here. Inhibitory connections could allow neurons to reduce the activity of other neurons, enabling alternating activity between neurons, so that movement patterns in which antagonistic muscles are active in alternation could form. Moreover, inhibitory and excitatory connections between neurons could allow the distribution of neuron activity to change over the course of a movement, in which the distribution of muscle activity also evolves. Both alternating activity and shifts in neuron activity could be facilitated by a learning rule that finds synaptic weights such that each neuron is selectively active for different phases of a movement. In reality, the brain would have to select motor command inputs to the spinal cord that result in more varied movements suitable to fulfill the goals of the brain. Such motor command inputs, in turn, could influence the movement patterns from which the spinal cord circuitry structure could learn.

We do not rule out that there is a dependence between genetics and spinal cord function. For instance, our choice of fixed parameters, under which our models showed the presented behavior, could be interpreted as genetically defined parameters toward which the spinal cord has evolved. We also do not rule out that genetically defined connectivity has an important role in the spinal cord. It could facilitate learning by providing a rough connectivity pattern, suitable for the biomechanical configuration of the specific animal species, that is then refined by learning to adapt to the uniqueness, and the unique environment, of a particular animal. The learning itself also cannot be independent of genetics, as the molecules involved in learning must be encoded by genes. Locomotor-related rhythms that appear at the embryonic stage, before the skeletal muscles or sensory systems mature (Wan et al. 2019), could also facilitate learning; for instance, they could take on a role similar to what was modeled here as the random external input.

Our model could also help to find and control oscillatory modes of general multi-body mechanical systems. Intuitively, modes are intrinsic movements of mechanical systems, which are determined by the ability of a system to store and release energy (Albu-Schäffer and Della Santina 2020). For the double pendulum system, our model controlled a movement similar to a so-called energy invariant mode, where the pattern of movement is the same regardless of how fast the movement is (Albu-Schäffer et al. 2021). The double pendulum supports a greater variety of modes, and developing controllers for them would enable robots to achieve energy-efficient movement (Albu-Schäffer and Della Santina 2020). Future work could aim to find neural networks and learning rules able to learn to control these modes.

Compared with CPG models of locomotion, our model did not use rhythm-generating neurons, and the initial connectivity of the network was random. Our model assumes that learning exists in the spinal cord, similar to what has been proposed for the brain. This is a strong assumption, but it is justified by experimental observations (Loeb 1999; Petersson et al. 2003; Granmo et al. 2008; Kohler et al. 2022; Picton et al. 2022). Recent spinal cord models that also include learning have had success in explaining how prominent connectivity patterns among spinal neurons and afferents could form (Enander et al. 2022a, b). While previous models of locomotion without rhythm-generating neurons have been proposed, they either had manually adjusted synaptic weights (Geyer and Herr 2010) or an immutable network structure facilitating locomotion (Stratmann et al. 2016). Overall, our results demonstrate in principle how neural network behavior, which could explain phenomena compatible with the observations underlying the notion of central pattern generators, can arise through learning without any genetically programmed network connectivity or rhythm-generating neurons. Through its focus on cyclic movements, our paper differs from previous models of spinal-like learning that used spontaneous twitches to obtain a network connectivity that could underlie reflexes (Blumberg et al. 2013; Marques et al. 2014).

However, our model has a few limitations. During the learning phase, a more organized external input than the independent random signals we used could lead to a greater variety of random movements from which the network structure could be learned. In a developing animal, such semi-organized but still predominantly random movements could be generated by the spontaneous internal activity of the brain (Khazipov and Luhmann 2006; Inácio et al. 2016), by spontaneous activity of spinal neurons and by random muscle twitching.

Whether or not the model is able to learn rhythmic movement depends critically on the so-called force factor. This factor is the gain that converts the firing rates of the neurons into a force acting on the mechanical system. Different mechanical systems, depending on their masses, friction and springs, require different forces to move. If the factor is too small, only little movement is generated and it is impossible to learn rhythmic movement. If the factor is too large, only erratic movement is generated, which also makes learning impossible. For the double pendulum, it could be advantageous to use a different force factor for each joint, because the total masses that move around each joint are different. In biology, the muscles and the spinal cord must develop at the same time, and it could be that the muscles must adapt to the inputs from the spinal cord.

For the sake of simplicity, the models of the mechanical systems were chosen to be similar to those used in control theory and robotics. Models of actual biomechanical systems, such as cat legs or entire lamprey bodies, would be an important test that was not considered here.

The BCM rule has been proposed to explain how neurons in the brain could learn to be selective for features in passively received visual sensory inputs. Here, the BCM rule works in a model in which the input is actively influenced by the model itself. The key features of the rule that make this possible are the homeostatic stabilization of the firing rates of the neurons and the optimization for a bimodal firing rate distribution. A stable firing rate, in our case, helped to provide a force input to the mechanical systems that was limited on average. Bimodal firing rate distributions can help make neurons selective for certain phases of a movement and provide activity during those phases.

Moreover, we combined the BCM rule with a nonlinear neuron model in which the maximum firing rate is always smaller than one. If we had used the BCM rule as given in the references (Bienenstock et al. 1982; Intrator and Cooper 1992; Law and Cooper 1994), the weights would have diverged toward infinity, because that formulation has an equilibrium point at a firing rate of one. In one study (Law and Cooper 1994), weight growth was counteracted by a learning rate that depended on the learning threshold. Instead, we shifted the equilibrium point to a firing rate that can be attained by the model neurons.

Clearly, the limitations of this model and of existing CPG models must be addressed to find an accurate model of the contribution of the spinal cord to rhythmic movement generation. In biology, it is likely that the mechanisms proposed in previous CPG models, in combination with developmental, adaptive and learning processes such as the ones described here, contribute to an integrated whole.