doi:10.1016/j.neunet.2007.04.017
Copyright © 2007 Elsevier Ltd All rights reserved.
2007 Special Issue
Edge of chaos and prediction of computational performance for neural circuit models
aInstitute for Theoretical Computer Science, Technische Universitaet Graz, A-8010 Graz, Austria
Available online 3 May 2007.
References and further reading may be available for this article. To view references and further reading you must
purchase this article.
Abstract
We analyze in this article the significance of the edge of chaos for real-time computations in neural microcircuit models consisting of spiking neurons and dynamic synapses. We find that the edge of chaos predicts quite well those values of circuit parameters that yield maximal computational performance. But obviously it makes no prediction of their computational performance for other parameter values. Therefore, we propose a new method for predicting the computational performance of neural microcircuit models. The new measure estimates directly the kernel property and the generalization capability of a neural microcircuit. We validate the proposed measure by comparing its prediction with direct evaluations of the computational performance of various neural microcircuit models. The proposed method also allows us to quantify differences in the computational performance and generalization capability of neural circuits in different dynamic regimes (UP- and DOWN-states) that have been demonstrated through intracellular recordings in vivo.
Keywords: Neural networks; Spiking networks; Edge of chaos; Microcircuits; Computational performance; Network dynamics
Fig. 2. UP- and DOWN-states in neural microcircuit models. Membrane potential (for a firing threshold of 15 mV) of two randomly selected neurons from circuits in two parameter regimes labeled as UP- and DOWN-states, as well as spike rasters for the same two parameter regimes (with the actual circuit inputs shown between the two rows).
 |
Fig. 4. VC-dimension as predictor for the generalization capability of neural microcircuit models. (a) Prediction of the generalization capability for 90 different types of neural microcircuits (as in Fig. 1(b)): estimated VC-dimension (of the hypothesis class
for a set
of inputs consisting of 500 jittered versions of 4 spike patterns), each of the 90 data points for 90 different circuit types was computed as the average over 20 circuits; for each circuit, the average over 5 different sets of spike patterns was used. The standard deviation is shown in the inset on the upper right. See Section 6 for details. (b) Actual generalization capability ((error on test set) − (error on training set)) of the same neural microcircuit models for a particular learning task, quantified by the difference of test error and training error (error defined as the fraction of examples that are misclassified) in the spike pattern classification task discussed in Section 2. The standard deviation is shown in the inset on the upper right.
Fig. 5. Values of the proposed measures for computations on spike patterns. (a) Kernel-quality for spike patterns of 90 different circuit types (average over 20 circuits, mean SD=13). 7 (b) Generalization capability for spike patterns: estimated VC-dimension of
(for a set
of inputs u consisting of 500 jittered versions of 4 spike patterns), for 90 different circuit types (same as Fig. 4(a)). (c) Difference of both measures (the standard deviation is shown in the inset on the upper right). This should be compared with actual computational performance plotted in Fig. 1(b).
 |
Fig. 6. Analysis of the sets of neurons which get activated. (a) For a given input pattern, only a small fraction of the 540 circuit neurons are activated (i.e., emit at least one spike). Shown is the mean number of neurons which are activated by an input pattern. For each parameter setting, the mean over 20 circuits and 125 patterns per circuit is shown. The input patterns were drawn from the same distribution as the templates for the task in Fig. 1. (b) For different input patterns, sets of activated neurons are diverse. Shown is the mean Hamming distance between two of the 125 activation vectors arising from a set of circuit inputs consisting of 125 jittered versions of a spike template. For each parameter setting, the mean over 20 circuits is shown. (c) Difference between the mean Hamming distance for an input set consisting of 125 randomly drawn spike templates and the mean Hamming distance for an input set consisting of 125 jittered versions of a spike template. This difference predicts computational performance (see Fig. 1(b)) quite well. For each parameter setting, the mean over 20 circuits is shown. The standard deviation is shown in the inset on the upper right.
 |
Fig. 7. Performance and values of the proposed measures for different types of neural microcircuit models with linear readouts for classification of spike patterns on the basis of circuit states with limited precision. In each case the actual analog value of each component of the circuit state was rounded to one of 11 possible values (see text). (a) Fraction of examples (for 500 test examples) for which the output of the readout agreed with the target classification (the readout was trained by linear regression with 2000 training examples; see Fig. 1(b) for more details). Note the similarity to Fig. 1(b). The standard deviation of the result is shown in the inset on the upper right. (b) Generalization capability of the same neural microcircuit models for a particular learning task, quantified by the difference of test error and training error (error defined as the fraction of examples that are misclassified). Compare with Fig. 4(b). (c) Kernel-quality for spike patterns of 90 different circuit types. (d) Generalization capability for spike patterns. (e) Difference of both measures (the standard deviation is shown in the inset on the upper right). Compare with Fig. 5(c).
 |
Fig. 8. Analysis of the computational performance of simulated neural microcircuits (together with linear readouts) in different dynamic regimes. (a) Estimates of the kernel-quality for 135 spike patterns (solid line; average over 20 circuits). Estimate of the VC-dimension for a set
of inputs consisting of 135 jittered versions of one spike pattern (dotted line; average over 20 circuits). (b) Difference of measures from panel (a) (as in Fig. 5(c); mean ± SD). (c) Evaluation of computational performance (correlation coefficient; for 500 test examples) of a linear readout (trained by linear regression with 1000 training examples). 20 randomly drawn circuits and 20 target classification functions randomly drawn from the set of 230 possible classification functions were tested for each of the 13 different circuit types, and resulting correlation coefficients were averaged. (d) Estimates of the kernel-quality for input streams u with 132 different combinations of 13 firing rates (solid line; average over 20 circuits). Estimate of the VC-dimension for a set
of inputs consisting of 132 different spike trains u that represent one combination of firing rates. (dotted line; average over 20 circuits). Also shown is the average number of neurons that get activated (i.e. fire at all) for a typical input (dash-dotted line; scale on right hand side). (e) Difference of measures from panel (d) (solid line). Average firing rate in the same circuits for the same inputs (dashed line). Note that the firing rates are poorly correlated both with the difference of the two measures, and with the computational performance shown in (f). (f) Evaluation of computational performance (correlation coefficient; for 400 test examples) of a linear readout (trained by linear regression with 2000 training examples). 20 randomly drawn circuits were tested for each of the 13 different circuit types, and resulting correlation coefficients were averaged. Results are shown for classification at three different time points after stimulus onset: 100 ms (solid line), 150 ms (dashed line), and 200 ms (dash-dotted line).This shows that at least for this task performances for these time points were comparable. Their performance peak (as a function of the amount of background noise) was also predicted quite well by our two measures as shown in panel (e).