Reconfigurable reservoir computing in a magnetic metamaterial

Vidamour, I. T.; Swindells, C.; Venkat, G.; Manneschi, L.; Fry, P. W.; Welbourne, A.; Rowan-Robinson, R. M.; Backes, D.; Maccherozzi, F.; Dhesi, S. S.; Vasilaki, E.; Allwood, D. A.; Hayward, T. J.

doi:10.1038/s42005-023-01352-4

Download PDF

Article
Open access
Published: 26 August 2023

Reconfigurable reservoir computing in a magnetic metamaterial

Communications Physics volume 6, Article number: 230 (2023) Cite this article

1994 Accesses
5 Citations
7 Altmetric
Metrics details

Subjects

Abstract

In-materia reservoir computing (RC) leverages the intrinsic physical responses of functional materials to perform complex computational tasks. Magnetic metamaterials are exciting candidates for RC due to their huge state space, nonlinear emergent dynamics, and non-volatile memory. However, to be suitable for a broad range of tasks, the material system is required to exhibit a broad range of properties, and isolating these behaviours experimentally can often prove difficult. By using an electrically accessible device consisting of an array of interconnected magnetic nanorings- a system shown to exhibit complex emergent dynamics- here we show how reconfiguring the reservoir architecture allows exploitation of different aspects the system’s dynamical behaviours. This is evidenced through state-of-the-art performance in diverse benchmark tasks with very different computational requirements, highlighting the additional computational configurability that can be obtained by altering the input/output architecture around the material system.

Physical reservoir computing with emerging electronics

Article 12 March 2024

Task-adaptive physical reservoir computing

Article Open access 13 November 2023

Brownian reservoir computing realized using geometrically confined skyrmion dynamics

Article Open access 15 November 2022

Introduction

In-materia computation, where the responses of material systems are exploited to perform computational operations, offers a potential alternative to conventional CMOS computing. Here, like in biological neurons, data processing operations are performed intrinsically via the physics governing the system’s response to inputs. This offers potential improvements in both latency and power efficiency, as dynamical complexity and memory are inherent properties of the substrate. This removes the need to shuttle data between discrete memory and computational units, which can cost up to 100 times the energy of the computation itself when discrete memory units are located off-chip¹.

Reservoir Computing (RC)^2,3 is a bio-inspired computational paradigm which is especially harmonious with in-materia computation. In RC, a time-dependent ‘reservoir’ layer (typically a recurrent neural network, RNN) provides complex nonlinear representations of input data, and a time-invariant readout layer provides a weighted output of the evolving state of the reservoir. Only the readout layer is trained, alleviating the training difficulties associated with standard RNNs since temporal dependencies of the reservoir layer are decoupled from the simple linear output⁴.

As the response of the RNN is mathematically analogous to that of a dynamic system, it can be substituted with a real-world dynamic system with appropriate properties, namely nonlinearity between input and output, and a dependence on previous state that asymptotically diminishes over time, termed a ‘fading memory’. This has led to a plethora of proposed implementations, with platforms including optoelectronic^5,6,7, molecular⁸, mechanical^9,10,11, biological^12,13, memristive^14,15,16 and magnetic^{17,18,19,20,21,22,23} systems.

Nanomagnetic platforms are of particular interest for RC due to their inherent hysteretic behaviours and nonlinearity of system dynamics, satisfying the two broad criteria necessary for RC. Many magnetic systems have been proposed as reservoirs and come with their own strengths and weaknesses. Spin-torque nano-oscillators^17,24,25 offer high data-throughput and passive synchronisation, and can be characterised using simple electrical measurements. The all-electric nature of the input/output to these oscillators has allowed for small artificial neural networks (<10 nodes) to be demonstrated experimentally^26,27, and larger networks have been simulated for RC with binary inputs^28,29. However, the intrinsic dynamics of single oscillators are relatively simple (though they can be augmented via external delayed feedback^30,31) and have durations on the order of nanoseconds, limiting their suitability to processing applications where sensory data arrives with characteristic timescales on the order of seconds- far beyond the intrinsic decay times of these systems. Magnetic metamaterials (materials that are engineered to exhibit complex physical responses beyond their underlying material properties) such as artificial spin-ice systems^19,20,32 and skyrmion textures³³, represent an exciting subcategory for magnetic RC, boasting complex, spatially distributed responses. However, interfacing with these materials is challenging, since spin-ices are electrically discontinuous and skyrmion textures require sub-100K temperatures, inhibiting device-tractable measurement approaches.

While there have been many recent, important developments showcasing device-specific RC performance in a range of physical systems, many more general questions remain, such as how different RC architectures can be used to extract different computational properties, and how these architectures can best synergise with the underlying system dynamics. Frequently, the ‘single dynamical node’ paradigm³⁴ is employed with little attention to its role in the computation or to the alternative computational properties that could be extracted with different reservoir architectures. This leaves some of the broader potential of nanomagnetic RC as reconfigurable computational platforms untapped.

In this paper, we experimentally demonstrate a pipeline from characterisation of device physics, to reservoir design, to state-of-the-art performance in several, diverse computational tasks with a single magnetic device consisting of an array of interconnected magnetic nanorings³⁵. The nanoring system boasts a combination of highly complex system response and simple electrical readout: strong coupling between individual ring elements produces complex ‘emergent’ dynamics (where large-scale responses arise from the collective effects of simple interactions between elements, rather than the properties of the elements themselves), while the continuous nature of the patterned nanostructure facilitates electrical transport measurements. Additionally, non-volatile domain configurations formed in response to input provides a natural means of generating system memory at driving fields an order of magnitude smaller than spin-ice systems³⁶. To harness these emergent behaviours, we employ the device in three distinct reservoir architectures that each leverage different aspects of its dynamical properties. We then demonstrate how this provides flexible computational functionality by performing benchmark tasks with contrasting computational requirements on a single device, achieving state-of-the-art accuracies. This highlights the reconfigurability achievable in in materio platforms via careful choice of the accompanying RC architecture.

Results

Response of nanoring arrays

The devices studied here consist of arrays of 10 nm thick Ni₈₀Fe₂₀ (Permalloy, Py) nanorings, patterned into a square lattice with each ring having nominal diameters of 4μm and track widths of 400 nm, each overlapping with its nearest neighbours across 50% of their track widths^35,36. The arrays were fabricated by electron beam lithography with lift-off processing and metallised via thermal evaporation. Ti/Au electrical contacts were then added via additional lithography and deposition steps, allowing measurements of the device’s anisotropic magnetoresistance (AMR). Despite typical AMR ratios of 3-4% for Py³⁷, shape anisotropy in the rings means that magnetisation typically runs parallel to the applied currents. This meant only the domain walls which present local changes in magnetisation direction can be detected via AMR, leading to an effective AMR ratio of 0.2% for the device, with the signal quality improved via lock-in amplification techniques (see Methods- Electrical transport measurements). The samples have saturation magnetisation ${\mu }_{0}{M}_{s}$ of 0.969 ± 0.006 T, determined via broadband ferromagnetic resonance measurements (see supplementary note 4, and supplementary fig. S6). Figure 1a shows a scanning electron microscope image of the device.

**Fig. 1: Overview of static and dynamic responses of nanoring arrays.**

In previous studies^35,36, we have shown that interconnected nanoring arrays exhibit emergent magnetisation dynamics under rotating in-plane magnetic fields. At the microstate, each ring exists in one of three metastable configurations, defined by the number and position of domain walls (DWs) it possesses, with configurations for ‘vortex’ (zero DWs), ‘onion’ (two DWs, 180° separation), and ‘three-quarter’ (two DWs, 90° separation) shown in Fig. 1b. To initialise the ring arrays, a strong pulse of magnetic field and subsequent relaxation leads to a uniform state of aligned onion rings, with DWs pointing along the direction of the saturation pulse. Under high driving fields, the DWs can coherently propagate with the applied field, maintaining onion configuration. However, under lower driving fields, stochastic pinning events cause differential movement of DWs within a ring (onion to three-quarter transition), potentially leading to DW annihilation (three-quarter to vortex transition) when itinerant DWs in the same ring collide. DWs can be restored in rings via the propagation of a DW in neighbouring ring, with the magnetic reversal across the junction between the two rings leading to injection of a pair of DWs in the empty ring (vortex to onion/three-quarter transition). Schematics for these processes are shown supplementary fig. 2g, h. Whilst these behaviours are stochastic at the local scale, interactions between many rings lead to a well-defined global emergent response, providing a complex yet repeatable dynamic state evolution (Fig. 1c–e).

To evaluate the evolving magnetic states of the arrays for computation, AMR measurements performed via the electrical contacts shown in Fig. 1a. This gives a single global readout for each array, which varies over a given input rotation. Initially, the device’s response as a function of rotating field amplitude was surveyed to determine the characteristics of the responses and identify computationally useful features (Fig. 1f). Fourier analysis of the AMR response led to observation of two distinct signals with frequencies that match (1f signal) as well as double (2f signal) the frequency of the rotating magnetic field, with the relative magnitude of the two signals with respect to driving field amplitude shown in Fig. 1f(i) (see Supplementary note 1 for further Fourier analysis). Physically, these processes can be separated into elastic deformation of the rings’ domain structures due to susceptibility effects (1f, dominant at lower fields), and irreversible DW propagation between pinning sites in the rings (2f, dominant at higher fields). Further details of these mechanisms can be found in the supplementary note 2. The dynamic nature of the system’s response was evaluated by measuring the number of rotations that were required for the AMR signal to reach dynamic equilibrium (<2% amplitude variance between cycles) from saturation, with the measured timescales and the underlying signals shown in Fig. 1f(ii), (iii), (vi) respectively. The onset of DW motion can also be observed at a ~ 22 Oe, marked by the nonlinear increase of 1 f signal in Fig. 1f(i), as well as the start of varying time-signals between cycles in Fig. 1f(iii)–(vi).

From these measurements, three computationally promising properties can be identified. Firstly, the distinct variation of the AMR frequency components with respect to field provides crucial nonlinearity. Secondly, the dependence of the device’s response on its past states, as evidenced by the range of timescales observed in the AMR signals, allows information to be connected across time in manner reminiscent of the echo-state property of echo state networks (ESNs). Finally, the presence of a threshold field below which no irreversible DW motion occurs shows a non-volatility of system state, providing pathways to longer-term storage of information.

The key demonstration of this paper is how these physical behaviours of the nanoring devices can be harnessed in different ways to create RCs with different computational properties, and thus tackle problems with different computational requirements. We achieve this by incorporating the device into three distinct reservoir architectures: an approach which takes advantage of the time-continuous oscillations of the nanoring array (signal sub-sample reservoir), the ‘single dynamical node’ architecture introduced by Appeltant et al.³⁴ and the recently proposed ‘rotating neurons reservoir’ of Liang et al.³⁸ yet to be deployed outside of analogue electronic RC. These architectures are presented schematically in Fig. 2 and described in their respective “Methods” section. In the following, we will explain how each of these architectures allows different computational properties to be emphasised and then exploited to perform challenging computational tasks. For further details on the methods employed for the machine learning tasks, see Supplementary note 5.

**Fig. 2: Schematic diagrams of each reservoir architecture.**

Signal sub-sample reservoir

One foundational task for RC platforms is nonlinear signal transformation^{20,32,39,40,41}. In this problem, the system is provided with input of a given periodic response and is tasked with transforming the input signal into a different target signal. To perform this task, the reservoir should provide a higher-dimensional, nonlinear representation of the input signal so that the transformation between the input and the target can be computed via a simple linear readout.

To meet these computational demands, we designed a simple reservoir input/output architecture that directly exploited the non-linear variation of the 1 f/2 f frequency signals (Fig. 3a). Here, each input datum scaled the field amplitude for a single rotation, and the resulting AMR response was sampled at 32 times per input, expanding input dimensionality 32-fold. The two frequency components have different nonlinear variations with respect to input magnitude, meaning that the relative magnitude of the continuous signal at fixed sample points will have nonlinear variation with respect to each other, providing dimensionality expansion of the input data. This offers a very simple method for providing increased nonlinearity in physical systems with continuous signals, obtained by leveraging a phase transition in system response.

**Fig. 3: Performance of signal transformation task.**

Figures 3b–d shows the resulting signal reconstruction when the ring array system was tasked with transforming sinusoidal input to ReLU(sin(x)) (rectified linear unit), square wave, sawtooth waveforms. To evidence the impact of the metamaterial on computation, a control experiment was performed by recording the voltage of one of the driving electromagnets as the measured reservoir state instead of the resistance of the nanoring array. This provided equal dimensionality expansion as the nanoring array transformation, but without the nonlinearities contributed by the nanoring system. However, these measurements do contain any hardware-based nonlinearities in the electromagnets such as slew-rate between inputs and inductive effects, accounting for any nonlinearities provided by the experimental equipment. The ring array network outperformed the control network in all cases, offering up to a 55-fold reduction in MSE (4.6 x ${10}^{-4}$ compared to 2.5 x ${10}^{-2}$) when replicating the ReLU function. The rings also perform favourably compared to proposed spin-ice platforms, with lower errors for Sawtooth (1.406 x ${10}^{-2}$ vs 1.919 x ${10}^{-2}$) and Square (6.605 x ${10}^{-3}$ vs 2.429 x ${10}^{-2}$) waves²⁰. The different reconstruction tasks are performed optimally at very different ranges of applied field (Fig. 3e), highlighting how the ring array’s dynamics can be further tuned for better performance in a range of similar problems even when held within a consistent reservoir architecture. The accuracies for all transformations for both the ring array and control network, as well as the ratio between them, are shown in the Fig. 3f.

Single dynamical node reservoir

Another key application for RC is the classification of time varying signals such as spoken digits, a task which has been previously used to benchmark a variety of RC platforms^17,21,25,34. While input data for the previous task was 1-dimensional, input data for speech recognition tasks are typically multi-dimensional. Furthermore, non-linear interactions between these input dimensions in the reservoir are essential to successful classification. Here, we consider classification of the spoken digits 0–9 from the TI-46 database (see Supplementary Note 5- Spoken Digit Recognition Task for details). The input data was 13-dimensional, consisting of the results of applying Mel-frequency cepstral filters⁴² to each utterance. The data is linearly inseparable, with classification accuracy being limited to around 75%²⁵ if input data is passed directly to a linear readout layer. The role of the reservoir is to provide a non-linear mapping of input data into higher dimensional reservoir space, thus allowing the linear readout layer to establish hyperplanes which can classify the data accurately.

Tackling this problem requires a reservoir architecture that expresses the non-linearity of the device’s AMR response, can accommodate multiple input dimensions, and allows nonlinear combinations of these input dimensions, properties that cannot be provided by the signal sub-sample architecture. To satisfy these requirements we adopted the single dynamical node approach (Fig. 4a) initially proposed by Appeltant et al.³⁴. and detailed in the “Methods” section. Multidimensional input data was fed sequentially into the device, creating a reservoir constructed of ‘virtual’ nodes that convolves inputs temporally via the ring array’s transient dynamics. Thus, this approach leveraged both the non-linear response of the device’s AMR signal to input (via the activation of the virtual nodes), and its transient nature (which allowed interaction between virtual nodes).

**Fig. 4: Performance of spoken digit recognition task.**

As shown in the previous task application, our device exhibited a broad range of responses that were potentially useful for computation. Searches over parameter space can be performed for simple tasks such as signal transformation, however for more data-intensive tasks, this process is inefficient. Previous studies have shown that task-agnostic metrics, which can be found via statistical analysis of small random datasets^36,43, can speed up parameter selection by identifying promising regions of parameters space. Using metrics of kernel rank (KR, the ability of the reservoir to separate different input classes) and generalisation rank (GR, and the ability of the reservoir to generalise inputs of the same class), we evaluated the computational properties of the device’s transformations for a range of scalar parameters controlling the scaling (H_r) and offset (H_c) of inputs (see supplementary note 3). As the spoken digit recognition task required improving the linear separability of input data, KR was chosen to be the key identifier of promising performance, with a comparatively lower GR also needed to generalise between the different speakers.

To highlight the single dynamical node approach’s better suitability to the spoken digit recognition task, metric maps were also drawn similarly for the other reservoir architectures (Supplementary fig. S4). While the revolving neurons reservoir showed good separation properties (high KR), and the signal sub-sample reservoir good generalisation properties (low GR), only the single dynamical node architecture exhibited a balance of the two, showing better suitability for classification tasks. This is likely due to the rotating neuron reservoir’s increased dependency upon past states reducing its ability to generalise, and the relatively smaller dimensionality of the signal sub-sampling reservoir leading to poorer ability to separate information. Conversely, the single dynamical node approach both provides good dimensionality expansion, as well as having decreased dependence on past states via the increased separation of inputs over time provided by the time multiplexing procedure.

Figure 4b shows the error rates versus training samples for spoken digit recognition, obtained using both a ‘promising’ (H_c = 29 Oe, H_r = 10 Oe, KR = 76) and arbitrarily chosen reservoir configurations (e.g., H_c = 21 Oe, Scaling, H_r = 7.5 Oe, KR = 52). 100-fold cross-validation was performed to evaluate general performance and find a suitable regularisation parameter, selected for best performance on the training set to prevent overfitting. Again, a reservoir constructed from the voltage signals across the driving coils was used as a control, effectively skipping the reservoir transformation whilst including the same pre-processing steps. A significant reduction of word-error-rate is observed moving from ‘control’ to ‘arbitrary’ to the ‘promising’ case, with error-rates of 24.8%, 10.4%, and 4.6% respectively. This demonstrates not only the effectiveness of the reservoir’s transformations in improving the linear separability of the data, but also the utility of evaluating metric scores to expedite system parameter selection.

One method for further improving performance commonly employed in conventional RC settings is the use of bespoke learning rules instead of standard regression-based training methods. Here, the SpaRCe⁴⁴ algorithm was used, which was developed for use on ESNs though thus far has not been applied to physical systems, and its online nature synergises well with life-long learning paradigms especially useful for system-level device applications⁴⁵. The algorithm aims to suppress confounding information and induce sparse output representations. Here, these properties help to mitigate the effects of experimental noise and remove redundant virtual node outputs. With SpaRCe, the accuracy was improved to 99.8%, as shown in Fig. 4c. The ring arrays matched state-of-the-art performance compared to other magnetic architectures, even with fewer (50) virtual nodes used in the time-multiplexing procedure (STNOs with 400 virtual nodes, 99.8%¹⁷, simulations of superparamagnetic arrays with 50 virtual nodes, 95.7%²¹), and improved upon the performance achieved in simulations of the ring system (97.7%³⁶).

Revolving neurons reservoir

In addition to data classification tasks, RC is also highly applicable to time series prediction problems. To be successful in these tasks, RC platforms often require fading memory of past inputs to correctly predict future trajectories, in addition to the non-linear properties that were exploited in the previous two tasks. The memory of a reservoir can characterised by evaluating the linear memory capacity⁴⁶ (MC), which measures the ability to reconstruct past inputs from the current reservoir state over increasing delays. Typically, nanomagnetic RC platforms exhibit low MC without the inclusion of delayed feedback due to the short timescale of intrinsic dynamic behaviors²³. Additionally, reservoirs constructed under the single dynamical node paradigm struggle to recall previous input datapoints due to the long temporal separations between each input created by the time-multiplexing procedure. For example, the prior architectures presented here exhibited peak MC < 3, meaning they could only reliably recall the previous two inputs (see supplementary fig. S5).

To utilise the system’s non-volatile properties and create a architecture better suited to time series prediction tasks, the recently proposed ‘rotating neurons reservoir’³⁸ (RNR, see “Methods” section) configuration was employed. Here, the system was constructed from 50 distinct dynamical nodes, with inputs to each node modulated by a fixed, rotating input (output) mask which multiplied input (reservoir state) values by ±1 (weight value), shifting the input/output connections to each node by one position every timestep (Figs. 2c and 5a).

**Fig. 5: Performance of linear and nonlinear memory tasks.**

The memory effects exhibited in this configuration emerge from the ratchet-like nature of the device’s non-volatile response: small inputs cause reversable perturbations while larger inputs cause non-volatile changes to underlying domain structure. In the RNR configuration, this means the system’s evolution is dependent upon the sign of the input at a given time, determined by the mask. For negative mask values, the low applied field strengths leave the rings’ domain structures unchanged through multiple timesteps until a positive input is applied to the system, where the higher applied fields cause DW propagation which is then measured as a change in the system’s resistance. This allows inference of the previous inputs applied to the system from the current states of the dynamical nodes, increasing MC. This architecture hence synergises well systems where activity decays slowly in the absence of large inputs.

Figure 5b shows the MCs calculated from the ring array system using the RNR approach. A peak MC of around 11.5 was found at ${H}_{c}$ = 21 Oe and ${H}_{r}$ = 10 Oe, showing that the device’s non-volatile properties were being harnessed to provide much greater memory of past inputs than the other approaches, and thus extending applicability to problems with longer-term temporal dependencies. The region of maximum MC here is correlated to the central field at which DW motion starts to occur (Fig. 1f(ii)). This corroborates the reasoning that the movement of DWs into different non-volatile configurations at fields above this value is where the system is ‘storing’ its memory of past states.

While MC can quantify the extent of linear memory (direct reconstruction of past inputs) in the system, real-world regression problems often require nonlinear memory (nonlinear representations of past inputs) for accurate prediction. To demonstrate the extent of nonlinear memory available to the system, we trained the system to reproduce a nonlinear auto-regressive moving average (NARMA-N) of input signals with varying degrees of autocorrelation (NARMA-5 and NARMA-10). For this problem, a system with perfect linear memory of equal degree to the autocorrelation (i.e., a shift-register of length N) can only achieve normalised means squared errors (NMSE) of ~0.4³⁴. To improve upon this, a system needs to store nonlinear representations of past inputs. Figure 5c, d presents heatmaps of NMSE achieved over a range of field scaling parameters for NARMA-5 (5c), and NARMA-10 (5d), as well as examples of the reconstructed signals (5e, 5f). Regions where the ring array system outperforms the shift register in the NARMA-5 and NARMA-10 tasks are shown by the grey lines in Fig. 5c, d, achieving peak NMSEs of 0.265 and 0.359 respectively. The combination of MC and performance of NARMA-N demonstrated that the system had been effectively reconfigured into a configuration with both linear and nonlinear memory without the aid of external delayed feedback lines that have typically been used in other demonstrations.

Conclusion

In this paper, we have demonstrated how a range of different RC architectures allow exploitation of different underlying dynamic properties in a complex magnetic system. This reconfigurability allowed the platform to achieve state-of-the art performance in three diverse tasks with differing computational requirements. To summarise the key correlations between underlying dynamics and suitable reservoir architectures, we found that the signal subsampling architecture synergises with phase transitions in the system’s response to provide nonlinear mappings of input, the single dynamical node paradigm synergises with transient responses to connect different input dimensions across time, and the rotating neurons reservoir scheme synergises well with regimes where reservoir state changes slowly with small/zero inputs, allowing information from past inputs to be sustained over time via the rotating input mask.

The synergy between these dynamic properties is also directly correlated to the type of task that the resulting reservoir is suitable for solving: the dimensionality expansion and nonlinear dynamics provided by the signal subsampling architecture allows for effective 1D signal processing, the temporal mixing of input dimensions in the single dynamical node architecture enables classification on multivariate data, and the slow dynamics modulated by the rotating input mask in the rotating neurons reservoir architecture allows for effective performance in memory-based tasks. Aside from the architecture choice, the selection of suitable scaling parameters for the input data is also critical to performance. To address this, we used task-independent metrics to provide a more holistic mapping of the computational properties of the reservoir across a range of scaling parameters and demonstrated the additional performance attainable via selecting promising parameters from the resulting metric maps for both classification-based tasks (KR/GR) and memory-based tasks (MC), with additional comparisons between each of the architectures’ scores in these metrics.

We believe that the range of dynamical regimes offered by the system, combined with the ability to address each of these properties separately and extract distinct computational properties via controlling the external reservoir architecture, makes the ring system a candidate for reservoir computing with complex dynamic substrates. Additionally, the effectiveness of synergising the reservoir architecture with the dynamic properties of the underlying system makes for an effective methodology for extracting a broad range of computational capability for other similar devices. The ring devices are not without their limitations however, with the current device being driven external rotating magnetic fields, which provides both a limitation on the throughput on data input to the system (on the order of 100 s of Hz), and power wastage in generating the magnetic fields over areas orders of magnitude larger than the nanoring array itself. Additionally, the current electrical readout provides a single scalar readout on the entire system state at a given point in time, which is sub-optimal for extracting complex state information on a system which exhibits spatially distributed responses like the ring system here. The feasibility of the ring system as a complex RC device that would be applicable to real-world settings hinges upon the ability to respond to electrical inputs such as spin-orbit torque driven DW motion, as well as expanding upon the readout mechanism to provide spatially resolved measurement of magnetic state.

To expand the computational capabilities of the ring arrays, the complex behaviours outlined here should operate concurrently as part of a larger system. The changes in magnetic responses offered via geometric changes to the system could enable multiple devices to operate in different regimes of dynamics and emphasise different computational properties under a single input field. Other magnetic metamaterial platforms have been shown to be useful in ‘deep’ reservoir networks with distributed reservoir properties³², which the ring system would also likely benefit from. We believe that this work marks a significant step forward towards the realisation of metamaterial systems as computational platforms that are device-compatible, and that the rich playground of computationally useful dynamics they offer makes the ring system a promising candidate for physics-based neuromorphic computation platforms.

Methods

Device fabrication

The ring array devices were fabricated using two-stage electron-beam lithography, with layouts patterned using a RAITH Voyager system. Wafers of Si (001) with a thermally oxidised surface were spin-coated with a positive resist. The ring structures were metallised to thicknesses of 10 nm via thermal evaporation of Ni₈₀Fe₂₀ powder using a custom-built (Wordentec Ltd) evaporator with typical base pressures of below 10⁻⁷ mBar. The initial resist went through lift-off, leaving the ring structures before re-application of the resist and further electron-beam lithography. Electrical contacts were metallised in two stages of thermal evaporation, first with 20 nm titanium to form a seed layer, before growth of 200 nm of gold. Electrical connections were provided between the device and a chip carrier through bonding of gold wire between contact pads on the device and the chip carrier.

Electrical transport measurements of ring arrays

Currents of 1.4 mA were provided to the arrays as a 43117 Hz sine wave into the patterned contacts (Fig. 1a) on the device using a Keithley 6221 current source. Resistance changes via AMR effects were measured using a Stanford Research SR830 lock-in amplifier. A National Instruments NI DAQ card was used to log the output voltage of the lock-in amplifier 64 times per rotation of applied field, and the data were then saved on a personal computer. The rotating magnetic fields were generated using two pairs of custom-built air-coil electromagnets in a pseudo-Helmholtz arrangement. The electromagnets were driven by a pair of Kepco BOP 36-6D power supplies and were controlled via voltage signals generated using a computer and the analogue output functionality of the NI card. A rotating field frequency of 37 Hz was chosen as a compromise between data throughput and signal fidelity.

Reservoir computing

In RC, the fixed reservoir layer provides a transformation of discrete-time input signals $u\left(t\right)$, to reservoir states, $x\left(t\right)$, according to the internal dynamics of the reservoir layer. The readout layer (here, a single-layer linear perceptron) provides a weighted sum of the reservoir states as output, $y\left(t\right)$. The transformation provided by the reservoir layer results in a higher-dimensional mapping of the input signals. This aids the output layer in classifying the input signals by allowing selection of hyperplanes in higher-dimensional space to correctly classify data that was previously linearly inseparable.

In this work, the RNN that constitutes the reservoir layer of the typical echo state network (ESN) was replaced with the magnetic nanoring device. The reservoir transformation was provided by the physical processes that govern the array’s magnetic response to field, as well as the changes to electrical resistance that consequently occur. Methods for inputting and extracting data are outlined for each reservoir configuration:

Signal subsample reservoir

Input sequences ${u}_{\tau }$ are transformed to give an applied field sequence via a pair of scalar parameters ${H}_{c}$ and ${H}_{r}$, shown in the following equation, which represent the zero-input field offset and the field scaling factor respectively:

$${H}_{{input}}={H}_{c}+{H}_{r}* {u}_{\tau }$$

Each input was applied for a single rotation of magnetic field. The reservoir states were then extracted by sampling the lock-in voltage signal 32 times per rotation, producing a 32-node output.

Single dynamical node reservoir

This approach uses ‘virtual’ nodes³⁴, where the reservoir states are generated from observing the state of the nanoring array as it evolves under time-multiplexed input. The generation of the time-multiplexed sequence of applied field magnitudes, ${H}_{{input}}$, (a vector of length $\theta * \tau$, where θ represents the desired number of virtual nodes, and τ the number of discrete-time windows the initial input sequence contains) was accomplished by combining the d-dimensional input vector for each timestep in ${u}_{\tau d}$ with a fixed input mask matrix, ${M}_{d,\theta }$, and flattened into a 1D sequence by concatenating timestep-by-timestep via:

$${H}_{{input}}={H}_{c}+{H}_{r}\mathop{\sum }\limits_{k=1}^{\tau }{u}_{k,d}*{ M}_{d,\theta }$$

where ${M}_{d,\theta }$ consisted of randomly generated 0’s and 1’s. The field sequence was then input to the system by rotating the field at magnitudes specified by ${H}_{{input}}$ for a given number of quarter-rotations per input datum.

The resulting voltage signals provided by the lock-in amplifier underwent some simple processing steps: Firstly, a high-pass filter with a low cut-off frequency of 3 Hz was used to centre the signals about zero and remove any low-frequency noise in the system. Band-pass filters were used to extract the 1f and 2f components separately. The pass-windows for each of these filters were centred about the input frequency and twice the input frequency, with band widths of 25% of the centre frequency to capture the damped dynamics of the oscillatory system. The outputs of the high-pass, and each of the band-pass filters, were sampled twice per input, forming a complete reservoir state vector six times the length of ${H}_{{input}}$.

Rotating neurons reservoir

This technique employs a shifting input/output mask³⁸, functionally analogous to rotating the input and output weights synchronously while keeping the dynamical neurons fixed. The procedure for this ‘rotation’ can be described as follows: Consider a system of $\theta$ dynamical nodes ${\eta }^{i}$, where i denotes the index of each node. An input signal ${u}_{\tau ,d}$ is combined with mask ${M}_{d,\theta }$, to produce input dimensions ${s}_{\tau \theta }$. The input to node ${\eta }^{i}$ at timestep t, ${\widetilde{s}}_{t,i}$, is given via

$${\widetilde{s}}_{t,i}={s}_{t,\left(i+t\right) \% \theta }$$

where ‘%’ represents the modulo operation. The resulting output matrix, ${\widetilde{X}}_{\tau \theta }$, is generated by vertically concatenating the output of vectors all $\theta$ nodes as they evolve, and is ‘unravelled’ similarly to form reservoir state matrix $X$ via:

$${{X}_{t,i}=\widetilde{X}}_{t,\left(i-t\right) \% \theta }$$

Additional information on each of the machine learning tasks, details of training methods employed, and any data processing steps taken can be found in Supplementary Methods - Machine Learning Tasks.

Data availability

A repository containing all data used in this study can be found on the University of Sheffield’s online database, ORDA, https://doi.org/10.15131/shef.data.23815434.

Code availability

A repository containing all code used in this study can be found on the University of Sheffield’s online database, ORDA, https://doi.org/10.15131/shef.data.23815434.

References

Zou, X., Xu, S., Chen, X., Liang, Y. & Han, Y. Breaking the von Neumann bottleneck: architecture-level processing-in-memory technology. Sci. China Inf. Sci. 64, 160404:1–160404:10 (2021).
Jaeger, H. The “Echo State” Approach To Analysing And Training Recurrent Neural Networks- With An Erratum Note. GMD Technical Report (2001).
Lukoševičius, M., Jaeger, H. & Schrauwen, B. Reservoir computing trends. KI - Kunstliche Intell. 26, 365–371 (2012).
Article Google Scholar
Lukoševičius, M. & Jaeger, H. Reservoir computing approaches to recurrent neural network training. Comput. Sci. Rev. 3, 127–149 (2009).
Article MATH Google Scholar
Paquot, Y. et al. Optoelectronic reservoir computing. Sci. Rep. 2, 287 (2012).
Jacobson, P. L., Shirao, M., Yu, K., Su, G. L. & Wu, M. C. Hybrid convolutional optoelectronic reservoir computing for image recognition. J. Light. Technol. https://doi.org/10.1109/JLT.2021.3124520 (2021).
Sande, G. V., der, Brunner, D. & Soriano, M. C. Advances in photonic reservoir computing. Nanophotonics 6, 561–576 (2017).
Article Google Scholar
Yahiro, W., Aubert-Kato, N. & Hagiya, M. A reservoir computing approach for molecular computing. Artificial Life Conference Proceedings (The International Society for Artificial Life, 2018).
Dion, G., Mejaouri, S. & Sylvestre, J. Reservoir computing with a single delay-coupled non-linear mechanical oscillator. J. Appl. Phys. 124, 152132 (2018).
Article ADS Google Scholar
Coulombe, J. C., York, M. C. A. & Sylvestre, J. Computing with networks of nonlinear mechanical oscillators. PLoS ONE 12, e0178663 (2017).
Article Google Scholar
Dion, G., Oudrhiri, A. I.-E., Barazani, B., Tessier-Poirier, A. & Sylvestre, J. Reservoir Computing in MEMS BT - Reservoir Computing: Theory, Physical Implementations, and Applications (eds. Nakajima, K. & Fischer, I.) 191–217 (Springer Singapore, 2021).
Tsakalos, K. A., Sirakoulis, G. C., Adamatzky, A. & Smith, J. Protein Structured Reservoir computing for Spike-based Pattern Recognition. IEEE Trans. Parallel Distrib. Syst. https://doi.org/10.1109/TPDS.2021.3068826 (2021).
Liu, X. & Parhi, K. K. Reservoir computing using DNA oscillators. ACS Synth. Biol. 11, 780–787 (2022).
Article Google Scholar
Kulkarni, M. S. & Teuscher, C. Memristor-based reservoir computing. Proc. 2012 IEEEACM Int. Symp. Nanoscale Archit. NANOARCH 2012 226–232 https://doi.org/10.1145/2765491.2765531(2012).
Hassan, A. M., Li, H. H. & Chen, Y. Hardware implementation of echo state networks using memristor double crossbar arrays. in 2017 International Joint Conference on Neural Networks (IJCNN) 2171–2177 https://doi.org/10.1109/IJCNN.2017.7966118 (2017).
Mehonic, A. et al. Memristors—from in-memory computing, deep learning acceleration, and spiking neural networks to the future of neuromorphic and bio-inspired computing. Adv. Intell. Syst. 2, 2000085 (2020).
Article Google Scholar
Torrejon, J. et al. Neuromorphic computing with nanoscale spintronic oscillators. Nature 547, 428–431 (2017).
Article Google Scholar
Nakane, R., Tanaka, G. & Hirose, A. Reservoir computing with spin waves excited in a garnet film. IEEE Access 6, 4462–4469 (2018).
Article Google Scholar
Jensen, J. H., Folven, E. & Tufte, G. Computation in artificial spin ice. in ALIFE 2018 - 2018 Conference on Artificial Life: Beyond AI 15–22 (MIT Press - Journals, 2020).
Gartside, J. C. et al. Reconfigurable training and reservoir computing in an artificial spin-vortex ice via spin-wave fingerprinting. Nat. Nanotechnol. 17, 460–469 (2022).
Article ADS Google Scholar
Welbourne, A. et al. Voltage-controlled superparamagnetic ensembles for low-power reservoir computing. Appl. Phys. Lett. 118, 202402 (2021).
Article ADS Google Scholar
Ababei, R. V. et al. Neuromorphic computation with a single magnetic domain wall. Sci. Rep. 11, 1–13 (2021).
Article Google Scholar
Allwood, D. A. et al. A perspective on physical reservoir computing with nanomagnetic devices. Appl. Phys. Lett. 122, 040501 (2023).
Article ADS Google Scholar
Riou, M. et al. Temporal pattern recognition with delayed-feedback spin-torque nano-oscillators. Phys. Rev. Appl. 12, 024049 (2019).
Abreu Araujo, F. et al. Role of non-linear data processing on speech recognition task in the framework of reservoir computing. Sci. Rep. 10, 1–11 (2020).
Article Google Scholar
Leroux, N. et al. Hardware realization of the multiply and accumulate operation on radio-frequency signals with magnetic tunnel junctions. Neuromorphic Comput. Eng. 1, 011001 (2021).
Article Google Scholar
Ross, A. et al. Multilayer spintronic neural networks with radio-frequency connections. Nat.Nanotechnol. 18, 1–8 (2023).
Kanao, T. et al. Reservoir computing on spin-torque oscillator array. Phys. Rev. Appl. 12, 024052 (2019).
Article ADS Google Scholar
Nomura, H. et al. Reservoir computing with two-bit input task using dipole-coupled nanomagnet array. Jpn J. Appl. Phys. 59, SEEG02 (2019).
Article Google Scholar
Williame, J., Difini Accioly, A., Rontani, D., Sciamanna, M. & Kim, J.-V. Chaotic dynamics in a macrospin spin-torque nano-oscillator with delayed feedback. Appl. Phys. Lett. 114, 232405 (2019).
Article ADS Google Scholar
Taniguchi, T. et al. Chaos in nanomagnet via feedback current. Phys. Rev. B 100, 174425 (2019).
Article ADS Google Scholar
Stenning, Kilian D. et al. Neuromorphic Few-Shot Learning: Generalization in Multilayer Physical Neural Networks. Preprint at https://doi.org/10.48550/arXiv.2211.06373 (2023).
Pinna, D., Bourianoff, G. & Everschor-Sitte, K. Reservoir computing with random skyrmion textures. Phys. Rev. Appl. 14, 054020 (2020).
Article ADS Google Scholar
Appeltant, L. et al. Information processing using a single dynamical node as complex system. Nat. Commun. 2, 468 (2011).
Dawidek, R. W. et al. Dynamically-Driven Emergence in a Nanomagnetic System. Adv. Funct. Mater. 31, 2008389 (2021).
Vidamour, I. et al. Quantifying the computational capability of a nanomagnetic reservoir computing platform with emergent magnetisation dynamics. Nanotechnology https://doi.org/10.1088/1361-6528/ac87b5 (2022).
Nagura, H., Saito, K., Takanashi, K. & Fujimori, H. Influence of third elements on the anisotropic magnetoresistance in permalloy films. J. Magn. Magn. Mater. 212, 53–58 (2000).
Article ADS Google Scholar
Liang, X. et al. Rotating neurons for all-analog implementation of cyclic reservoir computing. Nat. Commun. 13, 1549 (2022).
Article ADS Google Scholar
Daniels, R. K. et al. Reservoir computing with 3D nanowire networks. Neural Netw. 154, 122–130 (2022).
Article Google Scholar
Fu, K. et al. Reservoir Computing with Neuromemristive Nanowire Networks. in 2020 International Joint Conference on Neural Networks (IJCNN) 1–8 https://doi.org/10.1109/IJCNN48605.2020.9207727 (2020).
Dale, M., Miller, J. F., Stepney, S. & Trefzer, M. A. Evolving Carbon Nanotube Reservoir Computers. Lecture Notes in Computer Science. Unconventional Computation and Natural Computation: 15th International Conference. Vol. 9726, p. 49–61 (Springer, 2016).
Molau, S., Pitz, M., Schlüter, R. & Ney, H. Computing mel-frequency cepstral coefficients on the power spectrum. IEEE International Conference on Acoustics, Speech, and Signal Processing. Vol. 1, p. 73–76 (2001).
Dale, M., Miller, J. F., Stepney, S. & Trefzer, M. A. A substrate-independent framework to characterize reservoir computers. Proc. R. Soc. Math. Phys. Eng. Sci. 475, 20180723 (2019).
Manneschi, L., Lin, A. C. & Vasilaki, E. SpaRCe: improved learning of reservoir computing systems through sparse representations. IEEE Trans. Neural Netw. Learn. Syst. https://doi.org/10.1109/TNNLS.2021.3102378 (2021).
Liu, B. Lifelong machine learning: a paradigm for continuous learning. Front. Comput. Sci. 11, 359–361 (2017).
Article Google Scholar
Jaeger, H. Short term memory in echo state networks. GMD Rep. 152, 60 (2002).
Google Scholar
Foerster, M. et al. Custom sample environments at the ALBA XPEEM. Ultramicroscopy 171, 63–69 (2016).
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank Jack C. Gartside for helpful discussions. The authors thank STFC for beam time on beamline I06 at the Diamond Light Source, and thank Jordi Prat, Michael Foerster, and Lucia Aballe from ALBA for providing quadrupole sample holders⁴⁷. I.T.V. acknowledges a DTA-funded PhD studentship from EPSRC. The authors gratefully acknowledge the support of EPSRC through grants EP/S009647/1, EP/V006339/1, and EP/V006029/1. This project has received funding from the European Union’s Horizon 2020 FETOpen programme under grant agreement No 861618 (SpinEngine). This work was supported by the Leverhulme Trust (RPG-2018-324, RPG-2019-097).

Author information

Authors and Affiliations

Department of Materials Science and Engineering, University of Sheffield, Sheffield, UK
I. T. Vidamour, C. Swindells, G. Venkat, A. Welbourne, R. M. Rowan-Robinson, D. A. Allwood & T. J. Hayward
Department of Computer Science, University of Sheffield, Sheffield, UK
I. T. Vidamour, L. Manneschi & E. Vasilaki
Nanoscience and Technology Centre, University of Sheffield, Sheffield, UK
P. W. Fry
Diamond Light Source, Harwell Science and Innovation Campus, Didcot, UK
D. Backes, F. Maccherozzi & S. S. Dhesi

Authors

I. T. Vidamour
View author publications
You can also search for this author in PubMed Google Scholar
C. Swindells
View author publications
You can also search for this author in PubMed Google Scholar
G. Venkat
View author publications
You can also search for this author in PubMed Google Scholar
L. Manneschi
View author publications
You can also search for this author in PubMed Google Scholar
P. W. Fry
View author publications
You can also search for this author in PubMed Google Scholar
A. Welbourne
View author publications
You can also search for this author in PubMed Google Scholar
R. M. Rowan-Robinson
View author publications
You can also search for this author in PubMed Google Scholar
D. Backes
View author publications
You can also search for this author in PubMed Google Scholar
F. Maccherozzi
View author publications
You can also search for this author in PubMed Google Scholar
S. S. Dhesi
View author publications
You can also search for this author in PubMed Google Scholar
E. Vasilaki
View author publications
You can also search for this author in PubMed Google Scholar
D. A. Allwood
View author publications
You can also search for this author in PubMed Google Scholar
T. J. Hayward
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

I.T.V. performed all experiments, analysis, and modelling, and drafted the article. I.T.V. and C.S. designed and optimised the AMR measurement apparatus. I.T.V. implemented and performed all machine learning frameworks excluding the SpaRCe algorithm, implemented by L.M.; G.V., C.S. and P.W.F. designed and manufactured all devices. I.T.V., A.W., R.M.R.R., D.A.A. and T.J.H. performed X-PEEM measurements, which were overseen by D.B., F.M. and S.S.D.; E.V., D.A.A. and T.J.H. conceptualised and supervised the work. All authors reviewed the manuscript.

Corresponding author

Correspondence to I. T. Vidamour.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Physics thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer review file

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vidamour, I.T., Swindells, C., Venkat, G. et al. Reconfigurable reservoir computing in a magnetic metamaterial. Commun Phys 6, 230 (2023). https://doi.org/10.1038/s42005-023-01352-4

Download citation

Received: 13 February 2023
Accepted: 17 August 2023
Published: 26 August 2023
DOI: https://doi.org/10.1038/s42005-023-01352-4

This article is cited by

Physical reservoir computing with emerging electronics
- Xiangpeng Liang
- Jianshi Tang
- Huaqiang Wu
Nature Electronics (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Physical reservoir computing with emerging electronics

Task-adaptive physical reservoir computing

Brownian reservoir computing realized using geometrically confined skyrmion dynamics

Introduction

Results

Response of nanoring arrays

Signal sub-sample reservoir

Single dynamical node reservoir

Revolving neurons reservoir

Conclusion

Methods

Device fabrication

Electrical transport measurements of ring arrays

Reservoir computing

Signal subsample reservoir

Single dynamical node reservoir

Rotating neurons reservoir

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Peer review file

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Physical reservoir computing with emerging electronics

Comments

Search

Quick links