Introduction

Recently, artificial intelligence (AI) systems based on advanced machine learning algorithms have attracted a surge of interest for their potential to process the information hidden in large datasets1,2. Wave-based analog implementations of these schemes, exploiting microwave or optical neural networks, promise to revolutionize our ability to perform a large variety of challenging data processing tasks by allowing for power-efficient and fast neuromorphic computing at the speed of light. Indeed, wave-based analog processors work directly in the native domain of an analog signal, processing it while the wave propagates through an engineered artificial structure (metamaterials and metasurfaces)3,4,5,6, as previously established for simple linear operations such as image differentiation, signal integration, and the solution of integro-differential equations7,8,9,10,11,12,13,14,15,16,17. More complex processing tasks, for example image recognition or speech processing, require both nonlinearity and a high degree of interconnection between the elements, requirements that have led to various proposals of neuromorphic processors exploiting optical diffraction, coupled waveguide networks, disordered structures18,19,20,21,22,23,24,25,26,27, or coupled oscillator chains28,29. A particularly vexing challenge, however, is the implementation of nonlinear processing elements. While power-efficient neuromorphic schemes require pronounced nonlinearities of a specific form, optical nonlinearities, such as those of Kerr dielectrics, are typically weak at low intensities and cannot easily be controlled. This leads to sub-optimal systems that must operate at high input powers25,30,31,32. As an alternative, nonlinearities external to the wave-based processor have also been considered, for example by exploiting the intensity dependency of a sensor, which requires an additional electronic interconnection. Unfortunately, relying on such weak and poorly controllable nonlinearities drastically limits the performance of most machine learning schemes, and the relevance of wave-based platforms has so far been largely restricted to the implementation of simple linear matrix projections.

Here, we propose to leverage the physics of wave systems that are periodically modulated in time, the so-called time-Floquet systems33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48, to solve this vexing challenge by implementing a strong, controllable nonlinear entanglement between all the neuron signals. We propose to use a simple, thin, uniform dielectric slab whose refractive index is slowly and weakly modulated in time. With the addition of linear random scattering disorder, we implement very efficient recurrent neural network (RNN) schemes, namely the extreme learning machine (ELM) and reservoir computing (RC). We demonstrate the high accuracy of our Floquet extreme learning machine in challenging computing tasks, from the processing of one-dimensional data (learning nonlinear functions) to challenging multi-dimensional data (e.g., the abalone dataset classification problem). We also demonstrate the flexibility of our scheme, which can be multiplexed to tackle two unrelated classification tasks at the same time, simultaneously sorting COVID-19 X-ray lung images and handwritten digits. Finally, we validate our Floquet RC by predicting the time evolution of a chaotic system over a long time horizon (the Mackey-Glass time-series). The reservoir size of the proposed wave-based reservoir computing system is enhanced by leveraging both spatial and spectral domains, improving the learning performance compared to prior works without imposing additional filters or a larger computational overhead. Such extreme time-Floquet analog learning machines are not only fast, easy to train, power-efficient, and versatile, but also reach an accuracy comparable to that of the best digital schemes.

Results

We consider a particular class of neural networks, known as recurrent neural networks (RNNs). RNNs are ideal for processing intricate data thanks to the cyclic connections between their internal neurons, whose outputs depend on both the current inputs and the previous states of the neurons49. This memory effect allows RNNs to detect recursive relations in the data, which are relevant, for example, to the processing of temporal signals. In digital implementations, however, the heavy internal connectivity matrices involved in the training process make RNNs particularly computationally expensive and complicated50,51,52,53. To address these challenges, a number of alternative computing approaches such as long short-term memory (LSTM)54, echo state networks (ESNs)55, extreme learning machines (ELMs)56,57,58, and reservoir computing (RC)51,52,53,59 have emerged. These schemes are particularly well suited for wave-based implementations, because wave propagation inherently relies on the inertial memory of the medium, which can be enhanced and engineered by leveraging resonant elements or multiple scattering. In addition, wave interference is a particularly efficient way to create a high degree of interconnection between a large set of inputs.

Our time-Floquet neuromorphic processor implements an ELM, schematically shown in Fig. 1a. ELMs, and closely related methods based on random neural networks60 or support vector machines61, are a powerful scheme in which only the last layer of connections is trained (in blue). The fundamental mechanism is the use of the non-trained part of the network, whose layers are represented in gray and red in Fig. 1a, to establish a nonlinear mapping between the initial space of the dataset and a higher-dimensional feature space, where a properly trained classifier performs the separation and classification. In our case, this nonlinear mapping is performed by letting one of the non-trained layers (in red) be weakly modulated in time, at a frequency much lower than that of the signal and with a modulation phase that depends on the input state.

Fig. 1: Wave-based time-Floquet extreme learning machine.

a Schematic of a neural network including a time-Floquet layer made from neurons whose properties are modulated periodically in time, and traditional random layers. Only the last (output) layer is trainable. b Concrete implementation with electromagnetic waves. The input signals \({\zeta }_{n}^{{{{{{\mathrm{in}}}}}}}\) are modulated at ω1 and ω2. The sum of these frequency components forms input signals that are independently radiated into the surrounding space by an array of source antennas (black disks). As the waves propagate in the green region, they encounter a thin dielectric slab whose index of refraction is modulated at the frequency ωm = (ω1 − ω2)/2, as well as five sub-wavelength scatterers randomly located in the domain. The modulation phase depends on the input vector \({\zeta }_{n}^{{{{{{\mathrm{in}}}}}}}(t)\). The gray rectangle represents an absorbing boundary layer. The outputs \({\zeta }_{n}^{{{{{{\mathrm{out}}}}}}}(t)\) are fed into an adaptable dense layer (blue), and used for regression and classification. c Nonlinear phase entanglement. The modulated slab mixes signals at ω1 and ω2 into Floquet harmonics spaced by ωm, whose interferences depend non-linearly on the input vector.

A concrete implementation of this scheme in a wave platform is shown in Fig. 1b. It consists of three parts: (i) an array of monopole antennas that radiates the various components of the input vector into the surrounding medium; (ii) a propagation space composed of a few scatterers and a thin dielectric slab, called a scattering time-modulated slab (STMS), whose index of refraction is weakly modulated in time; and (iii) the output layer, made of an array of receiving antennas and a single dense layer, digitally trained to perform the desired regression or classification tasks. At the input layer, the input vector ζin with components \({\zeta }_{1}^{{{{{{{{\rm{in}}}}}}}}},...,{\zeta }_{N}^{{{{{{{{\rm{in}}}}}}}}}\) is first encoded into N signals \({s}_{i}^{{{{{{{{\rm{in}}}}}}}}}\), injected directly into the source antenna array (see further details in section Details of the proposed wave-based ELM architecture of the Methods). We assume that ζin is modulated at two distinct, close-by frequencies ω1 and ω2, such that:

$${s}_{i}^{{{{{{{{\rm{in}}}}}}}}}={\zeta }_{i}^{{{{{{{{\rm{in}}}}}}}}}\left(\sin ({\omega }_{1}t)+\sin ({\omega }_{2}t)\right).$$
(1)

The permittivity ϵr of the STMS is modulated with a depth δm and a phase ϕ, at a frequency ωm = (ω1 − ω2)/2, so that \({\epsilon }_{r}={\epsilon }_{s}+{\delta }_{m}\cos ({\omega }_{m}t+\phi )\). This choice of modulation frequency allows the two input frequencies to be efficiently mixed at the dominant Floquet harmonic (ω1 + ω2)/2 (see Fig. 1c). As we will now see, the reflection and transmission coefficients of the Floquet harmonics can show a strongly nonlinear dependency on the modulation phase, a key property that we will leverage to make the ELM very efficient.
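As an illustration of the encoding of Eq. (1) and of the slab modulation, the minimal Python sketch below builds the injected signals and the time-varying permittivity; the carrier frequencies, time window, and phase-scaling rule ϕ = γζ̄in are placeholder values consistent with the text, not the exact simulation parameters.

```python
import numpy as np

# Placeholder carrier and modulation frequencies (w_m = (w1 - w2)/2, as in the text)
w1, w2 = 2 * np.pi * 4.10e12, 2 * np.pi * 4.00e12   # two close-by input frequencies
wm = (w1 - w2) / 2                                   # modulation frequency of the slab

t = np.linspace(0, 2e-11, 4000)                      # time axis (arbitrary window)
zeta_in = np.random.rand(10)                         # normalized input vector, N = 10

# Eq. (1): each input component amplitude-modulates the two-tone carrier
s_in = zeta_in[:, None] * (np.sin(w1 * t) + np.sin(w2 * t))   # shape (N, len(t))

# Input-dependent modulation phase (phase entanglement, phi = gamma * mean(zeta_in))
gamma = 2 * np.pi
phi = gamma * zeta_in.mean()

# Time-modulated permittivity of the STMS: eps_r(t) = eps_s + delta_m * cos(wm t + phi)
eps_s, delta_m = 3.0, 0.3
eps_r = eps_s + delta_m * np.cos(wm * t + phi)
```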

To understand how time-Floquet systems can be used to induce a large nonlinear entanglement between the incident and reflected signals, let us consider the toy model of a generic two-port time-Floquet system, where the incident and reflected signals at ports 1 and 2 are represented by their time-varying complex amplitudes a1,2(t) and b1,2(t). This model applies to each plane wave incident on our STMS with transverse wave number k, on which the actual field can be decomposed. Assuming the modulation frequency ωm to be much smaller than the operation frequency ωk62,63, we can neglect dispersive effects and write the following instantaneous relation between the signals at each port62,63,64:

$$\left[\begin{array}{c}{a}_{1}(t)\\ {b}_{1}(t)\end{array}\right]=\tilde{{{\Psi }}}({\omega }_{k},t)\left[\begin{array}{c}{a}_{2}(t)\\ {b}_{2}(t)\end{array}\right],$$
(2)

where \(\tilde{{{\Psi }}}({\omega }_{k},t)\) is the transfer matrix at ωk, which varies slowly with time. Taking the Fourier transform of both sides yields

$$\left[\begin{array}{c}{A}_{1}(\omega )\\ {B}_{1}(\omega )\end{array}\right]= \,\tilde{{{\Psi }}}({\omega }_{k},\omega )* \left[\begin{array}{c}{A}_{2}(\omega )\\ {B}_{2}(\omega )\end{array}\right] \\ = \int \tilde{{{\Psi }}}({\omega }_{k},\omega -\omega ^{\prime} )\left[\begin{array}{c}{A}_{2}(\omega ^{\prime} )\\ {B}_{2}(\omega ^{\prime} )\end{array}\right]d\omega ^{\prime} ,$$
(3)

Since the scattering process into each Floquet harmonic component is linear, we can define the reflection and transmission coefficients into each harmonic as R0(ωk + nωm) = B1(ωk + nωm)/A1(ωk) and T0(ωk + nωm) = A2(ωk + nωm)/A1(ωk). A direct calculation shows that (see Sec. 1 of the Supplementary Material for detailed derivations):

$${R}_{\phi }({\omega }_{k}+n{\omega }_{m})={e}^{in\phi }{R}_{0}({\omega }_{k}+n{\omega }_{m})$$
(4)
$${T}_{\phi }({\omega }_{k}+n{\omega }_{m})={e}^{in\phi }{T}_{0}({\omega }_{k}+n{\omega }_{m}),$$
(5)

where we have used the notation Rϕ to highlight the dependency of the scattering coefficients on the modulation phase ϕ. These equations imply that upon adding a phase delay ϕ to the modulation, the generated frequency harmonic of order n acquires a phase shift of nϕ, both for the forward- and backward-scattered plane waves. The amplitude of the harmonic waves, on the other hand, remains constant when the phase delay is altered.
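This phase property can be checked numerically for the lowest harmonics (n = ±1) with a toy adiabatic model, in which the transfer of the slab is replaced by a simple slow amplitude modulation; the sketch below is such a self-contained check, with all values illustrative and not taken from the full STMS simulation.

```python
import numpy as np

N = 4096                      # samples over one observation window
n = np.arange(N)
k0, m0 = 200, 10              # integer cycle counts: carrier and modulation frequencies
phi = 0.7                     # modulation phase delay to be recovered
delta = 0.1                   # weak modulation depth

# Complex carrier slowly modulated in amplitude: an adiabatic two-tone toy model
carrier = np.exp(1j * 2 * np.pi * k0 * n / N)
modulation = 1 + delta * np.cos(2 * np.pi * m0 * n / N + phi)
y = carrier * modulation

Y = np.fft.fft(y)
phase = lambda k: np.angle(Y[k])

# The n-th Floquet harmonic (here n = +1, -1) acquires a phase shift of n*phi
print(np.round(phase(k0 + m0) - phase(k0), 3))   # ~ +phi
print(np.round(phase(k0 - m0) - phase(k0), 3))   # ~ -phi
```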

Now, consider the superposition of two incident plane waves at frequencies ω1 and ω2. Recalling our choice of modulation frequency, namely ωm = (ω1 − ω2)/2, we can write the reflected and transmitted waves for all Floquet harmonic components of frequency ω1 + nωm = ω2 + mωm by using the superposition principle:

$$\left|{R}_{\phi }^{\prime}\right|=\left|{e}^{in\phi }{R}_{0}({\omega }_{1},{\omega }_{1}+n{\omega }_{m})+{e}^{im\phi }{R}_{0}({\omega }_{2},{\omega }_{2}+m{\omega }_{m})\right|$$
(6)
$$\left|{T}_{\phi }^{\prime}\right|=\left|{e}^{in\phi }{T}_{0}({\omega }_{1},{\omega }_{1}+n{\omega }_{m})+{e}^{im\phi }{T}_{0}({\omega }_{2},{\omega }_{2}+m{\omega }_{m})\right|,$$
(7)

where n and m are the orders of the Floquet harmonics with respect to ω1 and ω2, respectively. A particular example is the harmonic located at the average frequency ω = (ω1 + ω2)/2, for which n = 1 = −m (orange spectrum in Fig. 1c). According to Eqs. 6 and 7, the relation between the modulation phase and the intensity of the scattered harmonic fields is highly nonlinear. In fact, we can control the amplitude of the Floquet harmonics simply by changing the modulation phase. In order to obtain a nonlinear input–output mapping, we must therefore entangle the phase delay with the input vector (i.e., ϕ = f(ζin)). This can be done by using a simple voltage-controlled phase shifter (VCP) (see further details in section Details of the proposed wave-based ELM architecture of the Methods). In other words, the value of the modulation phase is directly determined by the value of the input vector, which is fixed when the system is excited, automatically turning the scattering process into a highly nonlinear function of the input, regardless of the input power. This makes such time-Floquet nonlinear entanglement highly advantageous in neuromorphic computing schemes.
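To make the interference argument of Eq. (7) concrete, the short sketch below evaluates |T′φ| for the central harmonic (n = 1 = −m) as a function of an input-dependent phase ϕ = 2πζin; the two complex transmission coefficients are arbitrary illustrative values, not those of the simulated STMS.

```python
import numpy as np

# Arbitrary illustrative transmission coefficients into the central harmonic
T1 = 0.10 - 0.25j      # stands for T0(w1, w1 + wm)
T2 = 0.15 + 0.10j      # stands for T0(w2, w2 - wm)
n, m = 1, -1           # central harmonic: w = (w1 + w2)/2

zeta_in = np.linspace(0, 1, 200)       # normalized scalar input
phi = 2 * np.pi * zeta_in              # phase entanglement phi = f(zeta_in)

# Eq. (7): interference of the two Floquet pathways
T_central = np.abs(np.exp(1j * n * phi) * T1 + np.exp(1j * m * phi) * T2)

# |T'| oscillates nonlinearly with the input, even though the wave physics is linear
print(T_central.min(), T_central.max())
```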

To exemplify the strong nonlinear response of the proposed system, we plot the amplitude of the transmitted central harmonic (ω = (ω1 + ω2)/2) as a function of several variables, including the phase delay ϕ. The results are displayed in Fig. 2a–d. We fix one of the harmonics and plot \({T}_{\phi }^{\prime}\) versus the modulation phase and the real or imaginary part of the other transmitted harmonic, T(ω1 + ωm) (or T(ω2 − ωm)). As can be seen in Fig. 2a–d, we indeed obtain a complex, nonlinear, semi-sinusoidal dependence of \({T}_{\phi }^{\prime}\) on the modulation phase. The dependency on the real or imaginary part of the other transmitted harmonic is also always nonlinear.

Fig. 2: Nonlinear Floquet entanglement.

a–d Theoretical demonstration of the nonlinear dependency of the intensity \({T}_{\phi }^{\prime}\) of the central Floquet harmonic (ω = (ω1 + ω2)/2) on both the modulation phase ϕ and the real or imaginary part of one of the generated harmonics. The results are based on Eq. 7. The fixed parameters are: a T(ω1 + ωm) = 0.1 − 0.25i and {T(ω2 − ωm)} = 0.1. b T(ω2 − ωm) = 0.1 − 0.25i and {T(ω1 + ωm)} = 0.1. c T(ω1 + ωm) = 0.1 − 0.05i and {T(ω2 − ωm)} = 0.05. d T(ω2 − ωm) = 0.1 − 0.05i and {T(ω1 + ωm)} = 0.05. e, f Numerical demonstration of the linear/nonlinear Floquet entanglement of the central harmonic wave at different readout nodes as a function of input intensity, for a static (e) and an input-dependent (f) phase delay.

Next, we implement the entanglement with the input vector to demonstrate the complex nonlinear behavior of the Floquet system, using a full-wave finite-difference time-domain simulation of the setup of Fig. 1b (see Methods). We compute the intensity of the central harmonic with respect to the input intensity for two different scenarios: a static phase delay and an entangled phase delay. In the first scenario, the phase delay is fixed and does not depend on the input (ϕ = 0), and as shown in Fig. 2e, the harmonic intensities are linear in the input intensities. In the second scenario, the phase delay is a simple linear function of the input (i.e., ϕ = 2πζin). Figure 2f shows the resulting complex nonlinear response of the proposed system. The oscillating nonlinear mapping performed by the proposed system is completely different from any earlier approach. As we will show, it is surprisingly effective in transforming the input data space into a nearly linearly separable output data space.

Note that an alternative approach to reaching such a highly nonlinear input–output mapping is to entangle the input data with the modulation depth instead of the modulation phase. In this case, no phase shifters are needed. Section 2 of the Supplementary Material provides more details about this alternative, including a demonstration of its high performance in transforming the input data space into a nearly linearly separable output data space.

Learning highly nonlinear functions

We now demonstrate the performance of the Floquet ELM, starting with simple regression problems on a dataset generated with nonlinear relations. Such a dataset is often used as a standard benchmark in machine learning, since linear regression of a nonlinear function is impossible without a nonlinear transformation31,56. The input information (ζin) is a set of randomly generated numbers between −π and π, and the corresponding output labels (yi) are generated according to nonlinear functions, namely \({y}_{1}=\alpha \sin (4\pi {\zeta }^{in})(| {\zeta }^{in}| /\pi )\), y2 = rect(ζin) (pulse function), and \({y}_{3}=\sin (\pi {\zeta }^{in})/(\pi {\zeta }^{in})\). We use 1000 randomly generated samples, which lie in [−π, π] to cover the entire characteristic behavior of each function. We map each input value to a vector by multiplying it with a fixed random 1D vector (mask), here of dimension 1 × 10. In this task, we use 10 input and 20 readout nodes. By recording the intensity of the harmonics at the readout nodes for many input values, linear regression is performed on the output data (see Fig. 3a–c). A remarkable learning performance, with a very low root-mean-squared error (RMSE), is obtained for all three nonlinear functions. Interestingly, in the proposed wave-based neural network with a nonlinear time-Floquet layer, the multiple generated harmonic fields can be used to extend the dimension of the nonlinear mapping, and increasing their number improves the accuracy of classification/regression. This tendency is demonstrated in Fig. 3d–f, which plots the RMSE versus the number of considered Floquet harmonics. This mechanism is a clear advantage of the Floquet ELM: by involving a larger number of scattered harmonics, we can reduce the RMSE and enhance the learning accuracy at no additional computational cost. It should be noted that in order to compute the outputs at the decision layer, we simply rescale the linear regression weights, without having to use additional filters (see further details in section Training of readout of the Methods).
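The end-to-end training flow described above can be summarized by the minimal sketch below, in which the physical STMS is replaced by a surrogate nonlinear map standing in for the measured harmonic intensities; only the random input mask and the trainable linear readout follow the text, and the function name floquet_features is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Fixed random layers (never trained): input mask and a complex mixing matrix
# standing in for propagation through the scatterers and the STMS.
mask = rng.uniform(-1, 1, size=10)                        # 1 x 10 random input mask
W_mix = rng.normal(size=(20, 10)) + 1j * rng.normal(size=(20, 10))

def floquet_features(zeta):
    """Hypothetical surrogate for the harmonic intensities measured at the
    20 readout nodes; the real features come from the FDTD-simulated STMS."""
    x = zeta * mask                                       # random input encoding
    phi = 2 * np.pi * np.mean(np.abs(x))                  # input-entangled phase
    a = W_mix @ x                                         # linear wave mixing
    return np.abs(np.exp(1j * phi) * a + np.exp(-1j * phi) * np.conj(a))

# Dataset: learn y3 = sin(pi*zeta)/(pi*zeta) on [-pi, pi]
zetas = rng.uniform(-np.pi, np.pi, size=1000)
y = np.sinc(zetas)                                        # np.sinc(x) = sin(pi x)/(pi x)

H = np.array([floquet_features(z) for z in zetas])        # feature matrix (1000 x 20)

# Train only the last (linear) layer, here by regularized least squares
lam = 1e-6
w_out = np.linalg.solve(H.T @ H + lam * np.eye(H.shape[1]), H.T @ y)

rmse = np.sqrt(np.mean((H @ w_out - y) ** 2))
print("training RMSE:", rmse)
```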

Fig. 3: Floquet extreme learning for highly nonlinear maps.

a–c Comparison between ground truth and predicted values for three different nonlinear functions: \({y}_{1}=\alpha \sin (4\pi {\zeta }^{in})(| {\zeta }^{in}| /\pi )\), y2 = rect(ζin), and \({y}_{3}=\sin (\pi {\zeta }^{in})/(\pi {\zeta }^{in})\), respectively. d–f The corresponding values of the root-mean-square error (RMSE) upon increasing the number of involved Floquet harmonics at the readout nodes.

We can explain the learning principle of the proposed computing system with well-known kernel methods. Kernel methods use kernels (or basis functions) to map the input data into a feature space. After this mapping, simple models can be trained on the new feature space, instead of the input space, which can result in an increase in the performance of the models65,66. We can describe the projections of input samples in the feature space by \({T}^{\prime}={H}_{{{{{{\mathrm{non}}}}}}}\left(p({\zeta }^{{{{{{\mathrm{in}}}}}}})\right)\), where p is the encoding function, here for example \(p({\zeta }^{{{{{{\mathrm{in}}}}}}})={\zeta }^{{{{{{\mathrm{in}}}}}}}\left({{{{{\mathrm{sin}}}}}}({\omega }_{1}t)+{{{{{\mathrm{sin}}}}}}({\omega }_{2}t)\right)\), and Hnon is a nonlinear and complex function associated with the time-Floquet entanglement. Essentially, Hnon can be seen as an explicit form of optical kernel function that contains both the multiple scattering occurring in the medium and the complex form of the nonlinearity. This kernel contains several polynomial basis functions, {x, x2, x3, x4, . . . }, as can be shown theoretically by Taylor expansion of Eqs. 6 and 7 (see the expansion sketched below). Hence, we have a combination of different orders of polynomial mappings with random coefficients in the feature space for each readout node, so that different features of the input data are transferred into the feature space. The proposed kernel is thus expected to be very efficient in performing all tasks, even when compared with a strongly nonlinear kernel such as the modulus-square operator (x2), typically found in the sensors and detectors used in prior art.
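As an illustration of this polynomial structure, consider the intensity of the central harmonic of Eq. (7) with n = 1 = −m and a linear phase entanglement ϕ = γζin for a scalar input, writing T1 and T2 as shorthand for the two transmission coefficients:

$$\left|{T}_{\phi }^{\prime}\right|^{2}=\left|{e}^{i\phi }{T}_{1}+{e}^{-i\phi }{T}_{2}\right|^{2}=|{T}_{1}|^{2}+|{T}_{2}|^{2}+2|{T}_{1}{T}_{2}|\cos \left(2\gamma {\zeta }^{{{{{{\mathrm{in}}}}}}}+\theta \right),\qquad \theta =\arg ({T}_{1}{T}_{2}^{* }).$$

Taylor-expanding the cosine in ζin then generates the polynomial basis {1, ζin, (ζin)2, (ζin)3, . . . } with fixed coefficients set by T1, T2, and γ, which differ from one readout node to the next.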

To prove this quantitatively, we compare the feature-space projection of our kernel with the most commonly used form of nonlinearity: a square law at the detector, x2. As a reference, we also consider the purely linear case. As an example, we consider again the nonlinear interpolation of y = sinc(x). In order to perform well, the data projected in the feature space should be highly nonlinear with respect to the feature coordinates. To visualize this, we use principal component analysis (PCA) to reduce the dimension of the output data, because it lives in a high-dimensional space (10 dimensions, set by the number of readout nodes). PCA is a linear projection that transforms correlated variables into new variables that are de-correlated from each other. These new variables are called “principal components” or principal axes67. In Fig. 4a, b, we calculate and plot the projected data in a 3D PCA space for three distinct cases: linear, x2 nonlinearity, and time-Floquet entanglement. Panels (a) and (b) show that in both the linear and x2 cases, the projected data lie on a line, whereas in the case of the time-Floquet entanglement, the data follow a highly nonlinear relation with respect to the principal axes. For this reason, the sinc(x) interpolation fails when using either a linear system or one with x2 nonlinearity at the detector (see Fig. 4c, d). Conversely, time-Floquet entanglement performs extremely well on the sinc problem (see Fig. 4e).
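A minimal sketch of this visualization step is given below, using surrogate feature maps (linear, square-law, and a Floquet-like interference map) built from a scalar input rather than the FDTD data, and scikit-learn's PCA as one possible implementation choice; none of these specific arrays are taken from the paper.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(1)
zeta = rng.uniform(-np.pi, np.pi, size=1000)               # scalar inputs
W = rng.normal(size=10) + 1j * rng.normal(size=10)         # fixed random projection

# Surrogate feature maps to 10 readout nodes (illustrative, not the FDTD data)
base = np.outer(zeta, W.real)                              # purely linear system
H_linear = base
H_square = base ** 2                                       # square-law detector
phi = 2 * np.pi * np.abs(zeta)[:, None] / np.pi            # input-entangled phase
H_floquet = np.abs(np.exp(1j * phi) * W
                   + np.exp(-1j * phi) * np.conj(W) * zeta[:, None])

def first_three_pcs(H):
    """Project the readout features onto the first three principal axes."""
    return PCA(n_components=3).fit_transform(H)

pcs = {name: first_three_pcs(H)
       for name, H in (("linear", H_linear), ("x^2", H_square), ("Floquet", H_floquet))}
print({name: p.shape for name, p in pcs.items()})          # each is (1000, 3)
```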

Fig. 4: PCA analysis of the proposed optical kernel.

a, b Two different perspectives of the projected data for three different cases: (i) the linear case, (ii) square-law nonlinearity (x2), and (iii) the proposed nonlinearity. c–e sinc(ζin) interpolation results for the three aforementioned cases, respectively.

Abalone dataset

In the previous section, we used our Floquet ELM to learn nonlinear functions and demonstrated its interpolation capability. However, interpolation is not always the relevant task, especially in complex inference problems. Therefore, we now move to a more challenging multivariable problem: the abalone dataset. This dataset is one of the most commonly used benchmarks in machine learning and concerns the prediction of the age of sea snails from their physical parameters. It lists eight physical features of sea snails that can be used to predict their age. To tackle this problem with our Floquet ELM, we encode the 8 physical features of the sea snails on our input nodes (8 input nodes), and consider 50 readout nodes to feed the decision layer, which performs a linear regression. Figure 5a presents the true ages and the corresponding predictions; the figure indicates that the framework learns the ages of the abalone with remarkable accuracy. For a direct comparison, we plot the predicted values for 75 random input samples (Fig. 5c). The RMSE as a function of the number of harmonic waves is plotted in Fig. 5b. A remarkable accuracy (RMSE = 0.064) can be achieved by considering five generated harmonics. The achieved RMSE is smaller than the best value reported in prior art31.

Fig. 5: Floquet extreme learning for multivariable regression.

a, b Learning of the abalone dataset and corresponding RMSE for different numbers of considered harmonic waves, respectively. c Comparison between predicted data (blue) and ground truth (red).

Parallel image classifications

Another remarkable feature of time-Floquet systems is that, since the inputs are modulated at a certain carrier frequency, we can use several frequency bands and multiplex different signals to classify them simultaneously using the same system, at no additional cost in terms of power consumption. Let us now demonstrate this in a specific, complex parallel classification task. We examine the possibility of performing parallel image classification using inputs at two different wavelengths. We use two distinct datasets: the MNIST dataset of handwritten digits and the COVID-19 X-ray images (see Fig. 6a, b). We resize all of the images to 10 × 10 pixels, down-sampling them to decrease the number of input and readout nodes and the total size of our structure. In this task, we use 100 nodes to encode the images in the amplitude of the input waves, and 100 readout nodes. The MNIST data are encoded onto a (randomly selected) frequency range from 4 to 4.125 THz, and the COVID-19 data are encoded between 4.375 and 4.5 THz (see the red and blue frequency bands in Fig. 6c). In the output layer, we use softmax regression to perform the classifications (see Methods).
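A minimal sketch of the frequency-multiplexing idea is given below: two flattened 10 × 10 images are placed on separate carrier bands (the 4–4.125 THz and 4.375–4.5 THz ranges mentioned in the text) and summed into one multiplexed excitation per input node; the exact carrier positions inside each band and the time window are assumptions for illustration only.

```python
import numpy as np

n_pixels = 100                                     # flattened 10 x 10 images
img_mnist = np.random.rand(n_pixels)               # placeholder MNIST image (normalized)
img_covid = np.random.rand(n_pixels)               # placeholder COVID-19 X-ray image

# Each task gets its own pair of close-by carriers inside its band (band edges
# are from the text; the carrier positions inside each band are assumptions).
w1a, w2a = 2 * np.pi * 4.125e12, 2 * np.pi * 4.000e12     # MNIST band
w1b, w2b = 2 * np.pi * 4.500e12, 2 * np.pi * 4.375e12     # COVID-19 band

t = np.linspace(0, 2e-11, 8192)

# Following Eq. (1) for each band, then summing: both tasks share the same
# antennas and the same time-Floquet slab, at no extra power cost per task.
signals = (img_mnist[:, None] * (np.sin(w1a * t) + np.sin(w2a * t))
           + img_covid[:, None] * (np.sin(w1b * t) + np.sin(w2b * t)))
print(signals.shape)   # (100, 8192): one multiplexed waveform per source antenna
```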

Fig. 6: Floquet extreme learning for parallel image classification.

a, b Examples of realizations taken from the MNIST and COVID-19 X-ray datasets. c Simulated spectra of readout nodes for two different channels. d, e Evolution of the loss function for MNIST and COVID-19 classifications, respectively. f, g Corresponding confusion matrices, for condition classification.

The training results are shown in Fig. 6d–g. The observed test accuracies were 88.2% for the COVID-19 dataset and 85.3% for the MNIST dataset. These classification accuracies are competitive: for example, they are higher than those reported in reference50 for parallel image classification. They are also comparable with other relevant works, despite the reduced pixel count of the images53. In addition, this frequency-multiplexing technique is the first demonstration of wave-based parallel task processing with extreme learning. It enables the use of a wide bandwidth as a computational resource, which significantly boosts computation efficiency.

Nonlinear time-Floquet-based RC system for autonomous forecasting of chaotic time-series

To show the high versatility of the proposed nonlinear time-Floquet neuromorphic computing system, we slightly modify it to implement a reservoir computing (RC) scheme. Consider an input vector i(t) that is injected into a high-dimensional dynamical system called the reservoir. The reservoir is described by a vector h(t), and its initial state is defined randomly. Let the Wres matrix define the internal connections of the reservoir nodes and the Win matrix define the connections between the input and the reservoir nodes. Both matrices are initialized randomly and kept fixed during the whole RC training process. The state of each reservoir node is a scalar h(t), which evolves according to the following recursive relation:

$$h(t+\tau )=F\left({w}_{{{{{{\mathrm{in}}}}}}}i(t)+{w}_{{{{{{\mathrm{res}}}}}}}h(t)\right)$$
(8)

where τ is the discrete time-step of the input and F is a nonlinear function. From Eq. 8, we see that the reservoir is a dynamical system endowed with a unique memory property: each subsequent state of the reservoir contains some information about its previous states and about the inputs injected up to that time. In the training phase, the input i(t) is fed to the reservoir, and the corresponding reservoir states are recursively calculated. The final step of the information processing is to perform a simple linear regression that adjusts the Wout weights so as to minimize the RMSE. The output can then be computed as O(t) = Wouth(t). It should be noted that the output weights are the only parameters that are modified during training. The input and reservoir weights are fixed throughout the whole computational process, and they are used to randomly project the input into a high-dimensional space, which increases the linear separability of the inputs.
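For reference, a generic software reservoir following Eq. (8), with a tanh nonlinearity standing in for the physical map F and a ridge-regression readout, can be sketched as follows; this is a conventional echo-state-network style baseline (spectral-radius scaling and tanh are standard choices), not the wave-based implementation itself.

```python
import numpy as np

rng = np.random.default_rng(0)
n_res, n_in = 100, 1                      # reservoir and input dimensions

# Random, fixed connection matrices (never trained)
W_in = rng.uniform(-0.5, 0.5, size=(n_res, n_in))
W_res = rng.normal(size=(n_res, n_res))
W_res *= 0.9 / np.max(np.abs(np.linalg.eigvals(W_res)))   # scale spectral radius below 1

def run_reservoir(inputs):
    """Iterate Eq. (8) with F = tanh and collect the reservoir states."""
    h = np.zeros(n_res)
    states = []
    for u in inputs:
        h = np.tanh(W_in @ np.atleast_1d(u) + W_res @ h)
        states.append(h.copy())
    return np.array(states)

def train_readout(states, targets, lam=1e-6):
    """Ridge regression for the only trainable weights, W_out."""
    return np.linalg.solve(states.T @ states + lam * np.eye(n_res),
                           states.T @ targets)
```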

In our concrete scheme, we implement this memory using a feedback loop, and use the intensity of the harmonic waves as reservoir states. Reservoir computing in our scheme can be described by the following recursive relation:

$${T}_{\phi }^{\prime}(t+\tau )=F\left({w}_{{{{{{\mathrm{in}}}}}}}i(t)+{w}_{{{{{{\mathrm{res}}}}}}}{v}_{h}{T}_{\phi }^{\prime}(t)\right)$$
(9)

where F is the nonlinear function describing our system, vh is a tunable parameter that selects one (or more) harmonics as reservoir states, and \({T}_{\phi }^{\prime}\) is the intensity of the transmitted harmonic waves. In general, RC and its different implementations have proven very successful for various tasks, such as spoken-digit recognition, the temporal exclusive-OR task, and Mackey-Glass or Nonlinear Autoregressive Moving Average time-series prediction68,69.

We use the nonlinear time-Floquet RC for the prediction of chaotic time-series. Forecasting chaotic time-series is an extremely difficult task due to the accumulation of quantitative differences between the ground truth and the predicted value in subsequent predictions, which leads to exponentially growing errors at large times. Indeed, the positive Lyapunov exponent of chaotic systems leads to the exponential separation of close trajectories, so that even small prediction errors can quickly make the prediction diverge from the ground truth49. We test our system on the Mackey-Glass time-series defined by49,70

$$\frac{dy}{dt}=\beta \frac{y(t-\tau )}{1+{\left(y(t-\tau )\right)}^{n}}-\gamma y(t)$$
(10)

Although this delay differential equation is deterministic, predicting the resulting time-series for specific parameter values is difficult, which is why it is widely used as a benchmark for challenging forecasting tasks. To obtain chaotic dynamics, we set the parameters β = 0.2, γ = 0.1, τ = 18, n = 10. During the training phase, as soon as the reservoir states are calculated, a simple linear regression is executed to adjust the Wout weights such that their linear combination with the calculated reservoir states brings the actual output as close as possible to the next time-step of the input. Finally, to autonomously predict the future evolution of i(t), we create a feedback loop from the output to the input by replacing the next input i(t + 1) with the one-step prediction Wouto(t), as is done in conventional RC. The ability of the proposed RC system to predict time-series is tested using a reservoir with 100 input nodes and 50 readout nodes. We consider the middle harmonic as the reservoir state and use the input, \({\zeta }_{n}^{{{{{{\mathrm{in}}}}}}}\), to feed our RC system at each iteration (Eq. 9). All of the harmonic intensities and reservoir states are then applied to the readout layer (see Methods) to generate the predicted data for the next time-step. Figure 7 shows the training results obtained from the simulation. Excellent agreement between the target and the predicted value is obtained, indicating that the trained readout weights can correctly calculate the next time-step signal on the basis of the internal states of the reservoir. Further evidence of successful training can be found by examining the network performance in regression and phase space, as shown in Fig. 7b, c, where an excellent agreement can again be observed.
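The Mackey-Glass series used here can be generated numerically, for instance with a simple forward-Euler integration of the delay differential equation of Eq. (10) with the stated parameters (β = 0.2, γ = 0.1, τ = 18, n = 10); the step size and constant initial history below are illustrative choices, not values from the paper.

```python
import numpy as np

def mackey_glass(n_steps, beta=0.2, gamma=0.1, tau=18, n=10, dt=1.0, y0=1.2):
    """Forward-Euler integration of Eq. (10):
    dy/dt = beta * y(t - tau) / (1 + y(t - tau)**n) - gamma * y(t)."""
    delay = int(round(tau / dt))
    y = np.full(n_steps + delay, y0)          # constant initial history on [-tau, 0]
    for k in range(delay, n_steps + delay - 1):
        y_tau = y[k - delay]
        y[k + 1] = y[k] + dt * (beta * y_tau / (1 + y_tau ** n) - gamma * y[k])
    return y[delay:]                          # discard the initial history

series = mackey_glass(1000)
print(series.min(), series.max())             # irregular, chaotic-looking oscillations
```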

Fig. 7: Floquet reservoir computing for forecasting the chaotic Mackey-Glass time-series.

a Training results: The ground truth (blue) and the predicted output from the RC system (red) are plotted. b Corresponding results of linear regression. c Trace of time-series values in phase space, with respect to the previous time-step for both ground truth and predicted values.

The network is then used to forecast the time-series autonomously. After training for 400 time-steps, the output of the readout function, that is, the predicted data for the next time-step, is connected back to the reservoir as the new input, and the system autonomously produces the forecasted time-series. Figure 8 shows the results of autonomous time-series prediction using the proposed RC system. The autonomously generated output (from the 400th time-step onwards) still matches the ground truth very well, showing the ability of the proposed RC system to autonomously forecast the chaotic system. After more than 70 time-steps of autonomous prediction, the predicted signal starts to diverge from the correct value, which is unavoidable given the chaotic nature of the series. Increasing the size of the reservoir further, by using more nodes and more previous states, may reduce the prediction error, so that the length of accurate prediction can be increased. Another solution for long-term forecasting without increasing the dimension of the system is to use a periodic update procedure as in ref. 49. In section 3 of the Supplementary Material, we compare the computing performance of the proposed system with prior works for all tasks.
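Building on the generic software reservoir sketched earlier, the closed-loop (autonomous) phase amounts to a short feedback iteration in which each one-step prediction replaces the next input; W_in, W_res, and w_out here refer to the hypothetical matrices of that earlier sketch, not to the physical system.

```python
import numpy as np

def autonomous_forecast(h, w_out, W_in, W_res, n_steps):
    """Closed-loop rollout: feed each one-step prediction back as the next input."""
    preds = []
    for _ in range(n_steps):
        u = w_out @ h                                        # one-step prediction
        h = np.tanh(W_in @ np.atleast_1d(u) + W_res @ h)     # Eq. (8) with the fed-back input
        preds.append(u)
    return np.array(preds)
```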

Fig. 8: Autonomous forecasting of Mackey-Glass time-series.

Training and forecasting results: a The ground truth (blue) and the predicted output from the RC system (red) for the next 100 time-steps are plotted. b Trace of time-series values in phase space, with respect to the previous time-step, for the prediction phase.

In conclusion, we have shown how nonlinear Floquet entanglement can be used to enable wave-based neuromorphic computing, by allowing for a strong and tailored nonlinear mapping to a higher-dimensional space without involving any nonlinear material. Our nonlinear time-Floquet learning machine can process information to compute complex tasks that are traditionally only tackled by slower, more sophisticated digital deep neural networks. In our benchmarks, the proposed computing platform performs as well as its digital counterparts. With better energy efficiency than previous proposals and a path to high scalability, our nonlinear time-Floquet system provides a unique route toward supercomputer-level optical computation.

Methods

Numerical simulations

We use a two-dimensional finite-difference time-domain (FDTD) method for all simulations71,72. Figure 9 shows the rectangular layout of the employed setup. We set the parameters ϵs = 3, δm = 0.3, and ωm = (ω1 − ω2)/2. Furthermore, the height and width of the propagation space, as well as the thickness of the STMS, are set to 15λ0, 10λ0, and λ0/4, respectively. We use five high-permittivity dielectric sub-wavelength scatterers, randomly located in the propagation region. The time- and space-discretization steps are set to dt = dx,y/(2c) and dx,y = λ0/30, respectively (c is the speed of light). We use 10,000 time-steps to ensure convergence.
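A heavily simplified sketch of such a 2D FDTD (TMz) update with a time-modulated slab is given below; absorbing boundaries, the antenna array, and the random scatterers are omitted, the modulation frequency ωm and source placement are illustrative assumptions, and the slowly varying permittivity is folded directly into the Ez update coefficient as a quasi-static approximation (a rigorous treatment would update D and divide by ε(t)).

```python
import numpy as np

c0, eps0, mu0 = 3e8, 8.854e-12, 4e-7 * np.pi
lam0 = c0 / 4e12                       # operating wavelength (~4 THz carrier)
dx = lam0 / 30                         # spatial step, as in the Methods
dt = dx / (2 * c0)                     # time step, as in the Methods

nx, ny, n_steps = 300, 450, 10000      # 10*lam0 x 15*lam0 domain, 10,000 steps
w0 = 2 * np.pi * 4e12
wm, delta_m, eps_s, phi = 0.01 * w0, 0.3, 3.0, 0.0   # slow, weak modulation (wm assumed)

Ez = np.zeros((nx, ny)); Hx = np.zeros((nx, ny)); Hy = np.zeros((nx, ny))
slab = np.zeros((nx, ny), dtype=bool)
slab[:, 220:228] = True                # thin slab, ~lam0/4 thick

for step in range(n_steps):
    t = step * dt
    eps_r = np.ones((nx, ny))
    eps_r[slab] = eps_s + delta_m * np.cos(wm * t + phi)

    # H-field updates (central differences, dy = dx)
    Hx[:, :-1] -= dt / (mu0 * dx) * (Ez[:, 1:] - Ez[:, :-1])
    Hy[:-1, :] += dt / (mu0 * dx) * (Ez[1:, :] - Ez[:-1, :])

    # E-field update with the instantaneous permittivity
    Ez[1:-1, 1:-1] += dt / (eps0 * eps_r[1:-1, 1:-1] * dx) * (
        (Hy[1:-1, 1:-1] - Hy[:-2, 1:-1]) - (Hx[1:-1, 1:-1] - Hx[1:-1, :-2]))

    # Soft source: a two-tone excitation at one (assumed) antenna location
    Ez[150, 50] += np.sin(w0 * t) + np.sin(0.98 * w0 * t)
```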

Fig. 9: A detailed schematic of the proposed wave-based ELM architecture.

On the left, the input side includes a signal generator (SG) with variable attenuators (VOA) that encode the information. A voltage-controlled phase shifter (VCP) is used to change the phase of the modulation depending on the applied voltage, which is fixed for each input and does not change in time. AA denotes the array of source antennas. Vp is the voltage required to control the VCP (\({V}_{p}=\gamma {\bar{\zeta }}^{{{{{{\mathrm{in}}}}}}}\)), where \({\bar{\zeta }}^{{{{{{\mathrm{in}}}}}}}\) and γ are the mean of the input vector and a scaling factor, respectively.

Details of the proposed wave-based ELM architecture

We encode the input information with simple circuit elements, namely variable attenuators and a voltage-controlled phase shifter. These devices are set externally to a certain operating point once the input is selected, and do not change as the neural network processes a given input. Therefore, no dynamic tuning, or conversion between amplitude and phase, is needed between the temporal signal sent by the generator and the modulation signal: just like in standard learning machines, they are simply set externally once an input is selected to be processed by the system. More details can be found in Fig. 9. The amplitude-coding scheme that we employ is simple and physically feasible using variable attenuators (VOA). Since the values of the input vector \({\zeta }_{n}^{{{{{{\mathrm{in}}}}}}}\) are normalized between zero and one, the input information can be encoded in the amplitude of the temporal signal generated by a single signal generator (SG), as illustrated in Fig. 9. The variable attenuators provide the input node amplitudes depending on the external voltages applied. In addition, a lower-frequency oscillator with a voltage-controlled phase shifter (VCP) is used to drive the modulation of the time-Floquet layer. VCPs are tunable, and the applied voltage is simply calculated by the external user from the input vector. A discussion of realistic physical platforms for realizing the time-Floquet layer is provided in section 4 of the Supplementary Material.

Training of readout

Here, we show how to train the decision layer using the data of the temporal signals received at the readout nodes, without using extra filtering operations. Consider \({\zeta }_{q}^{{{{{{\mathrm{out}}}}}}}(t)\) as the temporal signal received at output antenna q, and the discrete Fourier transform of the discretized signal \({\bar{\zeta }}_{q}^{{{{{{\mathrm{out}}}}}}}={\left({\zeta }_{q}^{{{{{{\mathrm{out}}}}}}}(0),{\zeta }_{q}^{{{{{{\mathrm{out}}}}}}}({t}_{0}),...,{\zeta }_{q}^{{{{{{\mathrm{out}}}}}}}((n-1){t}_{0})\right)}^{T}\) defined by \({y}_{{f}_{j}}=\mathop{\sum }\nolimits_{k = 0}^{n-1}{\zeta }_{q}^{{{{{{\mathrm{out}}}}}}}({t}_{k}){{{{{\mathrm{exp}}}}}}(\frac{-2\pi i}{n}jk)\), where j = 0, . . . , n − 1, ( ⋅ )T is the transpose operation, and t0 is the sampling time. The relation between the Fourier coefficients \({y}_{{f}_{j}}\) and the discretized signal is described by the so-called Vandermonde matrix:

$$\left(\begin{array}{c}{y}_{{f}_{0}}\\ {y}_{{f}_{1}}\\ \vdots \\ {y}_{{f}_{n-1}}\end{array}\right)=\left(\begin{array}{cccc}1&1&\cdots &1\\ 1&\kappa &\cdots &{\kappa }^{n-1}\\ \vdots &\vdots &\ddots &\vdots \\ 1&{\kappa }^{n-1}&\cdots &{\kappa }^{{(n-1)}^{2}}\end{array}\right)\left(\begin{array}{c}{\zeta }_{q}^{{{{{{\mathrm{out}}}}}}}(0)\\ {\zeta }_{q}^{{{{{{\mathrm{out}}}}}}}({t}_{0})\\ \vdots \\ {\zeta }_{q}^{{{{{{\mathrm{out}}}}}}}((n-1){t}_{0})\end{array}\right)$$
(11)

where \(\kappa ={e}^{\frac{-2\pi i}{n}}\). Generally, we are only interested in the middle harmonic, whose component for antenna q can be calculated by the scalar product \({y}_{{f}_{j}}^{q}={F}_{j}{\bar{\zeta }}_{q}^{{{{{{\mathrm{out}}}}}}}\), where \({F}_{j}=\left(1,{\kappa }^{j},{\kappa }^{2j},...,{\kappa }^{(n-1)j}\right)\) and j corresponds to the desired harmonic. In order to train the readout function by linear regression (which is commonly defined by a linear matrix operation of the form Y = WXT, where W is the weight matrix), we simply compose both operations, multiplying the Fj and weight matrices:

$$Y=W{({F}_{j}X)}^{T}$$
(12)

where \(X=\left({\bar{\zeta }}_{1}^{{{{{{\mathrm{out}}}}}}},{\bar{\zeta }}_{2}^{{{{{{\mathrm{out}}}}}}},...,{\bar{\zeta }}_{q}^{{{{{{\mathrm{out}}}}}}}\right)\). Clearly, just like in a regular extreme learning machine (Y = WXT), the output of the Floquet extreme learning scheme involves a simple multiplication of matrices, without sensitive or complex filters. This can also be viewed as a mere rescaling of the weight matrix of the digital layer.
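A compact sketch of this readout, assuming the raw time traces of the readout antennas are stored row-wise in a matrix X of shape (n_antennas, n_samples), could read as follows; the harmonic index j, the use of the harmonic intensity as feature, and the regularized least-squares solver are illustrative choices.

```python
import numpy as np

def harmonic_row(n, j):
    """Row F_j = (1, kappa^j, kappa^(2j), ..., kappa^((n-1)j)) of the Vandermonde/DFT matrix."""
    kappa = np.exp(-2 * np.pi * 1j / n)
    return kappa ** (j * np.arange(n))

def readout_features(X, j):
    """Project each antenna's time trace onto the desired Floquet harmonic (Eq. 11)."""
    Fj = harmonic_row(X.shape[1], j)
    return np.abs(X @ Fj)                 # one harmonic intensity per antenna

# Training: a single linear layer on top of the harmonic features (Eq. 12),
# so the DFT row can simply be absorbed into a rescaled weight matrix.
def train_weights(feature_list, targets, lam=1e-6):
    H = np.vstack(feature_list)           # (n_examples, n_antennas)
    return np.linalg.solve(H.T @ H + lam * np.eye(H.shape[1]), H.T @ targets)
```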

The relationship between ϕ and ζin is set to \(\phi =\gamma {\bar{\zeta }}^{{{{{{\mathrm{in}}}}}}}\), where \({\bar{\zeta }}^{{{{{{\mathrm{in}}}}}}}\) and γ are the mean of the input vector and a scaling factor, respectively. The value of γ is 1 for learning nonlinear functions and 2π for all other tasks.

For learning nonlinear functions, the abalone dataset, and forecasting chaotic time-series, we used a supervised learning algorithm, linear regression, to train the readout function. The predicted output is compared with the ground truth, and the error is calculated and used to update the weights in the readout network following the linear regression learning rule.

To train the readout network for the classification task (parallel image processing), we used the Python toolkit Keras, which provides a high-level application programming interface to access TensorFlow. A supervised learning algorithm, softmax regression, was used to train the readout network. A softmax function is used as the activation function of the readout network to calculate the probabilities corresponding to the different possible outputs. The cost is calculated using the categorical cross-entropy. A standard gradient-based optimization method is used to minimize the cost function and train the output network. There are several ways of converting images into one-dimensional representations; for simplicity, we used a flattened version of the downsampled images as the input vector.
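A minimal Keras sketch of such a softmax readout, assuming the harmonic-intensity features and one-hot labels of one channel have already been extracted into NumPy arrays, could look as follows; the layer sizes, optimizer, and placeholder data are illustrative and not the exact configuration used in the paper.

```python
import numpy as np
from tensorflow import keras

n_readout, n_classes = 100, 10          # e.g., MNIST channel: 100 readout nodes, 10 digits

# Single trainable dense layer with a softmax activation (the readout network)
model = keras.Sequential([
    keras.layers.Input(shape=(n_readout,)),
    keras.layers.Dense(n_classes, activation="softmax"),
])
model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])

# Placeholder data standing in for the measured harmonic intensities and labels
features = np.random.rand(1000, n_readout)
labels = keras.utils.to_categorical(np.random.randint(n_classes, size=1000), n_classes)

model.fit(features, labels, epochs=20, batch_size=32, validation_split=0.2)
```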