Hardware implementation of Bayesian network building blocks with stochastic spintronic devices

Debashis, Punyashloka; Ostwal, Vaibhav; Faria, Rafatul; Datta, Supriyo; Appenzeller, Joerg; Chen, Zhihong

doi:10.1038/s41598-020-72842-6

Article
Open access
Published: 29 September 2020

Punyashloka Debashis^1,2,3^na1,
Vaibhav Ostwal^1,2^na1,
Rafatul Faria^1,3,
Supriyo Datta¹,
Joerg Appenzeller^1,2 &
…
Zhihong Chen^1,2

Scientific Reports volume 10, Article number: 16002 (2020)

Abstract

Bayesian networks are powerful statistical models for understanding causal relationships in real-world probabilistic problems such as diagnosis, forecasting and computer vision. For systems that involve complex causal dependencies among many variables, the associated Bayesian networks become computationally intractable. As a result, direct hardware implementation of these networks is one promising approach to reducing power consumption and execution time. However, the few hardware implementations of Bayesian networks presented in the literature rely on deterministic CMOS devices that are not efficient in representing the stochastic variables in a Bayesian network that encode the probability of occurrence of the associated event. This work presents an experimental demonstration of a Bayesian network building block implemented with inherently stochastic spintronic devices based on the natural physics of nanomagnets. These devices are based on nanomagnets with perpendicular magnetic anisotropy, initialized to their hard axes by the spin orbit torque from a heavy metal under-layer utilizing the giant spin Hall effect, enabling stochastic behavior. We construct an electrically interconnected network of two stochastic devices and manipulate the correlations between their states by changing connection weights and biases. By mapping given conditional probability tables to the circuit hardware, we demonstrate that any two-node Bayesian network can be implemented by our stochastic network. We then present the stochastic simulation of an example case of a four-node Bayesian network using our proposed device, with parameters taken from the experiment. We view this work as a first step towards the large-scale hardware implementation of Bayesian networks.

Introduction

There has been increasing demand for specialized hardware for unconventional computing tasks where software-based approaches running on general-purpose CPUs cannot efficiently execute the basic steps of the associated stochastic algorithms, such as generating random numbers. For example, a single random number generator based on a CMOS linear feedback shift register requires around 1000 transistors¹. This work focuses on compact hardware that can generate random numbers with controllable probability by using its intrinsic physics and can serve as an efficient building block for Bayesian networks.

Bayesian networks (BNs) are directed graphical models that are used to represent the causal dependencies among stochastic variables². In a BN, each node represents a stochastic variable, whose probability of occurrence is determined by the states of its parent nodes. The dependence between a set of such nodes is given by a conditional probability table (CPT). BNs are traditionally implemented in software aiming at applications in areas such as forecasting, diagnosis, and computer vision³. However, as the complexity of the BNs grows, i.e., as the number of parent nodes affecting the probability of a particular child node becomes large, both the assessment of that child node probability and the inference about the possible cause become impractical⁴. Specifically, as the network size grows, the number of terms in the calculation of the joint probability using the probability chain rule increases rapidly⁴.

Direct representation of Bayesian networks in hardware has been proposed as an alternative way to perform the two above mentioned tasks, i.e., probability assessment and inference. In this case, each “node” in a Bayesian network is represented by a stochastic device, having a distinct probability of being in one of two possible states. This probability is controlled by the input it receives dependent on the states of its parent nodes, through the weights of the connections between them. The CPT is encoded in the weights of these connections. By representing a BN with a hardware network of this kind, the required probability of a particular event is readily obtained by sampling the output of the corresponding stochastic device. Moreover, inference about the possible cause of a particular event can be evaluated by observing the joint distribution of the two stochastic devices corresponding to the “event” node and the particular “cause” node of interest.

Several hardware implementations of BNs have been proposed based on CMOS hardware. For example, Zermani et al.⁵ demonstrated an FPGA-based BN implementation utilizing suitable architectural design and memory allocation schemes. Cai et al.⁶ demonstrated another FPGA-based architectural design along with a suitable pseudo random number generator. Mansinghka et al.⁷ implemented a BN in digital circuits using a novel abstraction. Chakrapani et al.⁸ and Weijia et al.⁹ proposed probabilistic CMOS hardware for BN implementation; however, to our knowledge there has not been an experimental demonstration in the literature. There thus remains an interest in a compact implementation of the stochastic nodes of a BN and their conditional relations.

In this work, we present an experimental demonstration of a spintronics based compact hardware implementation of BNs. The stochastic elements are implemented naturally by a compact device consisting of a perpendicular nanomagnet. The CPTs are translated directly to the connection weights, implemented by resistive connections between such devices.

Unstable nanomagnet based spintronic devices have recently attracted much research interest for probabilistic spin logic (PSL)^{10,11,12,13,14,15,16,17,18,19,20,21,22} and are given the name “p-bit”, which is the short form of “probabilistic bit”. It has been proposed that an inherently unstable nanomagnet can be a natural implementation of the stochastic variable in a BN^{10, 19, 20, 23}. We first present a p-bit implementation using a stochastic spintronic device that has isolated input and output to allow for interconnection in circuitry. The output of such a device is a tunable random number, whose mean is controlled by an electrical input. Then, we build an electrically connected network of two such devices and study the correlation of their outputs for different connections and biases. We show that any CPT can be implemented by changing the connections and biases of this circuit, thus representing a hardware BN building block. Finally, using parameters taken from the experiment, we perform a stochastic Landau-Lifshitz-Gilbert (sLLG) simulation of a four-node BN and compare the results of the forecast with those expected from calculating joint probability distributions.

Experimental results and analysis

Hard axis initialized PMA magnet as p-bit

In our experiment, the stochastic device is based on a hard axis initialized magnet with perpendicular magnetic anisotropy (PMA), whose output probability is controlled by the magnetic field produced by a charge current passing through an isolated metal ring^{15, 16, 18}. The top left of Fig. 1a shows the schematic of our device. It consists of a PMA nanomagnet island, shown in orange, on top of a heavy metal (Ta) Hall bar, shown in blue. It is well understood that the magnetization of a PMA magnet can be deterministically switched by the spin orbit torque (SOT) of a heavy metal under-layer in the presence of a symmetry-breaking in-plane magnetic field^{24, 25}. However, when the spin current density is large enough, and when this field is absent, the magnetization gets pinned in the direction of the spin polarization, i.e. the magnet's hard axis. Once the spin current is removed, ambient thermal noise makes the magnetization relax to either “up” or “down” with equal probability due to the symmetric energy landscape for these two states^{15, 16, 26}, as depicted by the cartoon in the top right of Fig. 1a. The magnetization state is read out by the anomalous Hall effect (AHE), where the transverse voltage $V_{OUT}$ is positive for a magnetization in the “up” direction and negative for “down”. The probability of relaxing back to the “up” or “down” direction can be controlled by applying a small out-of-plane magnetic field that lifts the degeneracy of the energy landscape. A positive field in the z-direction lowers the energy of the “up” state and raises that of the “down” state, thus making the “up” state more favorable. A negative z-directed field does the exact opposite. This is depicted in the energy landscape diagrams shown in the bottom panel of Fig. 1a. This z-directed field is provided by a ring-shaped electrode, called the “Oersted ring” henceforth, shown in yellow in the device schematic.
A current “$I_{IN}$” passing through the Oersted ring of radius “$r$” produces a magnetic field given by $B = \mu_{0} I_{IN} /2r$.
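As a rough illustration of the field scale involved, the sketch below evaluates $B = \mu_{0} I_{IN} /2r$ for a single circular loop. The current and ring radius are hypothetical values chosen only for the example; they are not taken from the experiment.

```python
import math

MU_0 = 4e-7 * math.pi  # vacuum permeability in T*m/A


def oersted_field_tesla(current_a: float, ring_radius_m: float) -> float:
    """Field at the centre of a single circular loop: B = mu_0 * I / (2 r)."""
    return MU_0 * current_a / (2.0 * ring_radius_m)


# Hypothetical numbers (assumptions, not device parameters from the paper).
example_current_a = 1e-3      # 1 mA through the ring
example_radius_m = 2e-6       # 2 micrometre ring radius

field_t = oersted_field_tesla(example_current_a, example_radius_m)
# 1 T corresponds to 1e4 G; in vacuum 1 G corresponds to 1 Oe.
print(f"B = {field_t:.3e} T (about {field_t * 1e4:.2f} Oe in vacuum)")
```

The field scales linearly with current and inversely with radius, so a smaller ring gives more tunability per unit current.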

Figure 1b shows the sLLG simulation of such a device. The top panels show the magnetization dynamics during the pulsing of the device. The current pulse through the GSHE layer is shown in black in both panels. The z-component of magnetization ($m_{Z}$) is shown in blue and red. It can be seen that $m_{Z}$ goes to zero while the current pulse is ON. After the pulse is removed, $m_{Z}$ relaxes to −1 in the first case and to +1 in the second, nominally identical case, highlighting the stochastic nature of the process. The time scale of this relaxation is governed by the material parameters of the nanomagnet such as $M_{S}$, $H_{K}$ and damping. The bottom panel of Fig. 1b shows the average of the magnetization (after the dynamics have settled) in the z-direction (perpendicular easy axis) as a function of the input current, resembling a sigmoidal activation function.

For experimental implementation, starting with a stack of Ta(5 nm)/CoFeB(1 nm)/MgO(2 nm)/Ta(1 nm) thin film, a Hall bar device with a PMA magnetic island located at the center is fabricated by means of successive e-beam lithography and Ar ion milling steps. To generate the out-of-plane field for tunability, the “Oersted ring” is fabricated on top and electrically isolated from the Hall bar by a dielectric layer. A false colored SEM image of the fabricated device is shown in the inset of Fig. 1c.

For the operation of the device, a Keithley 6221 current source is used to provide a current pulse of duration 100 μs through the Ta Hall bar. This current pulse experimentally implements the required hard axis biasing scheme as shown in the sLLG simulation of Fig. 1b. Although the magnet can respond to much faster pulses, as shown in Fig. 1b, we chose to use 100 μs to be safely within the delay times of the measurement circuit. After the pulsing event, the state of the magnetization is read by a lock-in scheme, with a sinusoidal current provided by the same Keithley current source and an SRS830 lock-in amplifier. The device is pulsed repeatedly, and the state of the magnetization is read after each individual pulse. Figure 1c shows the average magnetization as a function of the input current “$I_{IN}$”. Each data point is obtained by averaging 25 pulsing events, as shown for three representative cases in the bottom panels. These measurements clearly demonstrate the successful implementation of a device with an electrical input and output, which behaves stochastically for individual events, but produces a sigmoidal curve for the average output. This is the desired characteristic for many probabilistic spin logic applications including hardware BNs.
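The pulse-and-read behavior described above can be mimicked with a purely behavioral model. The sketch below assumes that the average output follows a tanh-shaped sigmoid of the input current; the current scale `i_0` and all function names are hypothetical, and no magnetization dynamics are simulated.

```python
import math
import random


def pbit_sample(i_in: float, i_0: float, rng: random.Random) -> int:
    """One pulse-and-relax event: returns +1 ('up') or -1 ('down').

    Assumes P(up) = (1 + tanh(i_in / i_0)) / 2, i.e. a sigmoidal mean output.
    """
    p_up = 0.5 * (1.0 + math.tanh(i_in / i_0))
    return 1 if rng.random() < p_up else -1


def average_output(i_in: float, i_0: float, n_pulses: int, rng: random.Random) -> float:
    """Mean of n_pulses independent readouts at a fixed input current."""
    return sum(pbit_sample(i_in, i_0, rng) for _ in range(n_pulses)) / n_pulses


rng = random.Random(0)
for i_in in (-2.0, -1.0, 0.0, 1.0, 2.0):  # input in units of the assumed i_0
    # 25 pulses per point, mirroring the averaging used for Fig. 1c.
    print(f"I_IN = {i_in:+.1f}  <m> = {average_output(i_in, 1.0, 25, rng):+.2f}")
```

With only 25 pulses per point the averages scatter visibly around the underlying sigmoid, which is the behavior expected of individual stochastic events.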

Implementing a two node Bayesian network in hardware

Next, we show how the stochastic devices described in the previous section can be used to implement a two-node Bayesian network in hardware. The essential characteristic of a BN is captured in the CPT. Figure 2a shows the example of a two-node network, with the first or parent node ($m_{1}$) representing the packaging material for blocks of cheese in a dairy farm, and the second node ($m_{2}$) representing the probability of finding a stale cheese block. The values “$a$” and “$b$” in the CPT represent the probability of a cheese block being stale if the packaging material is of low quality ($m_{1}$ = 0) versus high quality ($m_{1}$ = 1). Since the packaging material positively affects the shelf life, in this case, $a$ > $b$. If instead of packaging material, $m_{1}$ represents the print design on the package, then the shelf life is not affected by it, and hence, $a$ = $b$ in this case. Similarly, if some other variable that negatively affects the shelf life is represented by $m_{1}$, then the CPT would have $a$ < $b$. Now, for the first case, if the cheese is stored in cold and dry storage, then the shelf life is increased, irrespective of the packaging material quality. This corresponds to shifting both “$a$” and “$b$” in the CPT by a common offset (here a negative one, since both staleness probabilities decrease). Hence, the variables in the CPT can span the entire space between 0 and 1 independently, depending on the problem being modeled.

We first demonstrate that the CPT between the two probabilistic random variables in our example can be implemented by design of proper electrical connections between two of our stochastic devices (of the type shown in Fig. 1). Then, by testing the circuit with designed parameters, we show that the probability of the output device ($m_{2}$) follows the probability of finding a stale cheese block, obtained from calculating the joint probability distribution. We also show that the inference about the potential cause of stale cheese that is evaluated by Bayes theorem is well matched to the directly observed values from the joint distribution of the device outputs. The results are also verified by stochastic LLG simulations with magnet parameters (M_S, H_K and volume) taken to match the sigmoidal activation function obtained from the experiment.

Figure 2a shows the given CPT that represents the relation between the stochastic variables $m_{1}$ and $m_{2}$. This CPT is translated into the parameters $J_{21}$ and $h_{2}$ of the PSL model as shown in Fig. 2b. This translation can be obtained from the analysis below:

The total input, $I_{2}$ received by the second device is given by:

$$I_{2} = J_{21} m_{1} + h_{2}$$

(1)

where $J_{21}$ corresponds to the connection from the first to the second device, $m_{1}$ corresponds to the state of the first device and $h_{2}$ corresponds to the constant bias given to the second device. As Eq. (1) represents the physical input to node 2 (which is current in our hardware design), $m_{1}$ has to enter as a bipolar value (+1 for the ‘UP’ state and −1 for the ‘DN’ state).

The average state of the second device is given by:

$$m_{2} = \sigma \left( {I_{2} } \right) = \sigma \left( {J_{21} m_{1} + h_{2} } \right)$$

(2)

where σ represents the sigmoidal activation function for device 2. The conditional dependencies can be directly seen from this expression. The probability of $m_{2}$ being high given $m_{1}$ high is obtained by evaluating $m_{2}$ from Eq. (2) by setting $m_{1}$ = 1. Since this probability should match the value specified in the given CPT, we obtain:

$$b = \sigma \left( {J_{21} + h_{2} } \right)$$

(3)

Similarly, ‘$a$’ can be obtained by setting $m_{1}$ = −1 (as the bipolar entry corresponding to $m_{1}$ being ‘DN’ is −1 instead of 0) in Eq. (2):

$$a = \sigma \left( { - J_{21} + h_{2} } \right)$$

(4)

From Eqs. (3) and (4), we obtain the values of the PSL parameters $J_{21}$ and $h_{2}$ from the given CPT as follows:

$$J_{21} = 0.5 \times \left[ {\sigma^{ - 1} \left( b \right) - \sigma^{ - 1} \left( a \right)} \right]$$

(5)

$$h_{2} = 0.5 \times \left[ {\sigma^{ - 1} \left( b \right) + \sigma^{ - 1} \left( a \right)} \right]$$

(6)

The parameters $J_{21}$ and $h_{2}$ are then used to design the hardware connection strengths and biases to two stochastic devices, as will be discussed in the following paragraphs.
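The translation in Eqs. (5) and (6) can be sketched numerically as follows. The specific form of σ used here, $\sigma(x) = (1 + \tanh x)/2$, is an assumption made so that σ returns a probability between 0 and 1; the function names are hypothetical.

```python
import math


def sigma(x: float) -> float:
    """Assumed activation: probability of the 'up' state for normalized input x."""
    return 0.5 * (1.0 + math.tanh(x))


def sigma_inv(p: float) -> float:
    """Inverse of the assumed activation; requires 0 < p < 1."""
    return math.atanh(2.0 * p - 1.0)


def cpt_to_psl(a: float, b: float) -> tuple[float, float]:
    """Map CPT entries (a, b) to (J21, h2) following Eqs. (5) and (6)."""
    j21 = 0.5 * (sigma_inv(b) - sigma_inv(a))
    h2 = 0.5 * (sigma_inv(b) + sigma_inv(a))
    return j21, h2


# Example CPT (hypothetical values): a = P(stale | low quality), b = P(stale | high quality).
j21, h2 = cpt_to_psl(a=0.8, b=0.3)
print(f"J21 = {j21:+.3f}, h2 = {h2:+.3f}")
# Consistency check against Eqs. (3) and (4).
print(f"sigma(+J21 + h2) = {sigma(j21 + h2):.3f}, sigma(-J21 + h2) = {sigma(-j21 + h2):.3f}")
```

For $a > b$ the coupling comes out negative, and for $a = b$ it vanishes, matching the qualitative discussion of the cheese example. Entries of exactly 0 or 1 would require an infinite input under this assumed σ and are excluded.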

Figure 2c shows the schematic of our circuit. The output voltage from the first device is amplified by an LT1677 low noise, rail-to-rail precision Op Amp operating in an open loop configuration. The output level of the Op Amp is determined by its +/−$V_{DD}$ supply voltages. This output is then connected to the Oersted ring of the second device through a weight resistor “$R_{weight}$” that determines how much current passes through it, and hence controls the output probability of the second device, corresponding to the $J_{21}$ term in a BN. Additionally, a voltage source “$V_{bias}$” is connected to the input of the second device through a resistor “$R_{bias}$” to mimic the fixed bias ($h_{2}$) in a BN. The values of the circuit parameters $V_{DD}$, $V_{bias}$, $R_{weight}$ and $R_{bias}$ are obtained from the required $J_{21}$ and $h_{2}$ by the following design analysis:

In our circuit as shown in Fig. 2c, $J_{21}$ is the magnetic field produced by the Oersted ring of device 2, normalized with the field required to saturate its magnetization in the “up” or “down” state, denoted by $B_{0}$. This is given by:

$$J_{21} = \pm \mu_{0} V_{DD} /2rB_{0} R_{weight}$$

(7)

where r is the radius of the Oersted ring, $\mu_{0}$ is the permeability of vacuum and the ± sign depends on the connection polarity. Similarly, $h_{2}$ is the additional magnetic field produced by the constant bias $V_{bias}$, normalized to $B_{0}$.

$$h_{2} = \mu_{0} V_{bias} /2rB_{0} R_{bias}$$

(8)

Note that $h_{2}$ contributions due to the remnant magnetic field in the measurement setup have been subtracted out in this analysis for brevity. This additional $h_{2}$ contribution is just added to the calculated $h_{2}$ in Eq. (8).

Next, we show that the same circuit can capture any given CPT, by changing the $R_{weight}$ and $R_{bias}$. In the circuit shown in Fig. 2c, the total input received by device 2 is given by:

$$I_{2} = \pm \left( {\mu_{0} V_{DD} /2rB_{0} R_{weight} } \right)m_{1} + \left( {\mu_{0} V_{bias} /2rB_{0} R_{bias} } \right)$$

(9)

For $R_{weight}$ = ∞, which means $J_{21}$ = 0, the coefficient of $m_{1}$ in Eq. (9) vanishes, and so does the correlation between the two devices. For a finite $R_{weight}$, the connection polarity dictates the sign of the correlation between the two devices, with a strength inversely proportional to $R_{weight}$. $V_{bias}$ makes the correlation asymmetric as its corresponding term in Eq. (9) does not change sign with the state of $m_{1}$. Therefore, we can span all possible conditional probabilities between two nodes of a BN (given by “$a$” and “$b$” in the CPT) by changing the circuit parameters $R_{weight}$, polarity and $R_{bias}$.
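Inverting Eqs. (7) and (8) gives the resistor values for a target $J_{21}$ and $h_{2}$. The sketch below does this under the same idealization as those equations (the ring resistance and amplifier output impedance are neglected). All numerical values and the function name are hypothetical, and letting the sign of $V_{bias}$ follow the sign of $h_{2}$ is an assumption of this example.

```python
import math

MU_0 = 4e-7 * math.pi  # vacuum permeability in T*m/A


def design_circuit(j21, h2, v_dd, v_bias_mag, ring_radius_m, b0_tesla):
    """Invert Eqs. (7) and (8) for R_weight and R_bias (in ohms).

    k = mu_0 / (2 r B0) is the normalized field per ampere of ring current.
    """
    k = MU_0 / (2.0 * ring_radius_m * b0_tesla)
    r_weight = math.inf if j21 == 0 else k * v_dd / abs(j21)
    r_bias = math.inf if h2 == 0 else k * v_bias_mag / abs(h2)
    return {
        "R_weight": r_weight,
        "polarity": 0 if j21 == 0 else (1 if j21 > 0 else -1),
        "R_bias": r_bias,
        "V_bias": 0.0 if h2 == 0 else math.copysign(v_bias_mag, h2),
        "k": k,
    }


# Hypothetical operating point (assumed values, not from the experiment).
design = design_circuit(j21=-0.5, h2=0.2, v_dd=5.0, v_bias_mag=5.0,
                        ring_radius_m=2e-6, b0_tesla=1e-3)
for name, value in design.items():
    print(f"{name} = {value}")
```

An uncoupled pair ($J_{21}$ = 0) maps to an open connection, consistent with the $R_{weight}$ = ∞ case discussed above.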

Experimental testing of the hardware Bayesian network

We take five different CPTs with “$a$” and “$b$” spanning the range between 0 and 1, shown in Fig. 3a. We then calculate $J_{21}$ and $h_{2}$ for these five cases and design our circuit according to Eqs. (7) and (8). The designed circuits are then tested by repeating a sequential pulsing scheme. The inset of Fig. 2c shows the timing diagram of the measurement procedure. The two devices are pulsed sequentially by a Keithley 6221 current source that provides the clocking scheme for our devices. During the pulsing of the second device, a constant DC read current is passed through the first device in order to generate the input voltage to the second device. Then, this sequential pulsing is repeated to generate the required statistics. The two devices produce random outputs, but with correlated statistics, as is required by the CPT between the two random variables. The output after each pulse is measured by a lock-in amplifier and then digitized. Representative sections of the device outputs are shown in Fig. 3b for three different connection configurations. It is worth noting that the pulsing method followed in the presented experiments (shown in the inset of Fig. 2c) is analogous to Gibbs sampling^{27, 28}, which is widely used for statistical inference^{29, 30}. Here, each node of the network is pulsed (sampled) sequentially under the influence of all the other nodes, which are fixed to their current values.

The probability of finding a stale cheese block can be found from the joint probability distribution by using the probability chain rule:

$$\begin{aligned} P\left( {m_{2} = 1} \right) & = {\Sigma }_{{m_{1} }} P\left( {m_{1} ,m_{2} = 1} \right) \\ & = {\Sigma }_{{m_{1} }} P\left( {m_{2} = 1|m_{1} } \right)*P\left( {m_{1} } \right) \\ & = P\left( {m_{2} = 1|m_{1} = 0} \right)*P\left( {m_{1} = 0} \right) \\ & \quad + \,P\left( {m_{2} = 1|m_{1} = 1} \right)*P\left( {m_{1} = 1} \right) \\ & = a*P\left( {m_{1} = 0} \right) + b*P\left( {m_{1} = 1} \right) \\ \end{aligned}$$

(10)

where $P\left( {m_{1} = 0\ \text{or}\ 1} \right)$ is an input parameter. The number of terms in the above expression grows as $2^{N}$, where $N$ is the number of parent nodes for the particular child node of interest⁴. Instead of performing this algebra, the required probability can be obtained from the circuit by directly observing the stochastic output of device 2 and obtaining its mean value over several pulsing cycles. This luxury of having to observe only the nodes of interest while disregarding all the other nodes is an advantage of using a probabilistic algorithm, versus calculating the probabilities using deterministic rules as discussed by Feynman³¹ and utilized in many sampling schemes²⁷.
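The two routes to $P(m_{2} = 1)$ can be compared in a small numerical sketch: the closed form of Eq. (10) against the mean of sampled outputs. The sampler draws directly from the CPT and stands in for the hardware only at a behavioral level; the CPT values, the prior on $m_{1}$ and the function names are hypothetical.

```python
import random


def marginal_m2(a: float, b: float, p_m1: float) -> float:
    """Eq. (10): P(m2 = 1) = a * P(m1 = 0) + b * P(m1 = 1)."""
    return a * (1.0 - p_m1) + b * p_m1


def sample_pair(a: float, b: float, p_m1: float, rng: random.Random) -> tuple[int, int]:
    """Draw the parent first, then the child conditioned on the parent."""
    m1 = 1 if rng.random() < p_m1 else 0
    m2 = 1 if rng.random() < (b if m1 else a) else 0
    return m1, m2


def estimate_marginal_m2(a, b, p_m1, n_cycles, rng):
    """Fraction of cycles in which the child node reads 1."""
    return sum(sample_pair(a, b, p_m1, rng)[1] for _ in range(n_cycles)) / n_cycles


rng = random.Random(0)
a, b, p_m1 = 0.8, 0.3, 0.5  # hypothetical CPT entries and prior
print("chain rule :", marginal_m2(a, b, p_m1))
print("100 cycles :", estimate_marginal_m2(a, b, p_m1, 100, rng))
print("10^4 cycles:", estimate_marginal_m2(a, b, p_m1, 10_000, rng))
```

Only the child node is observed in the sampling estimate; the parent state never has to be summed over explicitly.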

Similarly, given that a randomly drawn cheese block from a large lot is stale, the probability that it was caused by a low quality packaging material can be found by using Bayes theorem:

$$\begin{aligned} P\left( {m_{1} = 0|m_{2} = 1} \right) & = P\left( {m_{1} = 0,m_{2} = 1} \right)/P\left( {m_{2} = 1} \right) \\ & = P\left( {m_{2} = 1|m_{1} = 0} \right)*P\left( {m_{1} = 0} \right)/P\left( {m_{2} = 1} \right) \\ & = a*P\left( {m_{1} = 0} \right)/\left[ {a*P\left( {m_{1} = 0} \right) + b*P\left( {m_{1} = 1} \right)} \right] \\ \end{aligned}$$

(11)

The number of terms required in the evaluation of the above expression also grows as $\sim 2^{N}$, where $N$ is the number of potential binary causes of a particular effect⁴. However, from the hardware BN, this probability can be directly obtained by observing the joint distribution of states of the two devices. It is to be noted here that this way of performing the inference always involves observing the joint distributions of only two nodes of the BN: nodes corresponding to the effect and the potential cause of interest, irrespective of $N$.
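Inference from observed joint states amounts to counting. The sketch below compares Eq. (11) with the fraction of "stale" observations that coincide with a low-quality parent state. The recorded pairs are generated here by a behavioral sampler rather than measured, and all values and names are hypothetical.

```python
import random


def posterior_exact(a: float, b: float, p_m1: float) -> float:
    """Eq. (11): P(m1 = 0 | m2 = 1) from Bayes' theorem."""
    numerator = a * (1.0 - p_m1)
    return numerator / (numerator + b * p_m1)


def posterior_from_pairs(pairs) -> float:
    """Estimate P(m1 = 0 | m2 = 1) from recorded (m1, m2) outputs."""
    parents_when_stale = [m1 for m1, m2 in pairs if m2 == 1]
    if not parents_when_stale:
        raise ValueError("no cycles with m2 = 1 were observed")
    return parents_when_stale.count(0) / len(parents_when_stale)


def record_pairs(a, b, p_m1, n_cycles, rng):
    """Behavioral stand-in for the two-device readout (parent, then child)."""
    pairs = []
    for _ in range(n_cycles):
        m1 = 1 if rng.random() < p_m1 else 0
        m2 = 1 if rng.random() < (b if m1 else a) else 0
        pairs.append((m1, m2))
    return pairs


rng = random.Random(0)
a, b, p_m1 = 0.8, 0.3, 0.5  # hypothetical CPT entries and prior
print("Bayes' theorem:", posterior_exact(a, b, p_m1))
print("from counting :", posterior_from_pairs(record_pairs(a, b, p_m1, 10_000, rng)))
```

The counting estimate uses only the two recorded nodes, which is the property highlighted in the text.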

In our experiment, after 100 pulsing cycles, the output probabilities obtained for all five circuits (representing the five different CPTs of Fig. 3a) are comparable with the expectation from calculating the joint probability distribution and are also verified by stochastic LLG simulations, as shown in Fig. 3c. Similarly, the probabilities obtained from inference are comparable with those from Bayes' theorem and stochastic LLG simulations, as seen in Fig. 3d.

Simulation of a four node Bayesian network

In this section, we present a self-consistently coupled sLLG simulation of the more complicated, four node Bayesian network shown in the top left inset of Fig. 4a. Here, the BN consists of four nodes: cloud ($C$), rain ($R$), sprinkler ($S$), and wetness of grass ($W$). In this case, the evaluation of a node probability from the joint probability distribution requires the following evaluation, for example for the $W$ node:

$$P\left( W \right) = \sum_{C} \sum_{R} \sum_{S} P\left( {C,R,S,W} \right) = \sum_{C} \sum_{R} \sum_{S} P\left( C \right)P\left( {R|C} \right)P\left( {S|C} \right)P\left( {W|R,S} \right)$$

(12)

Here the number of terms to be evaluated in the summation is eight, as each of the $C$, $R$ and $S$ nodes can take two possible values, “0” or “1”. Similarly, performing inference, for example finding the probability that it had rained given that the grass is wet, requires the following evaluation:

$$P\left( {R{|}W} \right) = P\left( {R,W} \right)/P\left( W \right)$$

(13)

where both the numerator and the denominator of the right-hand side of the above equation must be evaluated by summing over the joint probability distribution $P\left( {C,R,S,W} \right)$, resulting in the evaluation of four and eight terms respectively. However, by using the hardware, the required node probabilities and the inference can be obtained in exactly the same way as our previous two-node example: we simply observe the stochastic output of the corresponding node for probability assessment; and observe the joint distribution of only the $R$ and the $W$ node to perform the required inference. This is demonstrated in the simulation study below.
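For reference, the brute-force evaluation of Eqs. (12) and (13) can be written as an explicit enumeration of the joint distribution. The CPT values below are hypothetical textbook-style numbers chosen for illustration; they are not the values used in Fig. 4c.

```python
from itertools import product

# Hypothetical CPTs (assumed values, not those of the paper).
P_C = 0.5                                   # P(C = 1)
P_R_GIVEN_C = {0: 0.2, 1: 0.8}              # P(R = 1 | C)
P_S_GIVEN_C = {0: 0.5, 1: 0.1}              # P(S = 1 | C)
P_W_GIVEN_RS = {(0, 0): 0.0, (0, 1): 0.9,   # P(W = 1 | R, S)
                (1, 0): 0.9, (1, 1): 0.99}


def bernoulli(p_one: float, value: int) -> float:
    """Probability of a binary value given P(value = 1)."""
    return p_one if value else 1.0 - p_one


def joint(c: int, r: int, s: int, w: int) -> float:
    """P(C, R, S, W) = P(C) P(R|C) P(S|C) P(W|R,S)."""
    return (bernoulli(P_C, c) * bernoulli(P_R_GIVEN_C[c], r)
            * bernoulli(P_S_GIVEN_C[c], s) * bernoulli(P_W_GIVEN_RS[(r, s)], w))


def prob_wet() -> float:
    """Eq. (12): eight terms, one per (C, R, S) combination."""
    return sum(joint(c, r, s, 1) for c, r, s in product((0, 1), repeat=3))


def prob_rain_given_wet() -> float:
    """Eq. (13): four terms in the numerator, eight in the denominator."""
    numerator = sum(joint(c, 1, s, 1) for c, s in product((0, 1), repeat=2))
    return numerator / prob_wet()


print(f"P(W)   = {prob_wet():.4f}")
print(f"P(R|W) = {prob_rain_given_wet():.4f}")
```

Each additional parent doubles the number of terms in these sums, which is the scaling the hardware approach is meant to sidestep.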

The parameters used in the sLLG simulation platform such as the magnet dimensions and the output sigmoidal response are benchmarked with the experimental results from the device in Fig. 1c. The coupling and biases are benchmarked with the two node BN network experiments shown in Figs. 2 and 3.

Figure 4a shows the circuit implementation, where each node is represented by a hardware p-bit as described in Fig. 1. It is to be noted here that an auxiliary p-bit (represented by node ‘X’) is needed to implement this four-node Bayesian network. This is because the CPT capturing the dependency of node ‘$W$’ on nodes ‘$R$’ and ‘$S$’ has four conditional probabilities, which can take any value between 0 and 1 independently of each other. Therefore, from basic principles of linear algebra, we need four independent physical parameters to implement this CPT. Two of the four required parameters are provided by the two interconnection weights ($J_{WR}$ and $J_{WS}$) and another parameter is provided by the bias to the node ‘$W$’ ($h_{W}$). The remaining parameter is provided by the interconnection to the auxiliary node ‘X’. The requirement of auxiliary nodes in designing Bayesian networks from p-bits is described in more detail by Faria et al.¹⁹ It is to be noted here that we have applied a systematic approach that directly translates a given conditional probability table into coupling weights and biases in the Bayesian network involving auxiliary nodes. This basic topology could result in a large number of p-bits as the number of nodes increases. In general, there are algorithms to train this type of Bayesian/belief network with a predefined number of nodes³². The main focus of this paper is not the algorithms; rather, this paper proposes an efficient hardware implementation of inference from a Bayesian network with given weights and biases.
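The counting argument can be made concrete. With only $J_{WR}$, $J_{WS}$ and $h_{W}$, the four required inputs $x_{rs} = \sigma^{-1}(p_{rs})$ must satisfy $x_{rs} = J_{WR} r + J_{WS} s + h_{W}$ for bipolar $r, s = \pm 1$, which forces $x_{++} + x_{--} = x_{+-} + x_{-+}$. The sketch below evaluates the violation of this condition for a given CPT. The form of σ is an assumption, the names are hypothetical, and this only illustrates why a fourth parameter is needed; it is not the auxiliary-node construction of Faria et al.

```python
import math


def sigma(x: float) -> float:
    """Assumed activation returning a probability."""
    return 0.5 * (1.0 + math.tanh(x))


def sigma_inv(p: float) -> float:
    """Inverse of the assumed activation; requires 0 < p < 1."""
    return math.atanh(2.0 * p - 1.0)


def three_parameter_mismatch(cpt: dict) -> float:
    """x(+,+) + x(-,-) - x(+,-) - x(-,+) for a CPT keyed by bipolar (r, s).

    Zero means two weights plus one bias suffice; nonzero means they do not.
    """
    x = {key: sigma_inv(p) for key, p in cpt.items()}
    return x[(1, 1)] + x[(-1, -1)] - x[(1, -1)] - x[(-1, 1)]


# A CPT generated from three parameters is representable by construction.
j_wr, j_ws, h_w = 0.4, -0.3, 0.1  # hypothetical values
representable = {(r, s): sigma(j_wr * r + j_ws * s + h_w)
                 for r in (-1, 1) for s in (-1, 1)}

# An OR-like CPT (hypothetical values) generally is not.
or_like = {(-1, -1): 0.05, (1, -1): 0.9, (-1, 1): 0.9, (1, 1): 0.99}

print("representable CPT mismatch:", three_parameter_mismatch(representable))
print("OR-like CPT mismatch      :", three_parameter_mismatch(or_like))
```

A nonzero mismatch signals that the four CPT entries cannot be reproduced by three parameters alone, so an extra degree of freedom such as the coupling to node ‘X’ is required.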

The dynamics of the PMA magnet used in the hardware p-bit design is captured by solving the sLLG equation with a monodomain macrospin assumption:

$$\left( {1 + \alpha^{2} } \right)\frac{{d\hat{m}}}{dt} = - \left| \gamma \right|\hat{m} \times \vec{H} - \alpha \left| \gamma \right|\hat{m} \times \hat{m} \times \vec{H} - \frac{1}{{qN_{s} }}\hat{m} \times \hat{m} \times \overrightarrow {{I_{s} }} + \frac{\alpha }{{qN_{s} }}\hat{m} \times \overrightarrow {{I_{s} }}$$

(14)

where $\vec{H}$ is the total internal and external field along with the thermal noise field, $\overrightarrow {{I_{s} }}$ is the spin current, $N_{s} = M_{s} V$ is the total magnetic moment with $M_{s}$ being the saturation magnetization, $\alpha$ is the damping coefficient, and $\gamma$ is the gyromagnetic ratio. Magnet parameters used in the simulation are: $H_{k} = 200$ Oe, $M_{s} = 1000$ emu/cc, $D_{1} = 1\,\mu{\text{m}}$, $D_{2} = 3\,\mu{\text{m}}$, $t = 1\,{\text{nm}}$, $\alpha = 0.1$. The average magnetization of each p-bit can be approximated by $m_{z} = \tanh\left( {\frac{H}{{H_{0} }}} \right)$, where $H$ is the Oersted field generated from the current coil and $H_{0}$ is a fitting parameter.

For the system simulation, we start with chosen CPTs for each of the nodes. These are shown as the inputs next to the respective nodes in Fig. 4c. These values are then translated into coupling term $J_{ij}$ and bias term $h_{i}$ by following similar principles as in deriving Eqs. (5) and (6). The derivation for $J_{ij}$ and $h_{i}$ for an n-node Bayesian network is provided by Faria et al.¹⁹ The dimensionless terms $J_{ij}$ and $h_{i}$ are then translated to corresponding Oersted fields to each p-bit by a relation:

$$H_{i} = H_{0} \left( {\sum_{j} J_{ij} m_{j} + h_{i} } \right)$$

(15)

The coupling and bias component of $H_{i}$ can be realized through the coupling resistance $R_{weight}$ and $R_{bias}$ respectively with a mapping principle as described in Eqs. (7) and (8) for the two node case.

While solving the coupled sLLG, each p-bit is put along the hard axis by the GSHE current in a sequential order from parent to child node, and the magnetizations of all p-bits are recorded after their corresponding pulse is turned off. It is worth noting that the pulse sequence is important for the proper operation of the Bayesian network. The pulsing should start from the first node and move down the hierarchy from parent to corresponding child nodes. The order of pulsing among different nodes on the same hierarchy level (e.g. nodes $R$ and $S$ in our example) is not critical. Taking these principles into account, the pulsing order for one cycle is shown in Fig. 4b. This cycle is repeated several times to generate the probabilities of each of the four nodes. Figure 4c shows representative data of the magnetization of each node for 50 pulses. From the distribution of the magnetization state of each node between the ‘UP’ and ‘DN’ states, the probabilities of each node are calculated. For example, the magnetization of the p-bit corresponding to the ‘sprinkler’ node shows more occurrences in the ‘DN’ state than in the ‘UP’ state, resulting in a low probability of the sprinkler being ON ($P\left( S \right)$ ~ 0.25 in this case). Similarly, the probability of ‘rain’, $P\left( R \right)$, and the probability of ‘grass being wet’, $P\left( W \right)$, are obtained from the magnetization state distribution. The obtained probabilities are compared with those obtained by calculating the joint probability distribution, as shown in the output tables alongside each of the four nodes in Fig. 4c. It can be seen that the probabilities obtained from the coupled sLLG result match well with the simple PSL behavioral model and with the values obtained from the evaluation of Eq. (12). Similarly, the probability of rain given that the grass is wet, $P(R|W)$, obtained from the coupled sLLG result is 0.73, which is well matched with the value of 0.75 obtained from Eq. (13).
It is to be noted that the accuracy in this depends on the number of samples taken to calculate the probabilities.
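The parent-to-child pulsing order and the dependence of accuracy on sample count can be illustrated with a behavioral sketch. Each "pulse" below draws a node state directly from its CPT given the already-settled parent states, so neither the magnet dynamics nor the auxiliary node is modeled. The CPT values are hypothetical and are not those of Fig. 4c. For an estimated probability $p$ from $n$ cycles, the binomial standard error $\sqrt{p(1-p)/n}$ gives the expected scatter.

```python
import math
import random

# Hypothetical CPTs (assumed values, not those of the paper).
P_C = 0.5
P_R_GIVEN_C = {0: 0.2, 1: 0.8}
P_S_GIVEN_C = {0: 0.5, 1: 0.1}
P_W_GIVEN_RS = {(0, 0): 0.0, (0, 1): 0.9, (1, 0): 0.9, (1, 1): 0.99}


def run_cycle(rng: random.Random) -> tuple[int, int, int, int]:
    """One pulsing cycle in parent-to-child order: C, then R and S, then W."""
    c = int(rng.random() < P_C)
    r = int(rng.random() < P_R_GIVEN_C[c])   # R and S share a hierarchy level,
    s = int(rng.random() < P_S_GIVEN_C[c])   # so their relative order is free.
    w = int(rng.random() < P_W_GIVEN_RS[(r, s)])
    return c, r, s, w


def estimate(n_cycles: int, rng: random.Random) -> tuple[float, float, float]:
    """Return (P(W), P(R|W), standard error of P(W)) from n_cycles samples."""
    samples = [run_cycle(rng) for _ in range(n_cycles)]
    n_wet = sum(w for *_, w in samples)
    n_rain_and_wet = sum(1 for _, r, _, w in samples if r and w)
    p_w = n_wet / n_cycles
    p_r_given_w = n_rain_and_wet / n_wet if n_wet else float("nan")
    std_err = math.sqrt(p_w * (1.0 - p_w) / n_cycles)
    return p_w, p_r_given_w, std_err


rng = random.Random(0)
for n in (50, 500, 5000):
    p_w, p_r_w, err = estimate(n, rng)
    print(f"n = {n:5d}  P(W) = {p_w:.3f} +/- {err:.3f}  P(R|W) = {p_r_w:.3f}")
```

The standard error shrinks as $1/\sqrt{n}$, so a run of 50 pulses can be expected to deviate from the exact value by several percentage points.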

Circuit implications and improvements

Previously proposed hardware implementations of Bayesian networks have used CMOS-based pseudo random number generators realized with XOR-SHIFT circuits⁷ or RAM-based Linear Feedback Gaussian Random Number Generators^{5, 6} that require a large area footprint. What we have demonstrated here is a compact true random number generator (TRNG) capable of operating at a few hundred MHz. The speed of producing statistically correct random numbers by the device, and hence the latency of the network, is limited by the time required for SOT hard axis initialization and magnetization relaxation after removal of SOT, which is shown to be around 30 ns in the sLLG time plot panels of Fig. 1b and can be made to be < 5 ns by proper nanomagnet design¹⁵. Compared to previously demonstrated spin-based TRNGs^{33,34,35}, this implementation employs a different scheme to generate random numbers. In our approach, any applied current that is larger than that required for hard axis initialization of the magnet will result in the generation of a random number with the correct statistics once the current pulse is removed^{18, 23}. Hence, in a large network, the device-to-device variation in the required current can be easily mitigated by choosing the largest value of the required current among all devices. Possible variations in the shape and offset of the sigmoidal activation function of our devices can be controlled by appropriately choosing the parameters $B_{0}$ and $h_{2}$ while translating the given CPTs into the connection weights, as shown in Eqs. (5–8). Also note that the Bayesian network proposed here does not require analog voltage sources or CMOS MUX to realize the CPT as proposed previously by Shim et al.²³ Using current-controlled tunability of the device and auxiliary nodes, any CPT can be realized by using only p-bits, one voltage level ($V_{DD}$) and analog memristive elements for interconnections and individual biases, similar to RRAM-based neural networks.
Such programmable analog memristive elements have been successfully demonstrated recently^{36, 37}. The energy requirement of the device demonstrated here can be improved by using the voltage-controlled magnetism (VCM) effect for hard-axis initialization as proposed by Scott et al.³⁸ in their benchmarking study (section IV of the main text). In addition, employing magnetic tunnel junctions (MTJs) instead of AHE can eliminate the need for OP-AMPs for readout. The typical difference in the two stable resistive states of an MTJ is of the order of 10 kΩ, whereas in case of AHE, it is a few ohms for standard material stacks. This allows the elimination of the OP-AMPs for readout. Implementations of an MTJ based readout scheme, where the state of the free layer magnet is converted to a voltage by a potential divider formed by the MTJ and a normal resistor was presented by Camsari et al.¹³ (figure 3 of the main text) and Hassan et al.³⁹ (figure 4 of the main text). In these references, the voltage swing generated at the output is large enough to be converted to a “rail-to-rail” swing by a single inverter. In the above references, the MTJ free layer was designed to be a low barrier magnet, but the analysis of the output swing remains unchanged for our hard axis initialization scheme with stable magnets. The power dissipated for reading the MTJ resistive state and amplifying the output signal through the inverter would be the same for the device presented in this study when scaled to similar dimensions. The additional power required for hard axis initialization of the nanomagnet is similar to that required for deterministic switching in SOT-MRAM applications as the required currents are comparable⁴⁰. This power requirement could be much smaller if the hard axis initialization is done via VCM, as evaluated by Scott et al.³⁸ (Table 1 of their main text). 
Finally, the tunability obtained through Oersted field in this work can be replaced by a more efficient STT mechanism, which consumes similar currents as required for hard-axis initialization⁴¹.
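
The timing, current-selection and readout arguments in this section can be checked with a few lines of arithmetic. The Python sketch below is illustrative only and is not part of the reported experiment. It assumes that one random number is produced per initialize-and-relax cycle, so the cycle time bounds the sample rate. The per-device currents, the supply voltage and the two MTJ resistances are hypothetical values, chosen only so that the two resistive states differ by 10 kΩ, and the function names are our own.

```python
import math

def max_sample_rate_hz(cycle_time_s):
    """Upper bound on samples per second, assuming one sample per full cycle."""
    return 1.0 / cycle_time_s

def shared_init_current(required_currents_a):
    """One pulse amplitude that initializes every device: the largest requirement."""
    return max(required_currents_a)

def divider_output(v_dd, r_mtj, r_ref):
    """Midpoint voltage of V_DD -> MTJ -> output node -> reference resistor -> ground."""
    return v_dd * r_ref / (r_mtj + r_ref)

# Cycle times quoted in the text: about 30 ns now, below 5 ns with redesign.
rate_now = max_sample_rate_hz(30e-9)       # roughly 33 MHz
rate_designed = max_sample_rate_hz(5e-9)   # roughly 200 MHz

# Hypothetical per-device initialization currents in amperes (not measured data).
currents = [1.8e-3, 2.1e-3, 1.9e-3]
pulse = shared_init_current(currents)

# Hypothetical MTJ resistances whose difference is 10 kOhm, and an assumed supply.
V_DD, R_LOW, R_HIGH = 1.0, 10e3, 20e3
# The geometric mean of the two states maximizes the divider swing.
R_REF = math.sqrt(R_LOW * R_HIGH)
v_high = divider_output(V_DD, R_LOW, R_REF)   # MTJ in its low-resistance state
v_low = divider_output(V_DD, R_HIGH, R_REF)   # MTJ in its high-resistance state
swing = v_high - v_low

if __name__ == "__main__":
    print(f"rate at 30 ns: {rate_now / 1e6:.1f} MHz")
    print(f"rate at 5 ns:  {rate_designed / 1e6:.1f} MHz")
    print(f"shared pulse:  {pulse * 1e3:.2f} mA")
    print(f"divider swing: {swing:.3f} V around V_DD/2")
```

With these assumed values the two output levels fall on either side of $V_{DD}/2$, which is the situation in which a single inverter could restore a rail-to-rail signal. The actual swing depends on the real MTJ resistances and the chosen reference resistor.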

Conclusion

We have experimentally demonstrated that by connecting two stochastic spintronic devices and designing the connection and bias parameters, BN building blocks can be implemented in hardware. By implementing BNs using such hardware, both probability assessment and inference can be performed by sampling the output of only the relevant nodes. Using experimentally benchmarked sLLG simulations, we have shown that a four node BN implemented in hardware using the presented stochastic devices can generate probabilities that are well matched to the theoretical values from calculating the joint probability distribution. This demonstration serves as a step towards building large scale hardware systems for implementing Bayesian networks.
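
As a software analogue of sampling only the relevant nodes, the sketch below draws samples from a two-node network A → B and compares the sampled estimates with the values obtained from the joint probability distribution. It is a minimal illustration under stated assumptions, not the device model or the sLLG simulation used in this work: the CPT entries are hypothetical, and Python's pseudo-random generator stands in for the stochastic devices.

```python
import random

# Hypothetical CPT for a two-node network A -> B (illustrative values only).
P_A = 0.3                         # P(A = 1)
P_B_GIVEN_A = {0: 0.2, 1: 0.9}    # P(B = 1 | A = a)

def sample_network(rng):
    """Draw one joint sample (a, b), parent first and then child."""
    a = 1 if rng.random() < P_A else 0
    b = 1 if rng.random() < P_B_GIVEN_A[a] else 0
    return a, b

def estimate(n_samples, seed=0):
    """Estimate P(B=1) and P(A=1 | B=1) by counting sampled node states."""
    rng = random.Random(seed)
    count_b = 0
    count_a_and_b = 0
    for _ in range(n_samples):
        a, b = sample_network(rng)
        count_b += b
        count_a_and_b += a & b
    p_b = count_b / n_samples
    # Inference: keep only the samples in which the evidence B = 1 was observed.
    p_a_given_b = count_a_and_b / count_b if count_b else float("nan")
    return p_b, p_a_given_b

def exact():
    """Same quantities computed directly from the joint distribution."""
    p_b = P_A * P_B_GIVEN_A[1] + (1 - P_A) * P_B_GIVEN_A[0]
    p_a_given_b = P_A * P_B_GIVEN_A[1] / p_b
    return p_b, p_a_given_b

if __name__ == "__main__":
    print("sampled:", estimate(200_000))
    print("exact:  ", exact())
```

Probability assessment corresponds to reading the child node alone, while inference corresponds to reading the parent node conditioned on the observed state of the child. The sampled estimates approach the exact values as the number of samples grows.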

References

Borders, W. A. et al. Integer factorization using stochastic magnetic tunnel junctions. Nature 573, 390–393 (2019).
Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference (Elsevier, Amsterdam, 2014).
Heckerman, D., Mamdani, A. & Wellman, M. P. Real-world applications of Bayesian networks. Commun. ACM 38, 24–26 (1995).
Heckerman, D. & Breese, J. S. Causal independence for probability assessment and inference using Bayesian networks. IEEE Trans Syst. Man Cybern. Part A Syst. Hum. 26, 826–831 (1996).
Zermani, S., Dezan, C., Chenini, H., Diguet, J.-P. & Euler, R. FPGA implementation of Bayesian network inference for an embedded diagnosis. In 2015 IEEE Conference on Prognostics and Health Management (PHM) 1–10 (IEEE, 2015). https://doi.org/10.1109/ICPHM.2015.7245057.
Cai, R. et al. VIBNN: hardware acceleration of Bayesian neural networks. In Proceedings of the 23rd International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS ’18 vol. 53 476–488 (ACM Press, 2018).
Mansinghka, V. K., Jonas, E. M. & Tenenbaum, J. B. Stochastic Digital circuits for probabilistic inference. In Massachusetts Institute of Technology, Technical Report MITCSAIL-TR 2069 (2008).
Chakrapani, L. N., Korkmaz, P., Akgul, B. E. S. & Palem, K. V. Probabilistic system-on-a-chip architectures. ACM Trans. Des. Autom. Electron. Syst. 12, 29 (2007).
Weijia, Z., Ling, G. W. & Seng, Y. K. PCMOS-based hardware implementation of Bayesian network. In 2007 IEEE Conference on Electron Devices and Solid-State Circuits 337–340 (IEEE, 2007). https://doi.org/10.1109/EDSSC.2007.4450131.
Behin-Aein, B., Diep, V. & Datta, S. A building block for hardware belief networks. Sci. Rep. 6, 29893 (2016).
Debashis, P. et al. Experimental demonstration of nanomagnet networks as hardware for Ising computing. Tech. Dig. Int. Electron Devices Meet. IEDM 34.3.1–34.3.4 (2016) https://doi.org/10.1109/IEDM.2016.7838539.
Sutton, B., Camsari, K. Y., Behin-Aein, B. & Datta, S. Intrinsic optimization using stochastic nanomagnets. Sci. Rep. 7, 44370 (2017).
Camsari, K. Y., Faria, R., Sutton, B. M. & Datta, S. Stochastic p-bits for invertible logic. Phys. Rev. X 7, 031014 (2017).
Faria, R., Camsari, K. Y. & Datta, S. Low-barrier nanomagnets as p-bits for spin logic. IEEE Magn. Lett. 8, 1–5 (2017).
Debashis, P., Faria, R., Camsari, K. Y. & Chen, Z. Design of stochastic nanomagnets for probabilistic spin logic. IEEE Magn. Lett. 9, 1–5 (2018).
Debashis, P. & Chen, Z. Experimental demonstration of a spin logic device with deterministic and stochastic mode of operation. Sci. Rep. 8, 11405 (2018).
Debashis, P. & Chen, Z. Tunable random number generation using single superparamagnet with perpendicular magnetic anisotropy. In 2018 76th Device Research Conference (DRC) 1–2 (IEEE, 2018). https://doi.org/10.1109/DRC.2018.8442154.
Ostwal, V., Debashis, P., Faria, R., Chen, Z. & Appenzeller, J. Spin-torque devices with hard axis initialization as Stochastic Binary Neurons. Sci. Rep. 8, 16689 (2018).
Faria, R., Camsari, K. Y. & Datta, S. Implementing Bayesian networks with embedded stochastic MRAM. AIP Adv. 8, 045101 (2018).
Hassan, O., Camsari, K. Y. & Datta, S. Voltage-driven building block for hardware belief networks. IEEE Des. Test 36, 15–21 (2019).
Camsari, K. Y., Chowdhury, S. & Datta, S. Scalable emulation of sign-problem–free Hamiltonians with room-temperature p-bits. Phys. Rev. Appl. 12, 034061 (2019).
Debashis, P., Upadhyaya, P. & Chen, Z. Electrical annealing and stochastic resonance in superparamagnets for oscillatory networks with dynamic connectivity. Bull. Am. Phys. Soc. 2019, S39-011 (2019).
Shim, Y., Chen, S., Sengupta, A. & Roy, K. Stochastic spin-orbit torque devices as elements for Bayesian inference. Sci. Rep. 7, 14101 (2017).
Liu, L., Lee, O. J., Gudmundsen, T. J., Ralph, D. C. & Buhrman, R. A. Current-induced switching of perpendicularly magnetized magnetic layers using spin torque from the spin Hall effect. Phys. Rev. Lett. 109, 096602 (2012).
Miron, I. M. et al. Perpendicular switching of a single ferromagnetic layer induced by in-plane current injection. Nature 476, 189–193 (2011).
Bhowmik, D., You, L. & Salahuddin, S. Spin Hall effect clocking of nanomagnetic logic without a magnetic field. Nat. Nanotechnol. 9, 59–63 (2014).
Haykin, S. et al. Neural Networks and Learning Machines 3rd edn (Prentice Hall, New York, 2009).
Geman, S. & Geman, D. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 6, 721–741 (1984).
Gelfand, A. E., Hills, S. E., Racine-Poon, A. & Smith, A. F. M. Illustration of Bayesian inference in normal data models using Gibbs sampling. J. Am. Stat. Assoc. 85, 972–985 (1990).
Yildirim, I. Bayesian Inference: Gibbs Sampling. http://nlp.jbnu.ac.kr/PGM/slides_other/GibbsSampling.pdf (2012). Accessed 1 Mar 2020.
Feynman, R. P. Simulating physics with computers. Int. J. Theor. Phys. 21, 467–488 (1982).
Neal, R. M. Connectionist learning of belief networks. Artif. Intell. 56, 71–113 (1992).
Fukushima, A. et al. Spin dice: a scalable truly random number generator based on spintronics. Appl. Phys. Express 7, 083001 (2014).
Choi, W. H. et al. A Magnetic Tunnel Junction based True Random Number Generator with conditional perturb and real-time output probability tracking. In 2014 IEEE International Electron Devices Meeting 12.5.1–12.5.4 (IEEE, 2014). https://doi.org/10.1109/IEDM.2014.7047039.
Vodenicarevic, D. et al. Low-energy truly random number generation with superparamagnetic tunnel junctions for unconventional computing. Phys. Rev. Appl. 8, 054045 (2017).
The memristor revisited. Nat. Electron. 1, 261 (2018).
Choi, S. et al. SiGe epitaxial memory for neuromorphic computing with reproducible high performance based on engineered dislocations. Nat. Mater. 17, 335–340 (2018).
Scott, W. et al. Hybrid piezoelectric-magnetic neurons. In Proceedings of the ACMSE 2018 Conference 7 (ACM Press, 2018). https://doi.org/10.1145/3190645.3190688.
Hassan, O., Faria, R., Camsari, K. Y., Sun, J. Z. & Datta, S. Low-barrier magnet design for efficient hardware binary stochastic neurons. IEEE Magn. Lett. 10, 1–5 (2019).
Garello, K. et al. SOT-MRAM 300MM Integration for Low Power and Ultrafast Embedded Memories. In 2018 IEEE Symposium on VLSI Circuits 81–82 (IEEE, 2018). https://doi.org/10.1109/VLSIC.2018.8502269.
Sengupta, A., Choday, S. H., Kim, Y. & Roy, K. Spin orbit torque based electronic neuron. Appl. Phys. Lett. 106, 143701 (2015).

Acknowledgements

This work was supported by the Center for Probabilistic Spin Logic for Low-Energy Boolean and Non-Boolean Computing (CAPSL), one of the Nanoelectronic Computing Research (nCORE) Centers as task 2759.003 and 2759.004, a Semiconductor Research Corporation (SRC) program sponsored by the NSF through CCF 1739635.

Author information

These authors contributed equally: P. Debashis and V. Ostwal.

Authors and Affiliations

School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907, USA
Punyashloka Debashis, Vaibhav Ostwal, Rafatul Faria, Supriyo Datta, Joerg Appenzeller & Zhihong Chen
Birck Nanotechnology Center, Purdue University, West Lafayette, IN, 47907, USA
Punyashloka Debashis, Vaibhav Ostwal, Joerg Appenzeller & Zhihong Chen
Intel Corporation, Hillsboro, OR, 97124, USA
Punyashloka Debashis & Rafatul Faria

Contributions

P.D. performed the experiments with V.O.’s help. R.F. performed the numerical simulations. S.D., J.A. and Z.C. supervised the project and helped in the analysis of the results. P.D. wrote the manuscript with inputs from all authors. All authors reviewed the manuscript.

Corresponding author

Correspondence to Zhihong Chen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Debashis, P., Ostwal, V., Faria, R. et al. Hardware implementation of Bayesian network building blocks with stochastic spintronic devices. Sci Rep 10, 16002 (2020). https://doi.org/10.1038/s41598-020-72842-6

Received: 16 May 2020
Accepted: 07 September 2020
Published: 29 September 2020
DOI: https://doi.org/10.1038/s41598-020-72842-6
