\setstackEOL
\setstackgap

L \SetBgContents\LongstackPREPRINT - Accepted as PhD Forum at VLSI-SoC 2024\SetBgPosition4.5cm,1cm\SetBgOpacity1.0\SetBgAngle0\SetBgScale1.8

Resistive Memory for Computing and Security: Algorithms, Architectures, and Platforms

Simranjeet Singh1, Farhad Merchant2, Sachin Patkar1
1IIT Bombay, India,2Newcastle University, UK
{simranjeet, patkar}@ee.iitb.ac.in, farhad.merchant@newcastle.ac.uk

Abstract

Resistive random-access memory (RRAM) is gaining popularity due to its ability to offer computing within the memory and its non-volatile nature. The unique properties of RRAM, such as binary switching, multi-state switching, and device variations, can be leveraged to design novel techniques and algorithms. This thesis proposes a technique for utilizing RRAM devices in three major directions: i) digital logic implementation, ii) multi-valued computing, and iii) hardware security primitive design. We proposed new algorithms and architectures and conducted experimental studies on each implementation. Moreover, we developed the electronic design automation framework and hardware platforms to facilitate these experiments.

Index Terms:

RRAM, digital logic-in-memory, multi-valued logic, hardware security

I Motivation

The growing popularity of resistive random-access memory (RRAM) is fueled by its remarkable capability to store multi-bit and embed computing functionalities directly within the memory. RRAM’s distinctive properties, particularly its ability to exhibit multi-state behavior and stochasticity, open up new avenues for innovation in the realm of algorithm, architecture, and security design. RRAMs generally consist of a transition metal oxide (TMO) layer sandwiched between top and bottom electrodes in a metal-insulator-metal configuration. The resistance of the TMO layer can modulated using external electrical signals, facilitating data storage in distinct resistive states. Typically, RRAM devices operate in a binary switching mode, where they exhibit two distinct states: a low-resistance state (LRS) representing logic ‘1’ and a high-resistance state (HRS) representing logic ‘0’ [1]. However, the multi-state mode can be achieved by inducing a gradual change in the RESET voltage, leading to a progressive increase in resistance between LRS and HRS rather than an abrupt transition. It is important to note that RRAM devices have a significant impact on the resistance state of the device due to the device-to-device (D2D) and cycle-to-cycle (C2C) variations [2].

Taking into account the Boolean, multi-state switching, and stochastic nature of RRAM, this thesis endeavors to harness the full potential of RRAM by introducing novel methodologies for its utilization. The thesis emphasizes i) digital logic-in-memory (LiM), ii) multi-valued logic (MVL) computing, and iii) the design of hardware security primitives. Additionally, we created an electronic design automation (EDA) framework and hardware prototype centered on RRAM to facilitate experimentation. The thesis explores RRAM’s capabilities in these domains and showcases its adaptability and efficiency in tackling modern computing challenges.

Contributions: The contributions of the underlying thesis are as follows (visualized in Fig. 1):

•

Designing novel logic gates using the Boolean properties of RRAM and conducting comprehensive experimental studies to validate its efficacy.
•

Designing multi-level arithmetic and finite state automata (FSA) utilizing multi-state properties and developing architectures for the Tsetlin machine.
•

Designing architectures and algorithms for i) implementing true random number generators (TRNG) and physical unclonable functions (PUF) on a single RRAM crossbar and ii) introducing a technique for securely locking neural network weights utilizing an integrated hardware security module.
•

Creating an EDA framework for synthesizing hardware description language (HDL) into SPICE-level netlists for accurate energy analysis.
•

Integrating packaged RRAM chips with FPGAs for hardware prototyping.

Refer to caption — Figure 1: RRAM for computing and security

II Digital LiM

Considering Boolean properties, we initially implemented logic gates using the VTEAM model and evaluated their energy consumption. The analysis revealed the prominence of initialization energy in digital LiM [3, 5]. This prompted the exploration of novel logic gate designs on a different material stack with distinct properties from VTEAM, such as HRS/LRS and SET/RESET ratio. Devices fabricated with the TaOx switching stack possess these properties and are supported by an experimentally validated model (JART VCM). Utilizing this model, we proposed novel in-memory cloning methods [6] and logic gate designs, including OR and NOT gates, on the TaOx 1T1R crossbar. Subsequently, we experimentally validated the proposed gates on a fabricated 8x4 TaOx 1T1R crossbar and observed energy consumption trends [1]. Next, we moved to the multi-valued logic design, considering the multi-state properties of RRAM.

III MVL Computing

As a first step, we investigated state-switching dynamics to facilitate MVL operations. Employing a gradual RESET method on TaOx devices, we achieved multi-level behavior, resulting in at least six stable states between the LRS and HRS. These states were leveraged to develop a ternary arithmetic adder, demonstrated by the 41-trit adder (equivalent to a 64-bit digital adder) [4]. In addition to arithmetic circuits, we proposed the design of FSA utilizing the MVL properties, alongside investigating the impact of D2D and C2C variations on state transitions and detection. The FSA implementation is then simulated, showcasing the Krinsky learning automaton [7].

Finally, we introduced a Tsetlin machine inference architecture that harnesses the Boolean and multi-state properties of RRAM altogether [8]. The proposed architecture demonstrated significant accuracy and energy efficiency improvements compared to traditional machine learning implementations.

IV Hardware Security

Variations such as D2D and C2C pose challenges in computing algorithms, as they significantly affect resistance states, leading to computing errors. While schemes like checksum aim to mitigate errors during in-memory computing due to variation [9]. However, we leverage these variations to design a hardware security module. We propose TRNG and PUF architecture on the same RRAM crossbar, where the TRNG generates entropy utilized by the PUF [2]. Subsequently, we propose multiple configurations of these designs to enhance PUF metrics [10, 11, 12]. Finally, we integrate the PUF with a neural network (NN) on the same crossbar to secure the non-volatile NN weights [13].

V EDA and Hardware Prototype

Next, we developed an EDA framework that considered the different aspects of implementation and many existing RRAM models. For digital LiM, The framework allows the synthesis of the HDL to LiM design at the SPICE level. The proposed framework automatically generates a SPICE-level netlist and testbench voltages for a given application/benchmark. Furthermore, it provides fine-grained energy numbers by calculating the energy consumed by each device in the crossbar [14]. The framework empowers researchers to obtain accurate energy estimates for digital designs, offering valuable insights into their methodologies at the circuit level. The framework also performs the D2D and C2C variation simulation for hardware security, especially for PUF and TRNG analysis.

We developed a hardware prototype by integrating the RRAM packaged chip fabricated by CEA-Leti, France, with the FPGA, as shown in Fig. 1. The packaged chip has multiple crossbar configurations with digital selection peripherals, the biggest crossbar size being $512\times 32$ . We integrated the analog peripherals, such as sense amplifiers, ADCs, DACs, and analog switches outside the chip, to process the signals for any given application. The integration supports custom instructions that a designed FPGA controller handles. The hardware prototype was demonstrated at embedded word 2024 [15].

VI Conclusion & research directions

In conclusion, this thesis leverages the RRAM properties to design novel algorithms and architectures for computing and security. The proposed concepts are experimentally validated, providing the EDA and hardware prototype tools to facilitate the experimental studies across domains.

Expanding upon the insights gained from our research, our goal is to investigate a hybrid approach capable of integrating various schemes into a reconfigurable crossbar. This crossbar concept consolidates all peripherals and functionalities for Boolean, multi-state, and security operations within a single crossbar, facilitating the hybrid approach to cater to a wide range of applications. Also, we aim to enhance the hardware prototype by incorporating all peripherals onto the packaged chip and EDA for hardware prototyping. This initiative lays the foundation for developing next-generation architectures leveraging RRAM technology.

Acknowledgment

I sincerely thank Mr. Johannes Mohr for his invaluable contribution to revising the hardware platform. Additionally, I am deeply thankful to Dr. Vikas Rana for inviting me to the Forschungszentrum Jülich, Germany, for a year-long research visit.

References

[1] A. Bende^∗ and S. Singh^∗ et al.., “Experimental Validation of Memristor-Aided Logic Using 1T1R TaOx RRAM Crossbar Array,” in 37th IEEE VLSID, 2024, pp. 565–570.
[2] S. Singh et al., “Hardware Security Primitives Using Passive RRAM Crossbar Array: Novel TRNG and PUF Designs,” in ASP-DAC 2023, New York, NY, USA, 2023, p. 449–454.
[3] S. Singh et al., “Should we even optimize for execution energy? rethinking mapping for magic design style,” IEEE ESL, vol. 15, no. 4, pp. 230–233, 2023.
[4] S. Singh. et al., “Exploring Multi-Valued Logic and its Application in Emerging Post-CMOS Technologies,” in 18th NanoArch, New York, NY, USA, 2024.
[5] C. K. Jha, K. Qayyum, K. Ç. Coşkun, S. Singh, S., et al. “veriSIMPLER: An Automated Formal Verification Methodology for SIMPLER MAGIC Design Style Based In-Memory Computing,” Accepted in TCAS-I 2024.
[6] S. Singh. et al., “In-Memory Mirroring: Cloning Without Reading,” Accepted in IFIP/IEEE VLSI-SoC, 2024.
[7] S. Singh. et al., “Finite State Automata Design using 1T1R ReRAM Crossbar,” in 21st IEEE NEWCAS, 2023, pp. 1–5.
[8] O. Ghazal^∗ and S. Singh^∗ et al., “IMBUE: In-Memory Boolean-to-CUrrent Inference ArchitecturE for Tsetlin Machines,” in ISLPED, 2023, pp. 1–6.
[9] L. Parrini, T. Soliman, B. Hettwer, J. M. Borrmann, S. Singh et al., “Error Detection and Correction Codes for Safe In-Memory Computations,” ETS, 2024 (Accepted).
[10] S. Singh. et al., “PA-PUF: A Novel Priority Arbiter PUF,” in IFIP/IEEE VLSI-SoC, 2022, pp. 1–6.
[11] G. Rajendran, F. Zahoor, and S. Singh et al., “PR-PUF: A Reconfigurable Strong RRAM PUF,” in IFIP/IEEE VLSI-SoC, 2023, pp. 1–6.
[12] G. Rajendran, F. Zahoor, S. S. Thakker, and S. Singh et al., “Harnessing Entropy: RRAM Crossbar-Based Unified PUF and RNG,” in 37th IEEE VLSID, 2024, pp. 560–564.
[13] S. Singh et al., “Integrated Architecture for Neural Networks and Security Primitives using RRAM Crossbar,” in NEWCAS, 2023, pp. 1–5.
[14] S. Singh et al., “MemSPICE: Automated Simulation and Energy Estimation Framework for MAGIC-Based Logic-in-Memory,” in 29th ASP-DAC, 2024, pp. 282–287.
[15] AiML Demonstrator, “embedded word exhibition & conference,” 2024, accessed on May 10, 2024. [Online]. Available: https://memristec.de/2024/04/11/cutting-edge-innovations-unveiled-at-embedded-world-fair-in-nuremberg-2/