Withdraw
Loading…
Energy-efficient Resistive In-memory Computing Architectures
Roy, Saion Kumar
Content Files

Loading…
Download Files
Loading…
Download Counts (All Files)
Loading…
Edit File
Loading…
Permalink
https://hdl.handle.net/2142/125621
Description
- Title
- Energy-efficient Resistive In-memory Computing Architectures
- Author(s)
- Roy, Saion Kumar
- Issue Date
- 2024-07-12
- Director of Research (if dissertation) or Advisor (if thesis)
- Shanbhag, Naresh
- Doctoral Committee Chair(s)
- Shanbhag, Naresh
- Committee Member(s)
- Hanumolu, Pavan
- Ghose, Saugata
- Seo, Jae-sun
- Department of Study
- Electrical & Computer Eng
- Discipline
- Electrical & Computer Engr
- Degree Granting Institution
- University of Illinois at Urbana-Champaign
- Degree Name
- Ph.D.
- Degree Level
- Dissertation
- Keyword(s)
- In-memory Computing
- resistive IMC
- Benchmarking
- SRAM
- eNVM
- MRAM
- compute SNDR
- Model Extraction Attacks
- Security Vulnerabilities
- Abstract
- Embedded non-volatile memory (eNVM)-based in-memory computing (IMC) architectures have garnered significant interest due to their high memory density and non-volatility in realizing efficient implementations of deep neural networks (DNNs). Their high density enables on-chip storage of DNN models, thereby reducing the energy and latency costs associated with data movement. Additionally, their non-volatile nature supports efficient activity-driven duty-cycled operations. However, these benefits have not been realized in real-life eNVM-based IMC integrated chip (IC) prototypes, which significantly lag behind SRAM-based IMCs and digital accelerators in terms of energy efficiency and compute density. This lag is primarily due to the presence of large read-out circuits required to achieve high bank-level accuracy. Moreover, the fundamental energy-accuracy trade-offs of eNVM-based IMCs are not well understood, as most research focuses on achieving high network-level accuracy, which masks bank-level accuracy due to the error tolerance of DNNs and the use of noise-aware training. This dissertation aims to evaluate the intrinsic energy-accuracy trade-offs of eNVM-based IMCs and develop techniques to enhance bank-level accuracy. First, we introduce a comprehensive benchmarking methodology for IMCs that quantifies the significant lag in energy efficiency and compute density of eNVM-based IMCs compared to SRAM-based IMCs and digital accelerators. Second, to understand this lag, which arises from the low bank-level accuracy of resistive IMCs, we use the compute signal-to-noise-plus-distortion ratio (SNDR) to measure accuracy and identify its dependence on device, circuit, and architectural parameters. This is achieved by developing a chip-verified behavioral model, which reveals that the maximum achievable SNDR is limited by pre-analog-to-digital-converter (ADC) analog non-idealities. Additionally, we find that increasing the device-level conductance contrast improves the maximum achievable SNDR only up to a certain point, beyond which it saturates. Third, building on insights from the behavioral model, we propose two bank-level accuracy-boosting techniques at the circuit and algorithmic levels, demonstrating a compute SNDR improvement from 2.7 to 6dB in a 22nm MRAM-based IMC prototype. This boost is traded off to achieve a 5x reduction in energy per 1b-operation, with modest energy and area overhead. Furthermore, our work shows that the bank-level accuracy boost correlates well with network-level accuracy without resorting to noise-aware training. Lastly, we explore the security vulnerabilities of eNVM-based IMC architectures against model extraction attacks (MEAs). These attacks can reconstruct private training data from trained model parameters, leaking sensitive user information. While the presence of analog noise in eNVM-based IMC computation might suggest intrinsic robustness to MEAs, we demonstrate that this is not the case. Our results underscore the critical importance of investigating the security vulnerabilities of IMCs in general and eNVM-based IMCs in particular.
- Graduation Semester
- 2024-08
- Type of Resource
- Thesis
- Handle URL
- https://hdl.handle.net/2142/125621
- Copyright and License Information
- Copyright 2024 Saion Kumar Roy
Owning Collections
Graduate Dissertations and Theses at Illinois PRIMARY
Graduate Theses and Dissertations at IllinoisManage Files
Loading…
Edit Collection Membership
Loading…
Edit Metadata
Loading…
Edit Properties
Loading…
Embargoes
Loading…