Accelerating convolution in deep neural networks on a CAPI-based FPGA
Ma, Yuan
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/107265
Description
Title
Accelerating convolution in deep neural networks on a CAPI-based FPGA
Author(s)
Ma, Yuan
Contributor(s)
Kindratenko, Volodymyr
Issue Date
2020-05
Keyword(s)
FPGA
Deep Learning Acceleration
HLS
Hardware Design
CAPI-SNAP
Heterogeneous System
Abstract
Convolutional neural networks (CNNs) have emerged as a crucial part in many applications ranging
from self-driving cars to voice-activated assistants. Numerous cloud computing providers, such as
Amazon (AWS), IBM (SoftLayer), and Microsoft (Azure), choose to use heterogeneous computing
systems to off-load the CNN computations from the CPU to a dedicated hard-ware since such
hardware provides significant improvements in both computing throughput and energy savings. In
this senior thesis, the author presents a weight-stationary systolic convolution kernel design for
a field-programmable gate array (FPGA) and its implementation targeting Nallantech 250s+card
that is enabled for the coherent accelerator processor interface (CAPI). CAPI is an interface for
heterogeneous systems that allows accelerators to access I/O devices as CPU peers. Systolic
array architecture has shown advantages for accelerators in tasks that involve vector dot-product
calculations, such as matrix multiplication and convolution. The proposed hardware is synthesized
by the High-Level Synthesis tool targeting Kintex UltraScale+XCKU15P FPGA and can provide high
throughput (4.6×CPU) computations for real-time applications. In the future, the author plans to
extend this kernel to a complete CNN by supporting CPU-FPGA task partitioning using the coherent
memory space enabled by CAPI.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.