Director of Research
Snir, Marc
Doctoral Committee Chair(s)
Snir, Marc
Committee Member(s)
Gropp, William
Hwu, Wen-mei
Van Essen, Brian
Schwing, Alexander
Department of Study
Computer Science
Discipline
Computer Science
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
Ph.D.
Degree Level
Dissertation
Keyword(s)
high-performance computing
deep learning
convolutional neural network
parallel computing
machine learning
Abstract
Accelerating and scaling the training of deep neural networks (DNNs) is critical to keep up with growing datasets, reduce training times, and enable training on memory-constrained problems where parallelism is necessary. In this thesis, I present a set of techniques that leverage large high-performance computing systems for fast training of DNNs. I first introduce a suite of algorithms to exploit additional parallelism in convolutional layers during training, expanding beyond the standard sample-wise data-parallel approach to include spatial parallelism and channel and filter parallelism. Next, I present optimizations to communication frameworks to reduce communication overheads at large scales. Finally, I discuss communication quantization, which directly reduces communication volumes. In concert, these methods allow rapid training and enable training on problems that were previously infeasible.
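
As an illustration of the third technique family named in the abstract, the sketch below shows one common form of communication quantization: one-bit sign quantization with error feedback, where each process transmits only the sign of every gradient entry plus a single scale factor and carries the quantization error into the next step. The function name and the NumPy-based setup are illustrative assumptions for exposition, not the dissertation's actual implementation.

    import numpy as np

    def quantize_with_error_feedback(grad, residual):
        # Fold in the quantization error left over from the previous step.
        adjusted = grad + residual
        # One scalar stands in for all magnitudes; one bit per entry on the wire.
        scale = np.mean(np.abs(adjusted))
        signs = np.sign(adjusted)
        # What the receiver reconstructs from (signs, scale).
        decoded = scale * signs
        # The error stays local and is retransmitted implicitly next step.
        new_residual = adjusted - decoded
        return signs, scale, new_residual

    # Usage: each step, exchange (signs, scale) instead of the full float array.
    rng = np.random.default_rng(0)
    residual = np.zeros(8)
    grad = rng.standard_normal(8)
    signs, scale, residual = quantize_with_error_feedback(grad, residual)
    print(signs, scale)

In a data-parallel run, the (signs, scale) pair is what an allreduce-style exchange would move over the network, cutting per-step communication volume roughly 32x relative to 32-bit floats, at the cost of the reconstruction error tracked in the residual.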