Deep Learning for Computer Architects

Brandon Reagen, Robert Adolf, Paul Whatmough, Gu Yeon Wei, David Brooks

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Machine learning, and specifically deep learning, has been hugely disruptive in many fields of computer science. The success of deep learning techniques in solving notoriously difficult classification and regression problems has resulted in their rapid adoption in solving real-world problems. The emergence of deep learning is widely attributed to a virtuous cycle whereby fundamental advancements in training deeper models were enabled by the availability of massive datasets and high-performance computer hardware. This text serves as a primer for computer architects in a new and rapidly evolving field. We review how machine learning has evolved since its inception in the 1960s and track the key developments leading up to the emergence of the powerful deep learning techniques that emerged in the last decade. Next we review representative workloads, including the most commonly used datasets and seminal networks across a variety of domains. In addition to discussing the workloads themselves, we also detail the most popular deep learning tools and show how aspiring practitioners can use the tools with the workloads to characterize and optimize DNNs. The remainder of the book is dedicated to the design and optimization of hardware and architectures for machine learning. As high-performance hardware was so instrumental in the success of machine learning becoming a practical solution, this chapter recounts a variety of optimizations proposed recently to further improve future designs. Finally, we present a review of recent research published in the area as well as a taxonomy to help readers understand how various contributions fall in context.

Original languageEnglish (US)
Title of host publicationHardware and Software Support for Virtualization
PublisherMorgan and Claypool Publishers
PagesI-109
Edition4
DOIs
StatePublished - Aug 22 2017

Publication series

NameSynthesis Lectures on Computer Architecture
Number4
Volume12
ISSN (Print)1935-3235
ISSN (Electronic)1935-3243

Keywords

  • DNN benchmarking and characterization
  • deep learning
  • hardware software co-design
  • hardware support for machine learning
  • neural network accelerators

ASJC Scopus subject areas

  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Deep Learning for Computer Architects'. Together they form a unique fingerprint.

Cite this