TY - GEN
T1 - HW/SW co-design and co-optimizations for deep learning
AU - Marchisio, Alberto
AU - Hanif, Muhammad Abdullah
AU - Putra, Rachmad Vidya Wicaksana
AU - Shafique, Muhammad
N1 - Publisher Copyright:
© 2018 Association for Computing Machinery.
Copyright:
Copyright 2019 Elsevier B.V., All rights reserved.
PY - 2018/10/4
Y1 - 2018/10/4
N2 - Deep Learning algorithms have been proven to provide state-of-the-art results in many applications but at the cost of a high computational complexity. Therefore, accelerating such algorithms in hardware is highly needed. However, since the computational requirements are growing exponentially along with the accuracy, their demand for hardware resources is significant. To tackle this issue, we propose a methodology, involving both software and hardware, to optimize the Deep Neural Networks (DNNs). We discuss and analyze pruning, approximations through quantization and specialized accelerators for DNN inference. For each phase of the methodology, we provide quantitative comparisons with the existing techniques and hardware platforms.
AB - Deep Learning algorithms have been proven to provide state-of-the-art results in many applications but at the cost of a high computational complexity. Therefore, accelerating such algorithms in hardware is highly needed. However, since the computational requirements are growing exponentially along with the accuracy, their demand for hardware resources is significant. To tackle this issue, we propose a methodology, involving both software and hardware, to optimize the Deep Neural Networks (DNNs). We discuss and analyze pruning, approximations through quantization and specialized accelerators for DNN inference. For each phase of the methodology, we provide quantitative comparisons with the existing techniques and hardware platforms.
KW - Deep Learning
KW - Pruning
KW - Quantization
KW - Systolic Array
UR - http://www.scopus.com/inward/record.url?scp=85058645892&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85058645892&partnerID=8YFLogxK
U2 - 10.1145/3285017.3285022
DO - 10.1145/3285017.3285022
M3 - Conference contribution
AN - SCOPUS:85058645892
T3 - ACM International Conference Proceeding Series
SP - 13
EP - 18
BT - Workshop Proceedings - 2018 INTelligent Embedded Systems Architectures and Applications, INTESA 2018
PB - Association for Computing Machinery
T2 - 2018 Workshop on INTelligent Embedded Systems Architectures and Applications, INTESA 2018
Y2 - 4 October 2018
ER -