TY - GEN
T1 - Post-Silicon Customization Using Deep Neural Networks
AU - Weston, Kevin
AU - Janfaza, Vahid
AU - Taur, Abhishek
AU - Mungra, Dhara
AU - Kansal, Arnav
AU - Zahran, Mohamed
AU - Muzahid, Abdullah
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - Dynamically customizing processor architecture after fabrication, also known as Post-Silicon Customization (PSC) is effective in balancing the conflicting demands of power and performance for various applications. Existing approaches either use application-specific profiles or some adhoc heuristics or simpler machine learning models. These techniques often do not unleash the full potential of PSC as they fail to explore and exploit PSC opportunities to a larger extent. Towards that end, we propose the first deep neural network (DNN) based PSC technique, called Forecaster. Forecaster exploits several intuitive observations to cope with the long inference latency of a DNN model and boost customization impact. Forecaster works in two phases. In Phase 1, Forecaster builds a dataset and then, selects and trains a suitable DNN model offline. In Phase 2, Forecaster periodically collects hardware telemetry and uses the trained model to customize hardware resources. We provide a detailed design and implementation of Forecaster and compare its performance against a prior state-of-the-art approach. Our experimental results indicate that on average, Forecaster provides 2.5X more power efficiency gain over the best static configuration setup while sacrificing less than 1.0% of overall performance and less than 3.5% extra system power. Compared to the prior scheme, Forecaster increases the power efficiency gain up to 1.5X while reducing the performance degradation by 44%.
AB - Dynamically customizing processor architecture after fabrication, also known as Post-Silicon Customization (PSC) is effective in balancing the conflicting demands of power and performance for various applications. Existing approaches either use application-specific profiles or some adhoc heuristics or simpler machine learning models. These techniques often do not unleash the full potential of PSC as they fail to explore and exploit PSC opportunities to a larger extent. Towards that end, we propose the first deep neural network (DNN) based PSC technique, called Forecaster. Forecaster exploits several intuitive observations to cope with the long inference latency of a DNN model and boost customization impact. Forecaster works in two phases. In Phase 1, Forecaster builds a dataset and then, selects and trains a suitable DNN model offline. In Phase 2, Forecaster periodically collects hardware telemetry and uses the trained model to customize hardware resources. We provide a detailed design and implementation of Forecaster and compare its performance against a prior state-of-the-art approach. Our experimental results indicate that on average, Forecaster provides 2.5X more power efficiency gain over the best static configuration setup while sacrificing less than 1.0% of overall performance and less than 3.5% extra system power. Compared to the prior scheme, Forecaster increases the power efficiency gain up to 1.5X while reducing the performance degradation by 44%.
KW - Deep neural network
KW - FPGA
KW - post-silicon customization
UR - http://www.scopus.com/inward/record.url?scp=85171445551&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85171445551&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-42785-5_9
DO - 10.1007/978-3-031-42785-5_9
M3 - Conference contribution
AN - SCOPUS:85171445551
SN - 9783031427848
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 120
EP - 136
BT - Architecture of Computing Systems - 36th International Conference, ARCS 2023, Proceedings
A2 - Goumas, Georgios
A2 - Tomforde, Sven
A2 - Brehm, Jürgen
A2 - Wildermann, Stefan
A2 - Pionteck, Thilo
PB - Springer Science and Business Media Deutschland GmbH
T2 - 36th International Conference on Architecture of Computing Systems, ARCS 2023
Y2 - 13 June 2023 through 15 June 2023
ER -