TY - GEN
T1 - A queueing theoretic approach for performance evaluation of low-power multi-core embedded systems
AU - Munir, Arslan
AU - Gordon-Ross, Ann
AU - Ranka, Sanjay
PY - 2011
Y1 - 2011
N2 - With Moore's law supplying billions of transistors on-chip, embedded systems are undergoing a transition from single-core to multi-core to exploit this high transistor density for high performance. However, the optimal layout of these multiple cores along with the memory subsystem (caches and main memory) to satisfy power, area, and often stringent real-time constraints is a challenging design endeavor. The short time-to-market constraint of embedded systems exacerbates this design challenge and necessitates the architectural modeling of embedded systems to reduce the time-to-market by expediting target applications to device/architecture mapping. In this paper, we present a queueing theoretic approach for modeling multi-core embedded systems that provides a quick and inexpensive performance evaluation both in terms of time and resources as compared to the development of multi-core simulators and running benchmarks on these simulators. We also calculate chip area and power consumption for different multi-core embedded architectures with a varying number of processor cores and cache configurations to provide a comparative analysis of multicore embedded architectures in terms of performance, area, and power consumption. Our performance and power results indicate that multi-core embedded system architectures that leverage shared last-level caches (LLCs) provide the best LLC performance per watt but may introduce main memory response time and throughput bottlenecks for high cache miss rates, whereas architectures leveraging a hybrid of private and shared LLCs alleviate main memory bottlenecks at the expense of reduced performance per watt.
AB - With Moore's law supplying billions of transistors on-chip, embedded systems are undergoing a transition from single-core to multi-core to exploit this high transistor density for high performance. However, the optimal layout of these multiple cores along with the memory subsystem (caches and main memory) to satisfy power, area, and often stringent real-time constraints is a challenging design endeavor. The short time-to-market constraint of embedded systems exacerbates this design challenge and necessitates the architectural modeling of embedded systems to reduce the time-to-market by expediting target applications to device/architecture mapping. In this paper, we present a queueing theoretic approach for modeling multi-core embedded systems that provides a quick and inexpensive performance evaluation both in terms of time and resources as compared to the development of multi-core simulators and running benchmarks on these simulators. We also calculate chip area and power consumption for different multi-core embedded architectures with a varying number of processor cores and cache configurations to provide a comparative analysis of multicore embedded architectures in terms of performance, area, and power consumption. Our performance and power results indicate that multi-core embedded system architectures that leverage shared last-level caches (LLCs) provide the best LLC performance per watt but may introduce main memory response time and throughput bottlenecks for high cache miss rates, whereas architectures leveraging a hybrid of private and shared LLCs alleviate main memory bottlenecks at the expense of reduced performance per watt.
KW - embedded systems
KW - low-power
KW - Multi-core
KW - performance evaluation
KW - queueing theory
UR - http://www.scopus.com/inward/record.url?scp=83455196205&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=83455196205&partnerID=8YFLogxK
U2 - 10.1109/ICCD.2011.6081397
DO - 10.1109/ICCD.2011.6081397
M3 - Conference contribution
AN - SCOPUS:83455196205
SN - 9781457719523
T3 - Proceedings - IEEE International Conference on Computer Design: VLSI in Computers and Processors
SP - 198
EP - 205
BT - 2011 IEEE 29th International Conference on Computer Design, ICCD 2011
T2 - 29th IEEE International Conference on Computer Design 2011, ICCD 2011
Y2 - 9 November 2011 through 12 November 2011
ER -