TY - GEN
T1 - Parallelized benchmark-driven performance evaluation of SMPs and tiled multi-core architectures for embedded systems
AU - Munir, Arslan
AU - Gordon-Ross, Ann
AU - Ranka, Sanjay
PY - 2012
Y1 - 2012
N2 - With Moore's law supplying billions of transistors on-chip, embedded systems are undergoing a transition from single-core to multi-core to exploit this high transistor density for high performance. However, there exists a plethora of multi-core architectures and the suitability of these multi-core architectures for different embedded domains (e.g., distributed, real-time, reliability-constrained) requires investigation. Despite the diversity of embedded domains, one of the critical applications in many embedded domains (especially distributed embedded domains) is information fusion. Furthermore, many other applications consist of various kernels, such as Gaussian elimination (used in network coding), that dominate the execution time. In this paper, we evaluate two embedded systems multi-core architectural paradigms: symmetric multiprocessors (SMPs) and tiled multi-core architectures (TMAs). We base our evaluation on a parallelized information fusion application and benchmarks that are used as building blocks in applications for SMPs and TMAs. We compare and analyze the performance of an Intel-based SMP and Tilera's TILEPro64 TMA based on our parallelized benchmarks for the following performance metrics: runtime, speedup, efficiency, cost, scalability, and performance per watt. Results reveal that TMAs are more suitable for applications requiring integer manipulation of data with little communication between the parallelized tasks (e.g., information fusion) whereas SMPs are more suitable for applications with floating point computations and a large amount of communication between processor cores.
AB - With Moore's law supplying billions of transistors on-chip, embedded systems are undergoing a transition from single-core to multi-core to exploit this high transistor density for high performance. However, there exists a plethora of multi-core architectures and the suitability of these multi-core architectures for different embedded domains (e.g., distributed, real-time, reliability-constrained) requires investigation. Despite the diversity of embedded domains, one of the critical applications in many embedded domains (especially distributed embedded domains) is information fusion. Furthermore, many other applications consist of various kernels, such as Gaussian elimination (used in network coding), that dominate the execution time. In this paper, we evaluate two embedded systems multi-core architectural paradigms: symmetric multiprocessors (SMPs) and tiled multi-core architectures (TMAs). We base our evaluation on a parallelized information fusion application and benchmarks that are used as building blocks in applications for SMPs and TMAs. We compare and analyze the performance of an Intel-based SMP and Tilera's TILEPro64 TMA based on our parallelized benchmarks for the following performance metrics: runtime, speedup, efficiency, cost, scalability, and performance per watt. Results reveal that TMAs are more suitable for applications requiring integer manipulation of data with little communication between the parallelized tasks (e.g., information fusion) whereas SMPs are more suitable for applications with floating point computations and a large amount of communication between processor cores.
KW - embedded systems
KW - multi-core
KW - performance evaluation
UR - http://www.scopus.com/inward/record.url?scp=84874304958&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84874304958&partnerID=8YFLogxK
U2 - 10.1109/PCCC.2012.6407785
DO - 10.1109/PCCC.2012.6407785
M3 - Conference contribution
AN - SCOPUS:84874304958
SN - 9781467348812
T3 - 2012 IEEE 31st International Performance Computing and Communications Conference, IPCCC 2012
SP - 416
EP - 423
BT - 2012 IEEE 31st International Performance Computing and Communications Conference, IPCCC 2012
T2 - 2012 IEEE 31st International Performance Computing and Communications Conference, IPCCC 2012
Y2 - 1 December 2012 through 3 December 2012
ER -