TY - JOUR
T1 - A high-throughput distributed shared-buffer NoC router
AU - Soteriou, Vassos
AU - Ramanujam, Rohit Sunkam
AU - Lin, Bill
AU - Peh, Li Shiuan
PY - 2009/1
Y1 - 2009/1
N2 - Microarchitectural configurations of buffers in routers have a significant impact on the overall performance of an on-chip network (NoC). This buffering can be at the inputs or the outputs of a router, corresponding to an input-buffered router (IBR) or an output-buffered router (OBR). OBRs are attractive because they have higher throughput and lower queuing delays under high loads than IBRs. However, a direct implementation of OBRs requires a router speedup equal to the number of ports, making such a design prohibitive given the aggressive clocking and power budgets of most NoC applications. In this letter, we propose a new router design that aims to emulate an OBR practically based on a distributed shared-buffer (DSB) router architecture. We introduce innovations to address the unique constraints of NoCs, including efficient pipelining and novel flow control. Our DSB design can achieve significantly higher bandwidth at saturation, with an improvement of up to 20 when compared to a state-of-the-art pipelined IBR with the same amount of buffering, and our proposed microarchitecture can achieve up to 94 of the ideal saturation throughput.
AB - Microarchitectural configurations of buffers in routers have a significant impact on the overall performance of an on-chip network (NoC). This buffering can be at the inputs or the outputs of a router, corresponding to an input-buffered router (IBR) or an output-buffered router (OBR). OBRs are attractive because they have higher throughput and lower queuing delays under high loads than IBRs. However, a direct implementation of OBRs requires a router speedup equal to the number of ports, making such a design prohibitive given the aggressive clocking and power budgets of most NoC applications. In this letter, we propose a new router design that aims to emulate an OBR practically based on a distributed shared-buffer (DSB) router architecture. We introduce innovations to address the unique constraints of NoCs, including efficient pipelining and novel flow control. Our DSB design can achieve significantly higher bandwidth at saturation, with an improvement of up to 20 when compared to a state-of-the-art pipelined IBR with the same amount of buffering, and our proposed microarchitecture can achieve up to 94 of the ideal saturation throughput.
UR - http://www.scopus.com/inward/record.url?scp=67650581749&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67650581749&partnerID=8YFLogxK
U2 - 10.1109/L-CA.2009.5
DO - 10.1109/L-CA.2009.5
M3 - Article
AN - SCOPUS:67650581749
SN - 1556-6056
VL - 8
SP - 21
EP - 24
JO - IEEE Computer Architecture Letters
JF - IEEE Computer Architecture Letters
IS - 1
M1 - 4906238
ER -