TY - GEN
T1 - Fault-tolerant aggregation
T2 - 15th International Conference on Principles of Distributed Systems, OPODIS 2011
AU - Almeida, Paulo Sérgio
AU - Baquero, Carlos
AU - Farach-Colton, Martín
AU - Jesus, Paulo
AU - Mosteiro, Miguel A.
PY - 2011
Y1 - 2011
N2 - Flow-Updating (FU) is a fault-tolerant technique that has proved to be efficient in practice for the distributed computation of aggregate functions in communication networks where individual processors do not have access to global information. Previous distributed aggregation protocols, based on repeated sharing of input values (or mass) among processors, sometimes called Mass-Distribution (MD) protocols, are not resilient to communication failures (or message loss) because such failures yield a loss of mass. In this paper, we present a protocol which we call Mass-Distribution with Flow-Updating (MDFU). We obtain MDFU by applying FU techniques to classic MD. We analyze the convergence time of MDFU showing that stochastic message loss produces low overhead. This is the first convergence proof of an FU-based algorithm. We evaluate MDFU experimentally, comparing it with previous MD and FU protocols, and verifying the behavior predicted by the analysis. Finally, given that MDFU incurs a fixed deviation proportional to the message-loss rate, we adjust the accuracy of MDFU heuristically in a new protocol called MDFU with Linear Prediction (MDFU-LP). The evaluation shows that both MDFU and MDFU-LP behave very well in practice, even under high rates of message loss and even changing the input values dynamically.
AB - Flow-Updating (FU) is a fault-tolerant technique that has proved to be efficient in practice for the distributed computation of aggregate functions in communication networks where individual processors do not have access to global information. Previous distributed aggregation protocols, based on repeated sharing of input values (or mass) among processors, sometimes called Mass-Distribution (MD) protocols, are not resilient to communication failures (or message loss) because such failures yield a loss of mass. In this paper, we present a protocol which we call Mass-Distribution with Flow-Updating (MDFU). We obtain MDFU by applying FU techniques to classic MD. We analyze the convergence time of MDFU showing that stochastic message loss produces low overhead. This is the first convergence proof of an FU-based algorithm. We evaluate MDFU experimentally, comparing it with previous MD and FU protocols, and verifying the behavior predicted by the analysis. Finally, given that MDFU incurs a fixed deviation proportional to the message-loss rate, we adjust the accuracy of MDFU heuristically in a new protocol called MDFU with Linear Prediction (MDFU-LP). The evaluation shows that both MDFU and MDFU-LP behave very well in practice, even under high rates of message loss and even changing the input values dynamically.
KW - Aggregate computation
KW - Communication networks
KW - Distributed computing
KW - Radio networks
UR - http://www.scopus.com/inward/record.url?scp=84055212599&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84055212599&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-25873-2_35
DO - 10.1007/978-3-642-25873-2_35
M3 - Conference contribution
AN - SCOPUS:84055212599
SN - 9783642258725
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 513
EP - 527
BT - Principles of Distributed Systems - 15th International Conference, OPODIS 2011, Proceedings
Y2 - 13 December 2011 through 16 December 2011
ER -