TY - GEN
T1 - Parallel visualization on large clusters using MapReduce
AU - Vo, Huy T.
AU - Bronson, Jonathan
AU - Summa, Brian
AU - Comba, João L D
AU - Freire, Juliana
AU - Howe, Bill
AU - Pascucci, Valerio
AU - Silva, Cláudio T.
PY - 2011
Y1 - 2011
N2 - Large-scale visualization systems are typically designed to efficiently "push" datasets through the graphics hardware. However, exploratory visualization systems are increasingly expected to support scalable data manipulation, restructuring, and querying capabilities in addition to core visualization algorithms. We posit that new emerging abstractions for parallel data processing, in particular computing clouds, can be leveraged to support large-scale data exploration through visualization. In this paper, we take a first step in evaluating the suitability of the MapReduce framework to implement large-scale visualization techniques. MapReduce is a lightweight, scalable, general-purpose parallel data processing framework increasingly popular in the context of cloud computing. Specifically, we implement and evaluate a representative suite of visualization tasks (mesh rendering, isosurface extraction, and mesh simplification) as MapReduce programs, and report quantitative performance results applying these algorithms to realistic datasets. For example, we perform isosurface extraction of up to l6 isovalues for volumes composed of 27 billion voxels, simplification of meshes with 30GBs of data and subsequent rendering with image resolutions up to 80000 2 pixels. Our results indicate that the parallel scalability, ease of use, ease of access to computing resources, and fault-tolerance of MapReduce offer a promising foundation for a combined data manipulation and data visualization system deployed in a public cloud or a local commodity cluster.
AB - Large-scale visualization systems are typically designed to efficiently "push" datasets through the graphics hardware. However, exploratory visualization systems are increasingly expected to support scalable data manipulation, restructuring, and querying capabilities in addition to core visualization algorithms. We posit that new emerging abstractions for parallel data processing, in particular computing clouds, can be leveraged to support large-scale data exploration through visualization. In this paper, we take a first step in evaluating the suitability of the MapReduce framework to implement large-scale visualization techniques. MapReduce is a lightweight, scalable, general-purpose parallel data processing framework increasingly popular in the context of cloud computing. Specifically, we implement and evaluate a representative suite of visualization tasks (mesh rendering, isosurface extraction, and mesh simplification) as MapReduce programs, and report quantitative performance results applying these algorithms to realistic datasets. For example, we perform isosurface extraction of up to l6 isovalues for volumes composed of 27 billion voxels, simplification of meshes with 30GBs of data and subsequent rendering with image resolutions up to 80000 2 pixels. Our results indicate that the parallel scalability, ease of use, ease of access to computing resources, and fault-tolerance of MapReduce offer a promising foundation for a combined data manipulation and data visualization system deployed in a public cloud or a local commodity cluster.
KW - Hadoop
KW - MapReduce
KW - cloud computing
KW - gigapixels
KW - large meshes
KW - volume rendering
UR - http://www.scopus.com/inward/record.url?scp=84055184196&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84055184196&partnerID=8YFLogxK
U2 - 10.1109/LDAV.2011.6092321
DO - 10.1109/LDAV.2011.6092321
M3 - Conference contribution
AN - SCOPUS:84055184196
SN - 9781467301541
T3 - 1st IEEE Symposium on Large-Scale Data Analysis and Visualization 2011, LDAV 2011 - Proceedings
SP - 81
EP - 88
BT - 1st IEEE Symposium on Large-Scale Data Analysis and Visualization 2011, LDAV 2011 - Proceedings
T2 - 1st IEEE Symposium on Large-Scale Data Analysis and Visualization 2011, LDAV 2011
Y2 - 23 October 2011 through 24 October 2011
ER -