TY - GEN
T1 - Parallelizing the ZSWEEP Algorithm for Distributed-Shared Memory Architectures
AU - Farias, Ricardo
AU - Silva, Cĺaudio T.
N1 - Funding Information:
We are grateful to Anne Rogers (AT&T) for suggestions and help in analyzing the memory characteristics of the tile-based ZSWEEP, Kwan-Liu Ma (UC, Davis) for suggestions that greatly improved our presentation, Joseph Mitchell (Stony Brook) and Brian Wylie (Sandia) for their collaboration on this research. NASA has gracefully provided the Liquid Oxygen Post dataset. Peter Williams (LLNL) gave us the SPX dataset. This work was made possible with the generous support of Sandia National Labs and the Dept of Energy Mathematics, Information and Computer Science Office. R. Farias acknowledges partial support from CNPq-Brazil under a PhD fellowship.
Publisher Copyright:
© VG 2001. All rights reserved.
PY - 2001
Y1 - 2001
N2 - In this paper we describe a simple parallelization of the ZSWEEP algorithm for rendering unstructured volumetric grids on distributed-shared memory machines, and study its performance on three generations of SGI multiprocessors, including the new Origin 3000 series. The main idea of the ZSWEEP algorithm is very simple; it is based on sweeping the data with a plane parallel to the viewing plane, in order of increasing z, projecting the faces of cells that are incident to vertices as they are encountered by the sweep plane. Our parallel extension of the basic algorithm makes use of an image-based task partitioning scheme. Essentially, the screen is divided in more tiles than the number of processors, then each processor performs the sweep independently on the next available tile, until no more tiles are available to render. Here, we detail the modifications necessary to efficiently extend the sequential algorithm to work on shared-memory machines. We report on the performance of our implementation, and show that the tile-based ZSWEEP is naturally cache friendly, achieves fast rendering times, and substantial speedups on all the machines we used for testing. On one processor of our Origin 3000, we measure the L2 data cache hit rate of the tile-based ZSWEEP to be over 99%; a parallel efficiency of 83% on 16 processors; and rendering rates of about 300 thousand tetrahedra per second for a 1024 image.
AB - In this paper we describe a simple parallelization of the ZSWEEP algorithm for rendering unstructured volumetric grids on distributed-shared memory machines, and study its performance on three generations of SGI multiprocessors, including the new Origin 3000 series. The main idea of the ZSWEEP algorithm is very simple; it is based on sweeping the data with a plane parallel to the viewing plane, in order of increasing z, projecting the faces of cells that are incident to vertices as they are encountered by the sweep plane. Our parallel extension of the basic algorithm makes use of an image-based task partitioning scheme. Essentially, the screen is divided in more tiles than the number of processors, then each processor performs the sweep independently on the next available tile, until no more tiles are available to render. Here, we detail the modifications necessary to efficiently extend the sequential algorithm to work on shared-memory machines. We report on the performance of our implementation, and show that the tile-based ZSWEEP is naturally cache friendly, achieves fast rendering times, and substantial speedups on all the machines we used for testing. On one processor of our Origin 3000, we measure the L2 data cache hit rate of the tile-based ZSWEEP to be over 99%; a parallel efficiency of 83% on 16 processors; and rendering rates of about 300 thousand tetrahedra per second for a 1024 image.
UR - http://www.scopus.com/inward/record.url?scp=11144258267&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=11144258267&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:11144258267
T3 - 2nd IEEE TCVG / Eurographics International Workshop on Volume Graphics, VG 2001
SP - 183
EP - 194
BT - 2nd IEEE TCVG / Eurographics International Workshop on Volume Graphics, VG 2001
A2 - Mueller, K.
A2 - Kaufman, A.
PB - The Eurographics Association
T2 - 2nd IEEE TCVG / Eurographics International Workshop on Volume Graphics, VG 2001
Y2 - 21 June 2001 through 22 June 2001
ER -