Performance of the 3D FFT on the 6D network torus QCDOC parallel supercomputer

Bin Fang, Yuefan Deng, Glenn Martyna

Research output: Contribution to journalArticlepeer-review

Abstract

QCDOC is a massively parallel supercomputer with tens of thousands of nodes distributed on a six-dimensional torus network. The 6D structure of the network provides the needed communication resources for many communication-intensive applications. In this paper, we present a parallel algorithm for three-dimensional Fast Fourier Transform and its implementation for a 4096-node QCDOC prototype. Two techniques have been used to increase its parallel performance: simultaneous multi-dimensional communication and communication-and-computation overlapping. Benchmarking experiments suggest that 3D FFTs of size 128 × 128 × 128 can scale well on such platforms up to 4096 nodes. Our performance results suggest stronger scalability on QCDOC than on IBM BlueGene/L supercomputer.

Original languageEnglish (US)
Pages (from-to)531-538
Number of pages8
JournalComputer Physics Communications
Volume176
Issue number8
DOIs
StatePublished - Apr 15 2007

Fingerprint

Dive into the research topics of 'Performance of the 3D FFT on the 6D network torus QCDOC parallel supercomputer'. Together they form a unique fingerprint.

Cite this