TY - GEN
T1 - Massive Scaling of MASSIF
T2 - 7th Annual Platform for Advanced Scientific Computing Conference, PASC 2020
AU - Kulkarni, Anuva
AU - Kovačević, Jelena
AU - Franchetti, Franz
N1 - Publisher Copyright:
© 2020 Owner/Author.
PY - 2020/6/29
Y1 - 2020/6/29
N2 - Micromechanical Analysis of Stress-Strain Inhomogeneities with Fourier transforms (MASSIF) is a large-scale Fortran-based differential equation solver used to study local stresses and strains in materials. Due to its prohibitive memory requirements, it is extremely difficult to port the code to GPUs with small on-device memory. In this work, we present an algorithm design that uses domain decomposition with approximate convolution, which reduces memory footprint to make the MASSIF simulation feasible on distributed GPU systems. A first-order performance model of our method estimates that compression and multi-resolution sampling strategies can enable domain computation within GPU memory constraints for 3D grids larger than those simulated by the current state-of-the-art Fortran MPI implementation. The model analysis also provides an insight into design requirements for further scalability. Lastly, we discuss the extension of our method to irregular domain decomposition and challenges to be tackled in the future.
AB - Micromechanical Analysis of Stress-Strain Inhomogeneities with Fourier transforms (MASSIF) is a large-scale Fortran-based differential equation solver used to study local stresses and strains in materials. Due to its prohibitive memory requirements, it is extremely difficult to port the code to GPUs with small on-device memory. In this work, we present an algorithm design that uses domain decomposition with approximate convolution, which reduces memory footprint to make the MASSIF simulation feasible on distributed GPU systems. A first-order performance model of our method estimates that compression and multi-resolution sampling strategies can enable domain computation within GPU memory constraints for 3D grids larger than those simulated by the current state-of-the-art Fortran MPI implementation. The model analysis also provides an insight into design requirements for further scalability. Lastly, we discuss the extension of our method to irregular domain decomposition and challenges to be tackled in the future.
KW - Algorithms
KW - Domain Decomposition
KW - Fast Fourier Transforms
KW - GPU
UR - http://www.scopus.com/inward/record.url?scp=85090120275&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85090120275&partnerID=8YFLogxK
U2 - 10.1145/3394277.3401857
DO - 10.1145/3394277.3401857
M3 - Conference contribution
AN - SCOPUS:85090120275
T3 - Proceedings of the Platform for Advanced Scientific Computing Conference, PASC 2020
BT - Proceedings of the Platform for Advanced Scientific Computing Conference, PASC 2020
PB - Association for Computing Machinery
Y2 - 29 June 2020 through 1 July 2020
ER -