TY - GEN
T1 - Application-aware management of parallel simulation collections
AU - Yau, Siu Man
AU - Karamcheti, Vijay
AU - Zorin, Denis
AU - Damevski, Kostadin
AU - Parker, Steven G.
N1 - Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2009
Y1 - 2009
N2 - This paper presents a system deployed on parallel clusters to manage a collection of parallel simulations that make up a computational study. It explores how such a system can extend traditional parallel job scheduling and resource allocation techniques to incorporate knowledge specific to the study. Using a UINTAH-based helium gas simulation code (ARCHES) and the SimX system for multi-experiment computational studies, this paper demonstrates that, by using application-specific knowledge in resource allocation and scheduling decisions, one can reduce the run time of a computational study from over 20 hours to under 4.5 hours on a 32-processor cluster, and from almost 11 hours to just over 3.5 hours on a 64-processor cluster.
KW - High-throughput computing
KW - Parallel system
UR - http://www.scopus.com/inward/record.url?scp=67650070998&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67650070998&partnerID=8YFLogxK
U2 - 10.1145/1504176.1504184
DO - 10.1145/1504176.1504184
M3 - Conference contribution
AN - SCOPUS:67650070998
SN - 9781605583976
T3 - Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP
SP - 35
EP - 44
BT - Proceedings of the 2009 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'09
T2 - 2009 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'09
Y2 - 14 February 2009 through 18 February 2009
ER -