Abstract
We present Hadoop-based replica exchange (HaRE), an implementation of the replica exchange scheme on Hadoop, developed primarily for replica exchange statistical temperature molecular dynamics (RESTMD), an example of a large-scale, advanced sampling molecular dynamics simulation. Using Hadoop as the framework and the MapReduce model to drive replica exchange introduces efficient task-level parallelism to RESTMD simulations. To demonstrate this, we investigate the performance of our application over various distributed cyberinfrastructures (DCI), including several high-performance computing systems, our cyberinfrastructure for reconfigurable optical networks testbed, the global environment for network innovations (GENI) testbed, and the CloudLab testbed. We analyze scalability in terms of scale-out and scale-up over a single high-performance computing cluster, EC2, and CloudLab, and in terms of scale-across with the cyberinfrastructure for reconfigurable optical networks and GENI testbeds. The results demonstrate that HaRE executes efficiently over both homogeneous and heterogeneous DCI of varying size and configuration. We discuss the factors contributing to performance to provide insight into the effects of the computing environment on the execution of HaRE. Based on these contributions, we propose that similar loosely coupled scientific applications can also take advantage of the scalable, task-level parallelism that Hadoop MapReduce provides over various DCI. Copyright © 2016 John Wiley & Sons, Ltd.
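To illustrate the kind of task-level parallelism the abstract describes, the following is a minimal sketch in plain Python, not the HaRE implementation and not Hadoop code. It assumes a map-like phase in which each replica advances independently and a reduce-like phase in which neighboring replicas attempt to swap configurations. The swap test is the standard temperature replica exchange Metropolis criterion (with k_B = 1), swapped in for simplicity; RESTMD uses its own statistical-temperature-based criterion, which this record does not spell out. All names (`run_replica`, `swap_probability`, `exchange`, `simulate`) and the toy energy E = x^2 are hypothetical.

```python
import math
import random


def run_replica(replica, steps, rng):
    """Map-like task: advance one replica independently of all others.

    Stand-in for an MD segment: a Metropolis random walk on E(x) = x^2.
    """
    x, temp = replica["x"], replica["temperature"]
    for _ in range(steps):
        trial = x + rng.uniform(-0.5, 0.5)
        delta = trial * trial - x * x
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            x = trial
    return {"temperature": temp, "x": x, "energy": x * x}


def swap_probability(e_i, t_i, e_j, t_j):
    """Standard temperature replica exchange acceptance (assumption, k_B = 1):
    min(1, exp[(1/T_i - 1/T_j) * (E_i - E_j)])."""
    delta = (1.0 / t_i - 1.0 / t_j) * (e_i - e_j)
    return 1.0 if delta >= 0 else math.exp(delta)


def exchange(replicas, rng, offset=0):
    """Reduce-like step: attempt swaps between neighboring temperatures."""
    ordered = sorted(replicas, key=lambda r: r["temperature"])
    for i in range(offset, len(ordered) - 1, 2):
        a, b = ordered[i], ordered[i + 1]
        p = swap_probability(a["energy"], a["temperature"],
                             b["energy"], b["temperature"])
        if rng.random() < p:
            # Swap configurations; each slot keeps its temperature.
            a["x"], b["x"] = b["x"], a["x"]
            a["energy"], b["energy"] = b["energy"], a["energy"]
    return ordered


def simulate(temperatures, cycles=10, steps=100, seed=0):
    """Alternate independent replica runs with neighbor exchange attempts."""
    rng = random.Random(seed)
    replicas = [{"temperature": t, "x": 0.0, "energy": 0.0}
                for t in temperatures]
    for cycle in range(cycles):
        # Map phase: no replica depends on another, so these tasks
        # could be distributed across workers.
        replicas = [run_replica(r, steps, rng) for r in replicas]
        # Reduce phase: alternate even and odd neighbor pairs each cycle.
        replicas = exchange(replicas, rng, offset=cycle % 2)
    return replicas


if __name__ == "__main__":
    for r in simulate([1.0, 2.0, 4.0]):
        print(r["temperature"], round(r["energy"], 4))
```

Because the replica runs share no state between exchange attempts, the map phase is loosely coupled; only the short exchange step needs to see results from more than one replica.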
Recommended Citation
R. Platania et al., "Hadoop-based Replica Exchange Over Heterogeneous Distributed Cyberinfrastructures," Concurrency and Computation: Practice and Experience, vol. 29, no. 4, article no. e3878, Wiley, Feb 2017.
The definitive version is available at https://doi.org/10.1002/cpe.3878
Department(s)
Computer Science
Publication Status
Free Access
Keywords and Phrases
distributed cyberinfrastructure; enhanced conformational sampling; GENI; Hadoop MapReduce; replica exchange; replica exchange statistical temperature molecular dynamics (RESTMD)
International Standard Serial Number (ISSN)
1532-0634; 1532-0626
Document Type
Article - Journal
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2024 Wiley, All rights reserved.
Publication Date
25 Feb 2017
Comments
National Science Foundation, Grant 1341008