Parallel and distributed systems may operate in an environment that undergoes unpredictable changes, causing certain system performance features to degrade. Such systems need robustness to guarantee limited degradation despite fluctuations in the behavior of their component parts or environment. This research investigates the robustness of an allocation of resources to tasks in parallel and distributed systems. The main contributions are 1) a mathematical description of a metric for the robustness of a resource allocation with respect to desired system performance features against multiple perturbations in multiple system and environmental conditions, and 2) a procedure for deriving a robustness metric for an arbitrary system. For illustration, this procedure is employed to derive robustness metrics for three example distributed systems. Such a metric can help researchers evaluate a given resource allocation for robustness against uncertainties in specified perturbation parameters.
S. Ali et al., "Measuring the Robustness of a Resource Allocation," IEEE Transactions on Parallel and Distributed Systems, Institute of Electrical and Electronics Engineers (IEEE), Jan 2004.
The definitive version is available at http://dx.doi.org/10.1109/TPDS.2004.24
Electrical and Computer Engineering
Keywords and Phrases
65; Robustness; Distributed Processing; Distributed System; Mathematical Description; Parallel and Distributed Systems; Parallel System; Perturbation Parameter; Resource Allocation; Resource Management Systems; Robustness Metric; Stability
Article - Journal
© 2004 Institute of Electrical and Electronics Engineers (IEEE), All rights reserved.