Performability analysis of highly available clusters with break-downs and deferred repairs.

Ever, Enver, Gemikonakli, Orhan ORCID logoORCID: https://orcid.org/0000-0002-0513-1128 and Chakka, Ram (2006) Performability analysis of highly available clusters with break-downs and deferred repairs. In: HET-NETs '06 : 4th International working conference on performance modelling and evaluation of heterogeneous networks., 11th-13th Sept. 2006, Ilkley, West Yorkshire, U.K. . [Conference or Workshop Item]

Abstract

Fault-tolerant systems with repair-upon-failure strategy can become expensive in terms of labour and time. While postponing these repairs, it is essential to keep the whole system capable to deal with user requests. For this purpose, usually, a threshold value is defined which represents the minimum number of servers the system administrator should keep operative. Highly available multiprocessor systems with one head and several computation nodes is a common configuration in various cluster systems used as a low-cost alternative to supercomputers. It is typical to introduce a redundant head for such systems to improve availability. Deferred repairs can be used for such systems for reducing repair costs when no permanent repair facility exists on premises. Performability evaluation of such systems is very important since the systems are fault tolerant. In this paper, the performance modelling for highly available multiprocessor systems is presented. For these systems, one main and several identical computing nodes serving the same stream of arriving jobs is considered. To improve the availability of the system, the head node is backed-up. To account for delays due to switching of head node, such systems are modelled and solved for exact performability measures for both bounded and unbounded queuing systems assuming a deferred repair strategy.

Item Type: Conference or Workshop Item (Paper)
Research Areas: A. > School of Science and Technology > Computer and Communications Engineering
A. > School of Science and Technology > Computer Science > SensoLab group
Item ID: 1741
Useful Links:
Depositing User: Enver Ever
Date Deposited: 30 Mar 2009 17:02
Last Modified: 08 Jul 2019 13:40
URI: https://eprints.mdx.ac.uk/id/eprint/1741

Actions (login required)

View Item View Item

Statistics

Activity Overview
6 month trend
0Downloads
6 month trend
564Hits

Additional statistics are available via IRStats2.