Performability analysis of highly available clusters with break-downs and deferred repairs.

Ever, Enver and Gemikonakli, Orhan and Chakka, Ram (2006) Performability analysis of highly available clusters with break-downs and deferred repairs. In: HET-NETs '06 : 4th International working conference on performance modelling and evaluation of heterogeneous networks., 11th-13th Sept. 2006, Ilkley, West Yorkshire, U.K.

Full text is not in this repository.

Official URL: http://www.comp.brad.ac.uk/het-net/tutorials/P05.p...

Abstract

Fault-tolerant systems with repair-upon-failure strategy can become expensive in terms of labour and time. While postponing these repairs, it is essential to keep the whole system capable to deal with user requests. For this purpose, usually, a threshold value is defined which represents the minimum number of servers the system administrator should keep operative. Highly available multiprocessor systems with one head and several computation nodes is a common configuration in various cluster systems used as a low-cost alternative to supercomputers. It is typical to introduce a redundant head for such systems to improve availability. Deferred repairs can be used for such systems for reducing repair costs when no permanent repair facility exists on premises. Performability evaluation of such systems is very important since the systems are fault tolerant. In this paper, the performance modelling for highly available multiprocessor systems is presented. For these systems, one main and several identical computing nodes serving the same stream of arriving jobs is considered. To improve the availability of the system, the head node is backed-up. To account for delays due to switching of head node, such systems are modelled and solved for exact performability measures for both bounded and unbounded queuing systems assuming a deferred repair strategy.

Item Type:Conference or Workshop Item (Paper)
Research Areas:School of Science and Technology > Computer and Communications Engineering
School of Science and Technology > Computer Science > SensoLab group
ID Code:1741
Useful Links:
Deposited On:30 Mar 2009 17:02
Last Modified:14 Oct 2014 19:09

Repository staff only: item control page

Full text downloads (NB count will be zero if no full text documents are attached to the record)

Downloads per month over the past year