A mathematical model for highly available clusters with one head and several identical computing nodes.

Ever, Enver and Gemikonakli, Orhan and Chakka, Ram (2006) A mathematical model for highly available clusters with one head and several identical computing nodes. In: UKSIM 2006: 9th International conference on computer modelling and simulation. Al-Begain, Khalid and Al-Dabass, David and Orsoni, Alessandra, eds. United Kingdom Society for Modelling and Simulation, Oxford, pp. 32-37. ISBN 9780951650929

Full text is not in this repository.

Official URL: http://www.comp.glam.ac.uk/staff/kbegain/UKSim06/U...

Abstract

Fault-tolerant systems with repair-upon-failure strategy can become expensive in terms of labour and time. While postponing these repairs, it is essential to keep the whole system capable to deal with user requests. For this purpose, usually, a threshold value is defined which represents the minimum number of servers the system administrator should keep operative. Highly available multiprocessor systems with one head and several computation nodes is a common configuration in various cluster systems used as a low-cost alternative to supercomputers. It is typical to introduce a redundant head for such systems to improve availability. Deferred repairs can be used for such systems for reducing repair costs when no permanent repair facility exists on premises. Performability evaluation of such systems is very important since the systems are fault tolerant. In this paper, the performance modelling for highly available multiprocessor systems is presented. For these systems, one main and several identical computing nodes serving the same stream of arriving jobs is considered. To improve the availability of the system, the head node is backed-up. To account for delays due to switching of head node, such systems are modelled and solved for exact performability measures for both bounded and unbounded queuing systems assuming a deferred repair strategy.

Item Type:Book Section
Additional Information:

Conference held at Oriel College, Oxford, 4th - 6th April 2006.

Research Areas:Middlesex University Schools and Centres > School of Science and Technology > Computer and Communications Engineering
Middlesex University Schools and Centres > School of Science and Technology > Computer Science > SensoLab group
ID Code:1739
Useful Links:
Deposited On:30 Mar 2009 16:11
Last Modified:24 Oct 2014 15:33

Repository staff only: item control page

Full text downloads (NB count will be zero if no full text documents are attached to the record)

Downloads per month over the past year