Evaluating availability of UPS architecture


A new white paper from Centiel has examined different UPS architectures and topologies to compare availability

Ensuring a continuous supply of electrical power is vital in safety-critical environments such as hospitals, data centres and commercial institutions, where even the shortest of duration interruption may cause significant financial losses or even endanger lives. An uninterruptible power supply system is used to provide power when the main source is interrupted or fails. It also ensures a high level of power quality. Therefore, ensuring the highest possible level of availability of a UPS system is of paramount importance.

Redundancy is the most common way to improve the availability of a UPS system. However, different architectural solutions with the same number of redundant components may result in significantly different levels of availability. 

The white paper, entitled Evaluating Availability of Centiel UPS Architecture, which was developed through a collaboration between Università della Svizzera Italiana, Lugano and ETH Zurich, can be downloaded in full at centiel.co.uk. In the white paper, different UPS architectures were examined to compare availability.

Availability is defined as readiness of a system to provide a corrective service. Steady-state availability is the most commonly used metric for availability quantification and is defined as the fraction of time a system is operational during its expected lifetime. 

Availability equals mean time to failure (MTTF) divided by MTTF plus mean time to repair (MTTR).

MTTF is the mean time between two consecutive failures, whereas mean time to repair is the mean time needed for the repair. In other words, MTTF is the average time during which the system is up after it has been repaired and before it fails again. 

To make the comparison between architectures straightforward, steady-state availability is usually expressed with a number of nines. For example, ‘five nines’ availability means that steady-state availability is 0.99999 or 99.999% (see Figure 1). 

Figure 1: A simplified reliability block diagram of a UPS module

A single UPS unit comprises of a rectifier, battery and inverter. The mean time to failure for the rectifier is an average of 50,000 hours, the battery 100,0000 hours and the inverter 50,000 hours. Because the availability of the individual components were known, the authors used a hierarchical modelling tool called SHARPE, to calculate the mean time to failure of a single UPS unit as 20,000 hours (more details about this calculation can be found in the white paper). 

This assumes mean time to repair is six hours and so the availability of a single standalone UPS unit is 0.997. Therefore, in one year the expected downtime for a single UPS is 2 hours 37 minutes.

To improve the availability of a single UPS unit, it can be placed in parallel with the main power source using a static bypass switch (SBS). Then, if any of the UPS components fail, power can be switched, ensuring there is no interruption. The authors compared the availability of a UPS with the main power source and static bypass switch (see Figure 2).

Figure 2: A reliability block diagram of a UPS with the main source

By calculation, a customer using only the mains supply without a UPS system will experience 17.5 hours of power interruption per year, compared with a single UPS module where interruptions are 2.6 hours per year. Including the static bypass switch, availability is significantly improved to just one second of annual downtime or availability of 0.99998. 

To get higher power outputs, several UPS units must be combined. Next the authors analysed these architectures.

A system with higher output requires several single UPS units to be connected in parallel to the main power, over a static bypass switch. The UPS units must also be connected to each other to ensure synchronous operation. Communication is established via a parallel bus (PBUS). Despite the fact that the parallel bus is a highly reliable component, in a typical configuration, failure of any of the buses will cause the entire system to fail. Availability of a system example of 160kVA output therefore, has a reduced availability of 0.999993 equivalent to three minutes and 40 seconds per year.

To boost dependability, redundant UPS units are introduced, known as n+1 configuration. Here, the system may continue operation with the remaining n modules. The MTTR of a modular architecture decreases to only 30 minutes. Such systems may tolerate failure of a single UPS unit. However, again failure of any of the parallel buses makes the entire system fail. 

Centiel’s n+1 modular architecture introduces numerous changes to improve the overall dependability. Specifically the control logic allows the communication between UPS units to be maintained even when one of the parallel buses has failed. 

In addition, the MTTR of a single UPS is decreased by placing the static bypass switch fuse on each module at the frame level, out of the module. In this way, the fuse may be replaced without the need of pulling-out or opening the entire module. The authors assumed the MTTR of the fuse is five minutes and the fuse causes 5% of the overall module failures.

In Centiel’s architecture, there are changes that increase system’s MTTF and decrease MTTR. In this white paper, the authors only consider two: the improved parallel bus configuration and isolating the static bypass switch module fuse. 

In this way, Centiel’s architecture is claimed to outperform the one with typical n+1 architecture, as it increases from six and a half to even 10 and a half nines. Annual downtime is decreased from 12 seconds to 0.0006 seconds. 

However, the 1+1 configuration also has a limited output of 40kVA. As the output increases, availability of both architectures slightly decreases but the level of availability improvement with the Centiel architecture (when compared with a typical one with the same level of redundancy) gets even higher. For example, when the output is 160kVA (4+1 configuration), a typical architecture has availability of six nines (annual downtime is 32 seconds), whereas availability with the Centiel architecture is nine nines or 0.03 seconds downtime per year.

 In conclusion, with an advanced architecture it is possible to improve availability by multiple orders of magnitude.


Please enter your comment!
Please enter your name here