Skip to main content

A robust system reliability analysis using partitioning and parallel processing of Markov chain

  • Po Ting Lin (a1), Yu-Cheng Chou (a2), Yung Ting (a3), Shian-Shing Shyu (a4) and Chang-Kuo Chen (a4)...

This paper presents a robust reliability analysis method for systems of multimodular redundant (MMR) controllers using the method of partitioning and parallel processing of a Markov chain (PPMC). A Markov chain is formulated to represent the N distinct states of the MMR controllers. Such a Markov chain has N2 directed edges, and each edge corresponds to a transition probability between a pair of start and end states. Because N can be easily increased substantially, the system reliability analysis may require large computational resources, such as the central processing unit usage and memory occupation. By the PPMC, a Markov chain's transition probability matrix can be partitioned and reordered, such that the system reliability can be evaluated through only the diagonal submatrices of the transition probability matrix. In addition, calculations regarding the submatrices are independent of each other and thus can be conducted in parallel to assure the efficiency. The simulation results show that, compared with the sequential method applied to an intact Markov chain, the proposed PPMC can improve the performance and produce allowable accuracy for the reliability analysis on large-scale systems of MMR controllers.

Corresponding author
Reprint requests to: Yu-Cheng Chou, Institute of Undersea Technology, National Sun Yat-sen University, 70 Lienhai Road, Kaohsiung City 80424, Taiwan. E-mail:
Hide All
Aldemir T., Miller D., Stovsky M., Kirschenbaum J., & Buccim P. (2006). Current State of Reliability Modeling Methodologies for Digital Systems and Their Acceptance Criteria for Nuclear Power Plant Assessments. Washington, DC: US Nuclear Regulatory Commission.
Bhaduri D., Shukla S.K., Graham P.S., & Gokhale M.B. (2007). Reliability analysis of large circuits using scalable techniques and tools. IEEE Transactions on Circuits and Systems I 54(11), 24472460.
Cai B., Liu Y., Liu Z., Tian X., Li H., & Ren C. (2012). Reliability analysis of subsea blowout preventer control systems subjected to multiple error shocks. Journal of Loss Prevention in the Process Industries 25, 10441054.
Cannon L.E. (1969). A cellular computer to implement the Kalman filter algorithm. PhD Thesis. Montana State University.
Chou Y.-C., & Lin P.T. (2014). An efficient and robust design optimization of multi-state flow network for multiple commodities using generalized reliability evaluation algorithm and edge reduction method. International Journal of Systems Science. Advance online publication. doi:10.1080/00207721.2013.879228
Dominguez-Garcia A.D., Kassakian J.G., & Schindall J.E. (2006). Reliability evaluation of the power supply of an electrical power net for safety-relevant applications. Reliability Engineering & System Safety 91(5), 505514.
Grama A., Gupta A., Karypis G., & Kumar V. (2004). Introduction to Parallel Computing. Essex: Pearson Education.
Gropp W. (2002). MPICH2: A new start for MPI implementations. Proc. 9th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface.
Gropp W., Huss-Lederman S., Lumsdaine A., Lusk E., Nitzberg B., Saphir W., & Snir M. (1998). MPI: The Complete Reference—The MPI-2 Extensions. Cambridge, MA: MIT Press.
Guo H., & Yang X. (2008). Automatic creation of Markov models for reliability assessment of safety instrumented systems. Reliability Engineering & System Safety 93(6), 829837.
Han J., Gao J., Jonker P., Qi Y., & Fortes J.A. (2005). Toward hardware-redundant, fault-tolerant logic for nanoelectronics. IEEE Design & Test of Computers 22(4), 328339.
Kim H., Lee H., & Lee K. (2005). The design and analysis of AVTMR (all voting triple modular redundancy) and dual-duplex system. Reliability Engineering & System Safety 88(3), 291300.
Lin P.T., Chou Y.-C., Manuel M.C.E., & Hsu K.S. (2014). Investigation of numerical performance of partitioning and parallel processing of Markov chain (PPMC) for complex design problems. Proc. ASME 2014 Int. Design & Engineering Technical Confs. and Computers & Information in Engineering Conf., IDETC/CIE 2014, Paper No. DETC2014-34652, Buffalo, NY.
Lisnianski A., Elmakias D., Laredo D., & Ben Haim H. (2012). A multi-state Markov model for a short-term reliability analysis of a power generating unit. Reliability Engineering & System Safety 98(1), 16.
Liu Y., & Rausand M. (2011). Reliability assessment of safety instrumented systems subject to different demand modes. Journal of Loss Prevention in the Process Industries 24(1), 4956.
Liu Z., Liu Y., Cai B., Liu X., Li J., Tian X., & Ji R. (2013). RAMS analysis of hybrid redundancy system of subsea blowout preventer based on stochastic Petri nets. International Journal of Security and Its Applications 7(4), 159166.
Liu Z., Ni X., Liu Y., Song Q., & Wang Y. (2011). Gastric esophageal surgery risk analysis with a fault tree and Markov integrated model. Reliability Engineering & System Safety 96(12), 15911600.
Matthews P., & Philip A. (2012). Bayesian project diagnosis for the construction design process. Artificial Intelligence for Engineering Design, Analysis and Manufacturing 26(4), 375391.
Mutha C., Jensen D., Tumer I., & Smidts C. (2013). An integrated multidomain functional failure and propagation analysis approach for safe system design. Artificial Intelligence for Engineering Design, Analysis and Manufacturing 27(4), 317347.
Parashar B., & Taneja G. (2007). Reliability and profit evaluation of a PLC hot standby system based on a master-slave concept and two types of repair facilities. IEEE Transactions on Reliability 56(3), 534539.
Snir M., Otto S., Huss-Lederman S., Walker D., & Dongarra J. (1998). MPI: The Complete Reference—The MPI Core. Cambridge, MA: MIT Press.
Soro I.W., Nourelfath M., & Ait-Kadi D. (2010). Performance evaluation of multi-state degraded systems with minimal repairs and imperfect preventive maintenance. Reliability Engineering & System Safety 95(2), 6569.
Stewart W.J. (2009). Probability, Markov Chains, Queues, and Simulation: The Mathematical Basis of Performance Modeling. Princeton, NJ: Princeton University Press.
Wang S., Ji Y., Dong W., & Yang S. (2007). Design and RAMS analysis of a fault-tolerant computer control system. Tsinghua Science & Technology 12(Suppl. 1), 116121.
Yu H., Chu C., Châtelet Ė., & Yalaoui F. (2007). Reliability optimization of a redundant system with failure dependencies. Reliability Engineering & System Safety 92(12), 16271634.
Zhang C.W., Zhang T., Chen N., & Jin T. (2013). Reliability modeling and analysis for a novel design of modular converter system of wind turbines. Reliability Engineering & System Safety 111, 8694.
Zhang T., Long W., & Sato Y. (2003). Availability of systems with self-diagnostic components-applying Markov model to IEC 61508-6. Reliability Engineering & System Safety 80(2), 133141.
Recommend this journal

Email your librarian or administrator to recommend adding this journal to your organisation's collection.

  • ISSN: 0890-0604
  • EISSN: 1469-1760
  • URL: /core/journals/ai-edam
Please enter your name
Please enter a valid email address
Who would you like to send this to? *



Full text views

Total number of HTML views: 2
Total number of PDF views: 17 *
Loading metrics...

Abstract views

Total abstract views: 171 *
Loading metrics...

* Views captured on Cambridge Core between September 2016 - 18th November 2017. This data will be updated every 24 hours.