Method for assessing of reliability characteristics in designing of failure-resistant real-time operating systems

Authors

DOI:

https://doi.org/10.15276/opu.2.61.2020.13

Keywords:

Real-time operating system, fault tolerance, operating system reliability criterion, fault tolerance grade, operating system with a monolithic kernel, operating system self-healing, recovery agent

Abstract

For many years, real-time OS-based applications have been used in embedded special-purpose systems. Recently they have been used everywhere, from on-board control systems for aircraft, to household appliances. The development of multiprocessor computing systems usually aims to increase either the level of reliability or the level of system performance to values that are inaccessible or difficult to implement in traditional computer systems. In the first case, the question of the availability of special means of ensuring the fault tolerance of computer systems arises, the main feature (and advantage) of which is the absence of any single resource, failure of which leads to a fatal failure of the entire system. The use of a real-time operating system is always associated with equipment, with an object and with events occurring at an object. A real-time system, as a hardware-software complex, includes sensors that record events at an object, input / output modules that convert sensor readings into a digital form suitable for processing these readings on a computer, and finally, a computer with a program that responds to events occurring at the facility. The RTOS is focused on processing external events. It is this that leads to fundamental differences (compared with general-purpose OS) in the structure of the system as well as in the functions of the kernel and in the construction of the input-output system. The RTOS can be similar in its user interface to general-purpose operating systems, but it is completely different in its structure. In addition, the use of RTOS is always specific. If users (not developers) usually perceive a general-purpose OS as a ready-made set of applications, then the RTOS serves only as a tool for creating a specific hard-ware-software complex in real time. Therefore, the widest class of users of RTOS is the developers of real-time complexes, people designing control and data collection systems. When designing and developing a specific real-time system, the programmer always knows exactly what events can occur at the facility, and he knows the critical terms for servicing each of these events. We call a real-time system (SRV) a hard-ware-software complex that responds in predictable times to an unpredictable stream of external events. The system must have time to re-spond to the event that occurred at the facility, during the time critical for this event. The critical time for each event is determined by the object and by the event itself, and, of course, it can be different, but the response time of the system must be predicted (calculated) when creating the system. Lack of response at the predicted time is considered an error for real-time systems. The system must have time to re-spond to simultaneously occurring events. Even if two or more external events occur simultaneously, the system must have time to respond to each of them during time intervals critical for these events. In this study, as part of a network fault-tolerant technology, the RTOS be-comes a special type of control software that is used to organize the operation of embedded applications, which are characterized by limited memory resources, low productivity and the requirements of a guaranteed response time (T<4 μs), high level availability and availability of auto-monitoring facilities.

Downloads

Download data is not yet available.

References

Swift M., Bershad B., Levy H. Improving the Reliability of Commodity Operating Systems. ACM Transactions on Computer Systems. 2005. Vol. 23, No. 1. P. 77–110.

Cordy J., Shukla M. Practical metaprogramming. IBM Centre for Advanced Studies. 1992. P. 2–9.

Reekie H., Hylands C., Lee E. Tcl and Java Performance. University of California at Berkley. 1998.

Krall A. Efficient JavaVM Just-in-Time Compilation. PACT, 1998, 205 p.

Muller G., Moura B., Bellard F., Consel C. JIT vs Offline Compilers: Limits and Benefits of Bytecode Compilation. IRISA. 1997. PI 1063.

Leroy X. Java Bytecode Verification: Algorithms and Formalizations. JOAR. 2005. Vol. 30. P. 3–9.

An experimental Study of Soft Errors in Microprocessors / G.P. Saggese, N.J. Wang, Z.T. Kalbarczyk et al. IEEE Micro. 2005. V. 25. №6. P.30–39. DOI: 10.1109/MM.2005.104.

Basili V.R., Perricone B.T. Software errors and complexity: an empirical investigation. ACM 27. 1984. P. 42–52.

Suri N., Valter C.J., Hugu M.M. Advances in Ultra Dependable Distributed Systems. Computer Society Press. 1995. P. 56–61.

Таненбаум Э., Вудхалл А. Операционные системы. Разработка и реализация. 3-е изд. СПб : Пи-тер, 2007. C. 254–256.

Таненбаум Э., Бос Э. Современные операционные системы. 4-е изд. СПб : Питер, 2015. 457 с.

Назаров С.В., Широков А.И. Технологии многопользовательских операционных систем. М. : Изд. Дом МИСиС, 2012. C. 98–101.

Назаров С.В., Вилкова Н.Н. Структурный рефакторинг многослойных программных систем. Ин-формационные технологии и вычислительные системы. 2016. № 4. С. 13–23. URL: https://elibrary.ru/item.asp?id=27656660 (дата звернення: 05.01.2020).

Кельберт М.Я., Сухов Ю.М. Вероятность и статистика в примерах и задачах. Том 2. Марковские цепи как отправная точка теории случайных процессов и их приложения. М. : МЦНМО. 2009. C. 145–147.

Назаров С.В. Эффективность современных операционных систем. Современные информацион-ные технологиии ИТ-образование. 2017. Т.13, №2. С. 9–24. DOI: https://doi.org/10.25559/SITITO.2017.2.229

Василенко Н.В., Макаров В.А. Модели оценки надежности программного обеспечения. Вестник Новгородского государственного университета. Серия: Технические науки. 2004. № 28. С. 126–132. URL: https://elibrary.ru/item.asp?id=18184720 (дата звернення: 05.01.2020).

Таха Х. Введение в исследование операций. 7-еизд. М. : Вильямс, 2005. C. 34–67.

Мартышкин А.И. Математическая модель диспетчера задач с общей очередью для систем парал-лельной обработки. Современные методы и средства обработки пространственно-временных сигналов: сборник статей XI Всероссийской научно-технической конференции. Пенза : ПДЗ, 2013. С. 87–91.

Андреев А.М., Можаров Г.П., Сюзев В.В. Многопроцессорные вычислительные системы: теоре-тический анализ, математические модели и применение. Москва, Изд-во МГТУ им. Н.Э. Баума-на, 2011, C. 124–156.

Воеводин В.В. Параллельные вычисления. СПб. : БХВ – Петербург, 2002. C. 145–152.

Downloads

Published

2020-07-12

How to Cite

[1]
Lopakov, O., Salii, V., Shvahirev, P. and Kosmachevskiy, V. 2020. Method for assessing of reliability characteristics in designing of failure-resistant real-time operating systems. Proceedings of Odessa Polytechnic University. 2(61) (Jul. 2020), 108–118. DOI:https://doi.org/10.15276/opu.2.61.2020.13.

Issue

Section

Informacion technology. Automation

Most read articles by the same author(s)