An Operating-System-Level Framework for Providing Application-Aware Reliability
Wang, Long; Kalbarczyk, Zbigniew; Gu, Weining; Iyer, Ravi
Permalink
https://hdl.handle.net/2142/99566
Description
- Title
- An Operating-System-Level Framework for Providing Application-Aware Reliability
- Author(s)
- Wang, Long
- Kalbarczyk, Zbigniew
- Gu, Weining
- Iyer, Ravi
- Issue Date
- 2006-09
- Keyword(s)
- Operating system
- Reliability
- Application aware
- Application specific
- Hang detection
- Checkpointing
- Application transparent
- Abstract
- Operating systems can collect and extract rich information about application execution characteristics, including program counter traces, memory access patterns, and operating-system-generated signals. This information can be exploited to design highly efficient, application-aware reliability mechanisms that are transparent to applications. This paper describes the Reliability MicroKernel (RMK) framework, a loadable kernel module that provides application-aware reliability and allows the reliability mechanisms installed in it to be configured dynamically. The RMK prototype is implemented in Linux and supports detection of application and OS failures as well as transparent application checkpointing. Experimental results show that OS hang detection and application hang detection, which exploit characteristics of application and system behavior, can achieve 100% coverage with low false positive rates. Moreover, the performance overhead of RMK and the detection and checkpointing mechanisms is small (0.6% for application hang detection and 0.1% for transparent application checkpointing in the experiments).
- Publisher
- Coordinated Science Laboratory, University of Illinois at Urbana-Champaign
- Series/Report Name or Number
- Coordinated Science Laboratory Report no. UILU-ENG-06-2218, CRHC-06-12
- Type of Resource
- text
- Language
- en
- Sponsor(s)/Grant Number(s)
- National Science Foundation / CNS-04-06351, CNS-05-24695, and ACI-0121658 ITR/AP
- Gigascale Systems Research Center/MARCO
- Motorola