Design and Analysis of Fault-Tolerant Processor Arrays for Numerical Applications
Chen, Chien-Yi
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/69573
Description
Title
Design and Analysis of Fault-Tolerant Processor Arrays for Numerical Applications
Author(s)
Chen, Chien-Yi
Issue Date
1987
Department of Study
Computer Science
Discipline
Computer Science
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
Ph.D.
Degree Level
Dissertation
Keyword(s)
Computer Science
Abstract
The availability of fast devices in low cost and high density technologies promises a major breakthrough in future supercomputer designs, especially in the design of highly concurrent processors. Locally interconnected processor arrays, such as systotic arrays, are well suited to efficiently implement a major class of numerical algorithms due to their massive parallelism and regular structure. However, the successful operation of a processor array depends very much on the correctness of all the processing elements in the array. Any single faulty processor may easily jeopardize the results of the whole system. This makes the fault tolerance a very important issue in the design of processor arrays, and is the subject of this thesis.
In the first part of this thesis, a fault tolerance scheme using an encoding based on the linear property is proposed; this can be applied to a class of processor arrays where each processor in the regular part of the array is a linear system. Many algorithms in digital signal processing and matrix operations are shown to be mapped to systems which belong to this class.
The computations of eigenvalues and singular values are key to many applications including signal and image processing. In the second part of this thesis, fault tolerance schemes are proposed for the computation of eigenvalues and singular values on several high-performance processor arrays which were proposed recently. Special properties of each algorithm are used to perform the concurrent error detection. Since, in most cases, the data encoding is not necessary, the introduced overhead is extremely low in terms of both hardware and time redundancy.
In the next section, a novel concurrent error detection technique using residue codes is proposed, which can be applied to the processor arrays derived from signal flow graphs.
After detecting an error, fault location is performed either through some special algorithms or through the use of time redundancy. Then, the reconfiguration process isolates the faulty unit and allows the system to resume its normal operations. In the final part, efficient reconfiguration techniques are proposed, which can be applied to a wide range of architectures, including binary trees, rectangular arrays, and shuffle-exchange processor arrays.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.