Performance guarantees for deadline-driven MapReduce jobs under failure
Faghri, Faraz
Loading…
Permalink
https://hdl.handle.net/2142/45663
Description
Title
Performance guarantees for deadline-driven MapReduce jobs under failure
Author(s)
Faghri, Faraz
Issue Date
2013-08-22T16:57:06Z
Director of Research (if dissertation) or Advisor (if thesis)
Beck, Carolyn L.
Department of Study
Industrial & Enterprise Systems Engineering
Discipline
Industrial Engineering
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
M.S.
Degree Level
Thesis
Keyword(s)
Performance of systems
Service Level Objectives
Fault tolerance
Cloud computing
Hadoop
MapReduce
Abstract
Increasingly, large systems and data centers are being built in a 'scale out' manner, i.e. using large numbers of commodity hardware components, instead of traditional 'scale up' using expensive, specialized equipment. However, large numbers of commodity components imply higher rates of failure across such systems. Such failures can cause applications to miss their deadlines for task completion. For this reason, cloud service providers and cloud applications must anticipate failures and engineer their services accordingly.
In this thesis, we first analyze the availability of a commodity data center designed for MapReduce applications. MapReduce is increasingly used in industry for efficient large scale data processing tasks including personal advertising, spam detection, as well as data mining. We show how MapReduce software level fault tolerance can be used to achieve the same availability as scale up data centers. Second, we extend existing job schedulers for deadline-driven jobs to handle machine and software failures and satisfy the service level objectives.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.