Extracting and utilizing hidden structures in large datasets
Gao, Yihan
Loading…
Permalink
https://hdl.handle.net/2142/104764
Description
Title
Extracting and utilizing hidden structures in large datasets
Author(s)
Gao, Yihan
Issue Date
2019-03-26
Director of Research (if dissertation) or Advisor (if thesis)
Parameswaran, Aditya
Doctoral Committee Chair(s)
Parameswaran, Aditya
Committee Member(s)
Chang, Kevin
Sundaram, Hari
Wang, Jiannan
Department of Study
Computer Science
Discipline
Computer Science
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
Ph.D.
Degree Level
Dissertation
Keyword(s)
Structure Extraction
Automatic Data Processing
Abstract
The hidden structure within datasets --- capturing the inherent structure within the data not explicitly captured or encoded in the data format --- can often be automatically extracted and used to improve various data processing applications. Utilizing such hidden structure enables us to potentially surpass traditional algorithms that do not take this structure into account. In this thesis, we propose a general framework for algorithms that automatically extract and employ hidden structures to improve data processing performance, and discuss a set of design principles for developing such algorithms. We provide three examples to demonstrate the power of this framework in practice, showcasing how we can use hidden structures to either outperform state-of-the-art methods, or enable new applications that are previously impossible. We believe that this framework can offer new opportunities for the design of algorithms that surpass the current limit, and empower new applications in database research and many other data-centric disciplines.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.