This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/81869
Description
Title
Large-Scale Constraint-Based Pattern Mining
Author(s)
Zhu, Feida
Issue Date
2009
Doctoral Committee Chair(s)
Jeff Erickson
Han, Jiawei
Department of Study
Computer Science
Discipline
Computer Science
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
Ph.D.
Degree Level
Dissertation
Keyword(s)
Computer Science
Language
eng
Abstract
We studied the problem of constraint-based pattern mining for three different data formats, item-set, sequence and graph, and focused on mining patterns of large sizes. Colossal patterns in each data formats are studied to discover pruning properties that are useful for direct mining of these patterns. For item-set data, we observed robustness of colossal patterns. By defining the concept of core patterns, we developed a randomized mining framework to efficiently find the set of colossal patterns which gives a good approximation to the complete pattern set. The essential idea of pattern fusion and leaping toward large patterns is then extended to the cases of sequential and graph data. In sequential data, we developed a novel algorithm to accommodate approximate patterns. For graph data, we proposed the concept of spiders and used these pre-computed frequent structures of small sizes to quickly leap to reach those much larger ones. We also proposed a general graph mining framework, called gPrune, to take advantage of both pattern and data space pruning. Ideas and techniques developed in this work can be extended to handle other user-specified constraints for direct efficient mining in large-scale data.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.