Learning Despite Complex Attribute Interaction: An Approach Based on Relational Operators
Perez, Eduardo
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/81874
Description
Title
Learning Despite Complex Attribute Interaction: An Approach Based on Relational Operators
Author(s)
Perez, Eduardo
Issue Date
1997
Doctoral Committee Chair(s)
Rendell, Larry A.
Department of Study
Computer Science
Discipline
Computer Science
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
Ph.D.
Degree Level
Dissertation
Keyword(s)
Artificial Intelligence
Language
eng
Abstract
When shortage of knowledge prevents human experts from choosing good attributes to represent empirical observations, learning is difficult. Expressing concepts by using only primitive (low-level) attributes is intricate because the contribution of each attribute to the definition is almost unnoticeable. This aggravates attribute interaction, a situation where an attribute's effect on classification depends on the value of other attributes. When attribute interaction is complex, involving many combinations of several attributes, current learners cannot handle it. Such complex interaction appears in real-world domains (e.g., protein folding). These domains are not random; they have structure although it may be concealed by the complexity of the interactions. Concepts in these domains often have embedded, implicit structure, which may be revealed through explicit relations. Because of the existence of structure, there is still hope for learning despite complex attribute interaction if appropriate techniques are devised. The design of these techniques, however, must be guided by a required functionality: finding interactions, more precisely, finding relations. In particular, this thesis focuses on interactions due to complex relations among primitive attributes. Then, concepts can be simple in terms of relations, but complex in terms of primitive attributes. Such focus is motivated by relations that appear in protein folding and other domains. To facilitate learning when relations create complex interactions, an approach based on the algebraic notions of relation and relational operators is proposed. The approach is implemented in MRP, a learning system that relies on multidimensional relational projection to find relations and hence, find interactions that can be captured as relations among attributes. Synthetic and real-world problems are used to empirically evaluate MRP with respect to five classical and advanced machine learning systems. MRP's distinctive behavior is analyzed in terms of concept characteristics (such as DNF size, entropy, variation, and Fourier spectrum), and related to the system's performance in real-world domains. Finally, a small family of simplified versions of MRP is evaluated empirically to analyze what components of MRP are responsible for its improved performance, and this analysis is used to focus the proposal for future research directions and system extensions.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.