This study proposes and evaluates a document analysis strategy for information
retrieval with visualization interfaces. The goal of document analysis is to highlight
structure that helps searchers make their own relevance judgments, rather than to shift
judgments from humans onto machines. Searchers can investigate that structure with
tools for visualizing multidimensional data.
The structure of interest in this study is discrimination of documents into clusters.
Two diagnostic measures may inform selection of document attributes for cluster
discrimination: term discrimination value and the sum of pairwise term-vector correlations. A series of experiments tests the reliability of these measures for predicting clustering tendency, as measured by proportion of elongated triples and skewness of the distribution of document dissimilarities.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.