Mapping Genre at the Page Level in English-Language Volumes from HathiTrust, 1700-1899
Underwood, Ted; Ballard, Shawn; Black, Michael L.; Capitanu, Boris
Loading…
Permalink
https://hdl.handle.net/2142/50291
Description
Title
Mapping Genre at the Page Level in English-Language Volumes from HathiTrust, 1700-1899
Author(s)
Underwood, Ted
Ballard, Shawn
Black, Michael L.
Capitanu, Boris
Issue Date
2014-07-10
Keyword(s)
machine learning, genre, metadata, classification
Abstract
Using regularized logistic regression and hidden Markov models, we predict genre at the page level in a collection of 469,000 volumes from HathiTrust Digital Library. Accuracy is comparable to human crowdsourcing.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.