MASHing metadata: Legacy issues in OAI harvesting from three digital libraries
Author(s)
Khoo, Michael
Tudhope, Doug
Binding, Ceri
Jones, Hilary
Orrego, Ivan
Ahn, Jae-wook
Issue Date
2013-02
Keyword(s)
digital libraries
oai-pmh
knowledge management
information systems
research methods
dublin core
legacy issues
metadata harvesting
Abstract
This note reports on efforts to build a generalizable OAI-PMH workflow to retrieve metadata sets from unrelated digital libraries. This effort is part of a wider effort to build a database to aggregate metadata from different digital libraries, which can then be used as the basis for content analysis and data mining experiments with the metadata records. A pilot metadata harvest from three digital libraries using OAIPMH encountered a number of issues, arising from idiosyncratic legacy characteristics of each of the three metadata sets. In the end, the harvests had to be manually tailored to each library. OAI-PMH proved to be a useful approach, but only after communication with each digital library had identified important characteristics of each metadata set, including many legacy characteristics, which had to be accounted for in the harvest.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.