Predicting Medical Subject Headings Based on Abstract Similarity and Citations to MEDLINE Records
Kehoe, Adam K.; Torvik, Vetle I.
Permalink
https://hdl.handle.net/2142/89942
Description
Title
Predicting Medical Subject Headings Based on Abstract Similarity and Citations to MEDLINE Records
Author(s)
Kehoe, Adam K.
Torvik, Vetle I.
Issue Date
2016-06
Keyword(s)
Controlled vocabularies
Medical subject headings
Machine Learning
Curation of bibliographic databases
Abstract
"We describe a classifier-enhanced nearest neighbor approach to assigning Medical Subject Headings (MeSH) to unlabeled documents using a combination of abstract similarities and direct citations to labeled MEDLINE records. The approach frames the classification problem by decomposing it into sets of siblings in the MeSH hierarchy (e.g., training a classifier for predicting ""Heterocyclic Compounds, 2-Ring"" vs. other ""Heterocyclic Compounds""). Preliminary experiments using a small but diverse set of MeSH terms shows the highest performance when using both abstracts and citations compared to each alone, and coupled with a non-naive classifier: 90+% precision and recall with 10-fold cross-validation. NLM's Medical Text Indexer (MTI) tool achieves similar overall performance but varies more across the terms tested. For example, MTI performs better on ""Heterocyclic Compounds, 2-Ring"", while our approach performs better on Alzheimer Disease and Neuroimaging. Our approach can be applied broadly to documents with abstracts that are similar to (or cite) MEDLINE abstracts, which would help linking and searching across bibliographic databases beyond MEDLINE."
Publisher
ACM
Type of Resource
text
Language
en
Copyright and License Information
Copyright held by the authors. Publication rights licensed to ACM.