Wikipedia entity retrieval with word and entity embedding space alignment
Zhang, Hongshuo
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/110270
Description
Title
Wikipedia entity retrieval with word and entity embedding space alignment
Author(s)
Zhang, Hongshuo
Contributor(s)
Chang, Kevin
Issue Date
2021-05
Keyword(s)
Information Retrieval
Search Engine
Abstract
User queries used to search on Wikipedia might not always correspond to a Wikipedia article.
Searching for such queries on the built-in search engine for Wikipedia does not return as many
relevant Wikipedia articles as expected, since it is only text-based, and it overlooks the relationship
between different Wikipedia articles. To improve the results of the Wikipedia search engine, with the
help of the Wikipedia Link Graph, we propose a simple word – entity embedding space alignment
model for searching relevant Wikipedia articles using fringe keywords from the Computer Science
domain. By using this model, our problem can be reduced to finding the closest neighbors to a
user query translated from the word space to the entity space. Due to a lack of dataset available
for this purpose, a custom dataset constructed from Wikipedia2Vec is used to evaluate the model.
Empirically, our word – entity alignment model does not always fair better than the text-based
models.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.