Entity sematic based system for question and answer search
Qian, Yanli
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/100028
Description
Title
Entity sematic based system for question and answer search
Author(s)
Qian, Yanli
Contributor(s)
Chang, Kevin C.C.
Issue Date
2018-05
Keyword(s)
entity semantic search
pattern search
natural language processing
Abstract
Entity semantic search aims at allowing users to search entity patterns inside documents, where
entities are referred to as semantic data objects. The system is built over the open-source web server
Elasticsearch and Apache Lucene. Through performing Named Entity Recognition on plain
Q&A corpus, entities are extracted and annotated with respect to the categories they belong to.
Each entity is indexed into JSON format document following the principle of inverted index, and
the resultant documents are imported into Elasticsearch for further query operation.
A plugin is built for the purpose of clustering and ranking the query results. It contains a
RESTful handler which has a customized response handler and a request handler. Before the
start of the Elasticsearch server, the plugin will be loaded and the nature of it is to extend the
Elasticsearch runtime by adding a RESTful endpoint. The plugin will help to group the
Elasticsearch results by entity content and the results that share the same entity content will be
placed in the same cluster.
The system is visualized via a web interface. The thesis elaborates the innovativeness of
searching entity patterns inside documents, and methods or models we used to built the entire
search engine.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.