Mining social media stimulus from news article text using weakly-supervised narrative classification
Qiu, Wenda
Loading…
Permalink
https://hdl.handle.net/2142/110579
Description
Title
Mining social media stimulus from news article text using weakly-supervised narrative classification
Author(s)
Qiu, Wenda
Issue Date
2021-04-27
Director of Research (if dissertation) or Advisor (if thesis)
Han, Jiawei
Department of Study
Computer Science
Discipline
Computer Science
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
M.S.
Degree Level
Thesis
Keyword(s)
Text Mining
News Classification
Abstract
To make an accurate simulation for social media, we first need to find the stimulus in external sources. In this work, we model the stimulus mining into a narrative classification task on a news article dataset. The previous state-of-the-art text classification methods can not be directly applied here, mainly due to the following challenges we need to solve: 1) Lack of training data: the given news article data does not have labeling for narratives and we can not afford manual labeling other than a small evaluation set. 2) The complexity in narratives: narratives are defined in a more complex way comparing to the classes used in a classical news classification dataset, which stops us from using existing weakly supervised text classification methods that heavily depend on class name semantics. 3) The noisy news article dataset: the collected dataset does not guarantee the documents will belong to any of the narratives. In such cases, the power of the self-training strategy widely used in existing methods on weak supervision will be limited. To solve these challenges, we proposed a narrative decomposition and re-grouping strategy and a relevance filtering module, to fully utilize the power of weakly supervised classification methods. We conduct extensive experiments on two datasets under the background of real global events and further proposed two ways to combine different results for an optimal stimulus time-series.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.