Exploration into rare sound detection using LSTM-RNN
Chen, Yikuan
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/99977
Description
Title
Exploration into rare sound detection using LSTM-RNN
Author(s)
Chen, Yikuan
Contributor(s)
Chen, Deming
Issue Date
2018-05
Keyword(s)
LSTM
recurrent neural network
scream detection
gunshot detection
audio event detection
Abstract
Rare Audio Event Detection (AED) plays a crucial role in domestic and public security applications. The
goal of this research is to recognize key acoustic events using Long Short-Term-Memory Recurrent Neural
Network (LSTM-RNN) based classifiers. We compared different existing methods on rare sound
recognition, such as Gaussian Mixture Model (GMM), Hidden Markov Model (HMM), zero-phase signal
method and neural networks. Specifically, we investigated different neural network architectures, such as
feedforward DNN, RNN, LSTM-RNN, bi-directional RNN etc. After experimenting with different neural
network structures and different acoustic features, we propose a mixed neural network which consists of
multiple subnets, each dedicated to recognizing one type of sound. Each subnet contains multi-input layers,
feed forward layers, LSTM-RNN layer and output smoothing units. The final classification will be given
based on the output of all subnets. Different acoustic features are fed into the network at different input
layers to enhance the efficiency. Our model has exceeded the baseline performance of Detection and
Classification of Acoustic Scenes and Events (DCASE) 2017 competition. However, there still exists a
performance gap between our model and the current best model, and we are currently analyzing the
advantages and drawbacks of our model and the top-ranking model.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.