This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/103998
Description
Title
Detailed study of speech recognition
Author(s)
Chen, Brian
Contributor(s)
Levinson, Stephen
Issue Date
2019-05
Keyword(s)
Speech recognition
speech recognition classification
Abstract
This research project provides a very detailed comprehensive overview and experimentation
of the speech recognition process. Generally, speech recognition can be broken down into
three phases. The first phase is the sample and denoise stage (Endpoint Detection
Technique), which helps us collect the signals and separate background noise from the
actual information. The second phase is the feature extraction stage (Spectrogram, Filter
bank, MFCCs), which help us convert the received time domain signals to meaningful, useful
frequency domain information prior feeding into classification model. Between the second
and third phases, there is an optional phase that people often do known as the data
compression phase, which will also be discussed in this paper. Lastly the final stage is the
classification stage (KNN, CNN), which classified a specific input signal to one of the possible
classes.
On top of the different phases, we will also look into the classification results in great detail
and see whether factors such as minimum squared error between different classes or length
of the signal can play a role in the classified result.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.