Speech Bandwidth Extension Using Articulatory Features
Shin, Dongeek
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/46538
Description
Title
Speech Bandwidth Extension Using Articulatory Features
Author(s)
Shin, Dongeek
Contributor(s)
Hasegawa-Johnson, Mark
Issue Date
2011-12
Keyword(s)
speech processing
speech bandwidth extension
articulatory features
Abstract
In this thesis, we present a technique for bandwidth extension (BWE) of a narrow-band
(0 - 4 kHz) signal using articulatory features. The proposed technique recovers
high-band components (4 - 8 kHz) through Gaussian mixture regression (GMR) on
both the acoustic and articulatory features from the X-ray Microbeam (XRMB)
speech production database. The Gaussian mixture model (GMM) that is based on
acoustic and articulatory features is initialized using k-means and iteratively
trained using the expectation-maximization (EM) algorithm. BWE experiments were run
using data files from different speakers in the XRMB database as train and test
data. Time-frequency plots of speech recovered by different training methods are
presented in order to show that articulatory trajectories are helpful in characterizing
high-frequencied consonants in speech. Finally, we confirm our hypothesis that using
GMM with articulation gives better recovery rate is true by performing Student’s
t-test on SNR data between original and recovered speech.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.