Probabilistic generative modeling of speech

Zhang, Yang

Content Files

ZHANG-THESIS-2015.pdf

Permalink

https://hdl.handle.net/2142/89006

Description

Title

Probabilistic generative modeling of speech

Author(s)

Zhang, Yang

Issue Date

2015-11-24

Director of Research (if dissertation) or Advisor (if thesis)

Hasegawa-Johnson, Mark A.

Department of Study

Electrical & Computer Engineering

Discipline

Electrical & Computer Engineering

Degree Granting Institution

University of Illinois at Urbana-Champaign

Degree Name

M.S.

Degree Level

Thesis

Date of Ingest

2016-03-02T19:33:51Z

Keyword(s)

Probabilistic acoustic tube
speech modeling
speech analysis
generative model

Abstract

Speech processing refers to a set of tasks that involve speech analysis and synthesis. Most speech processing algorithms model a subset of the speech parameters of interest and blur the rest using signal processing techniques and feature extraction. However, evidence shows that many speech parameters can be estimated more accurately if they are modeled jointly; speech synthesis also benefits from joint modeling. This thesis proposes a probabilistic generative model for speech called the Probabilistic Acoustic Tube (PAT). The highlights of the model are threefold. First, it is among the very first works to build a complete probabilistic model for speech. Second, it has a well-designed model for the phase spectrum of speech, which has been hard to model and is often neglected. Third, it models the AM-FM effects in speech, which are perceptually significant but often ignored in frame-based speech processing algorithms. Experiments show that the proposed model has good potential for a number of speech processing tasks.

Graduation Semester

2015-12

Type of Resource

text

Owning Collections

Graduate Dissertations and Theses at Illinois PRIMARY

Dissertations and Theses - Electrical and Computer Engineering
