Phase difference and tensor factorization models for audio source separation
Traa, Johannes
Loading…
Permalink
https://hdl.handle.net/2142/95277
Description
Title
Phase difference and tensor factorization models for audio source separation
Author(s)
Traa, Johannes
Issue Date
2016-10-10
Director of Research (if dissertation) or Advisor (if thesis)
Smaragdis, Paris
Doctoral Committee Chair(s)
Smaragdis, Paris
Committee Member(s)
Hasegawa-Johnson, Mark
Bresler, Yoram
Stein, Noah
Department of Study
Electrical & Computer Eng
Discipline
Electrical & Computer Engr
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
Ph.D.
Degree Level
Dissertation
Keyword(s)
Nonnegative matrix factorization
Nonnegative tensor factorization
Interchannel phase differences
Audio Source Separation
Abstract
Audio source separation is a well-known problem in the speech community. Many methods have been proposed to isolate speech signals from a multichannel mixture. In this thesis, we will explore a number of techniques involving interchannel phase difference (IPD) features within a tensor factorization framework. IPD features can be extracted on a time-frequency (TF) grid and are a function of the phase characteristics of the mixing process. Thus, the ultimate goal is to form a clustering of these features and produce TF masks that can be used to perform the separation. We discuss various non-tensor-based methods that are capable of modeling linear and nonlinear IPD trends. Then, we discuss generalizations to both nonnegative and complex tensor factorizations (NTF, CTF). We show that each method performs best in certain circumstances and we conclude by saying that more work is needed to devise a generally superior approach.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.