Withdraw
Loading…
Cross-correlations in medical data: theory, algorithms, and applications in disease analytics
Vaishnavi Subramanian, -
Loading…
Permalink
https://hdl.handle.net/2142/116232
Description
- Title
- Cross-correlations in medical data: theory, algorithms, and applications in disease analytics
- Author(s)
- Vaishnavi Subramanian, -
- Issue Date
- 2022-07-14
- Director of Research (if dissertation) or Advisor (if thesis)
- Do, Minh N.
- Doctoral Committee Chair(s)
- Do, Minh N.
- Syeda-Mahmood, Tanveer
- Committee Member(s)
- Sinha, Saurabh
- Hajek, Bruce
- Bresler, Yoram
- Department of Study
- Electrical & Computer Eng
- Discipline
- Electrical & Computer Engr
- Degree Granting Institution
- University of Illinois at Urbana-Champaign
- Degree Name
- Ph.D.
- Degree Level
- Dissertation
- Keyword(s)
- mutli-modality
- canonical correlation analysis
- cancer
- Abstract
- Cancer, like other complex diseases, involves a multitude of interactions which make it challenging to understand, diagnose, and cure. With the advancements in data science, there is immense potential to characterize the complexities of cancer by taking advantage of datasets that provide patient information from different views or modalities. For example, the Cancer Genome Atlas (TCGA) and the Cancer Imaging Archive (TCIA) provide data from multiple modalities, including radiology, histopathology, and genomics, from the same set of patients. The relations and correlations across features from these different modalities capture the joint variation in cancer properties and enable the characterization of the underlying cancer state in patients. This dissertation explores the potential of correlations as a tool for prediction and disease understanding. Under a two-modality probabilistic graphical model, we first show mathematically that the outputs of canonical correlation analysis (CCA), a linear correlation tool, is effective in the prediction of the model's unknown latent variable. The CCA-based predictor can alternatively be viewed as an interpretable unsupervised embedding generator which captures the correlations across two sets of features using CCA, followed by a supervised predictor. To adapt CCA to high dimensional, low sample, real world medical data, penalties encouraging sparsity, group structures, or graph structures are frequently included in the CCA formulation. To solve the graph-constrained CCA problem in a computationally efficient manner, we develop an alternating algorithm utilizing proximal gradients. We introduce novel matrix deflation (update) rules which enforce orthogonality properties to generate informative embeddings for CCA with penalties. Our results on simulated data and TCGA breast cancer data highlight the potential of our proposed framework. Lastly, we work on spatially-resolved proteomics data which provide expression levels of proteins spatially across tissue. The application of CCA on a recent spatial proteomics data from breast cancer allows the discovery of cross-correlations between neighbourhoods and protein levels, and provides insights into the differences in tissue structure between normal and cancerous tissue. In summary, this dissertation presents algorithms, techniques and proofs to design approaches which utilize cross-correlations in medical data for disease analytics.
- Graduation Semester
- 2022-08
- Type of Resource
- Thesis
- Copyright and License Information
- Copyright 2022 - Vaishnavi Subramanian
Owning Collections
Graduate Dissertations and Theses at Illinois PRIMARY
Graduate Theses and Dissertations at IllinoisManage Files
Loading…
Edit Collection Membership
Loading…
Edit Metadata
Loading…
Edit Properties
Loading…
Embargoes
Loading…