Withdraw
Loading…
Modeling Saint Lawrence Island Yupik morphology to support revitalization
Chen, Emily
Loading…
Permalink
https://hdl.handle.net/2142/120250
Description
- Title
- Modeling Saint Lawrence Island Yupik morphology to support revitalization
- Author(s)
- Chen, Emily
- Issue Date
- 2023-04-16
- Director of Research (if dissertation) or Advisor (if thesis)
- Schwartz, Lane
- Doctoral Committee Chair(s)
- Schwartz, Lane
- Committee Member(s)
- Schreiner, Sylvia L. R.
- Girju, Roxana
- Yoon, James
- Department of Study
- Linguistics
- Discipline
- Linguistics
- Degree Granting Institution
- University of Illinois at Urbana-Champaign
- Degree Name
- Ph.D.
- Degree Level
- Dissertation
- Keyword(s)
- Saint Lawrence Island Yupik
- language revitalization
- computational morphology
- morphological analysis
- linguistic resource
- Abstract
- This thesis explores two approaches to modeling the morphology of Saint Lawrence Island Yupik, the native language of the Indigenous people of Saint Lawrence Island, Alaska. Since the beginning of the new millennium, Saint Lawrence Island Yupik, hereafter Yupik, has been waning in usage among its Native population, particularly among younger generations. Unless steps are taken to reverse these trends, the language, already classified as ``Shifting'' on the EGIDS scale, will likely become dormant within the next century. Exacerbating this is the relative lack of documentation on Yupik, as compared to some other languages in Inuit-Yupik-Unangam Tunuu language family. Linguistic research has been intermittent over the past few decades, and the few Yupik-language texts that do exist were digitized only recently (within the past six years). From a computational perspective then, Saint Lawrence Island Yupik is considered low-resource. Despite these challenges, the Yupik community has expressed a desire for revitalization, and the research described in this thesis seeks to support that effort. The contributions of this work are thus twofold, one being documentary and the other being computational. In particular, this thesis extends and updates existing documentation of the approximately 500 derivational morphemes listed in the Badten et al. (2008) dictionary. Each morpheme was assessed for productivity and example sentences were elicited from speakers that demonstrate their usage. The sentences form a dataset that not only illustrates Yupik as it is spoken in present day, but also serves as an evaluation set for the morphological models implemented later. The remainder of the thesis explores two computational approaches to modeling Yupik morphology. The first constitutes a rule-based approach, involving finite-state transducers, and two such models were implemented (Chen and Schwartz, 2018; Chen et al., 2020). They differed structurally and subsequently yielded differing results in performance. The second is a neural-based approach, implemented using recurrent neural networks and intended to compensate for the shortcomings of the finite-state models. The success and failures of each approach are assessed at length in order to present the best model to support revitalization.
- Graduation Semester
- 2023-05
- Type of Resource
- Thesis
- Copyright and License Information
- Copyright 2023 Emily Chen
Owning Collections
Graduate Dissertations and Theses at Illinois PRIMARY
Graduate Theses and Dissertations at IllinoisManage Files
Loading…
Edit Collection Membership
Loading…
Edit Metadata
Loading…
Edit Properties
Loading…
Embargoes
Loading…