Next generation DNA-based data recorders
Tabatabaei, Seyed Kasra
Permalink
https://hdl.handle.net/2142/115306
Description
- Title
- Next generation DNA-based data recorders
- Author(s)
- Tabatabaei, Seyed Kasra
- Issue Date
- 2022-05-15
- Keyword(s)
- DNA Data Storage, Nicking, DNA Sequencing, Metadata, Modified Nucleotides, Neural Networks, Nanopores
- Abstract
- DNA-based data storage systems have received significant attention in the synthetic biology, computer science and information theory communities due to their promise of ultrahigh storage density, recording durability, energy efficiency, environmental friendliness and potential for integration with in-memory computing platforms. In such systems, user content is stored in synthetic DNA oligos composed of natural DNA nucleotides (A, T, C, and G) and retrieved via next generation (e.g., Illumina) or third generation (e.g., Oxford Nanopore) sequencing technologies. Despite recent advances in DNA synthesis and sequencing methods, all known DNA-based data storage platforms suffer from high cost, read-write latency and significant error rates that render them noncompetitive with modern electronic storage devices. Here, we introduce new approaches for encoding and reading information in DNA molecules. We first demonstrate that one can use readily available native DNA extracted from living cells (rather than synthetic DNA molecules) to store information, and we further show that information can be stored in the topology of the sugar-phosphate backbone in the form of single-bond breaks known as ‘nicks’ (rather than only in the sequence content). We show that information written in nicks can be retrieved via a commonly used sequencing platform such as Illumina MiSeq, as is done in synthetic DNA-based data storage systems. We further demonstrate that nick-based and sequence content-based recording approaches can be combined to generate a two-dimensional data storage system, where the sequence is reserved for archival data and metadata is written in the backbone of the molecule. In a third project, we introduce a fundamentally new concept for a prototype of a DNA-based recorder that uses an extended DNA alphabet composed of the four canonical DNA nucleotides and seven chemically modified nucleotides. The DNA data storage platform with an extended alphabet holds the potential for a ~2-fold increase in information storage density. We demonstrate that combinatorial patterns, generated from these additional nucleobases as well as the natural nucleotides, can be accurately discriminated using MspA and Oxford nanopores, making them suitable candidates for carrying digital information. Overall, the work presented in this thesis fundamentally advances the field of macromolecular data storage.
- Graduation Semester
- 2022-11-12T11:31:24-06:00
- Type of Resource
- text
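The abstract's "~2-fold" density figure for the extended alphabet can be sanity-checked with a back-of-the-envelope calculation. The sketch below is not taken from the thesis: it assumes every symbol is used with equal probability and read without error, so each position carries log2(alphabet size) bits. The names `bits_per_symbol`, `NATURAL` and `MODIFIED` are illustrative only, and the thesis may arrive at its estimate differently.

```python
import math


def bits_per_symbol(alphabet_size: int) -> float:
    """Upper bound on information per position for a uniform, error-free alphabet."""
    return math.log2(alphabet_size)


NATURAL = 4   # A, T, C, G
MODIFIED = 7  # chemically modified nucleotides named in the abstract

extended = NATURAL + MODIFIED  # 11 symbols in total
gain = bits_per_symbol(extended) / bits_per_symbol(NATURAL)

print(f"natural:  {bits_per_symbol(NATURAL):.2f} bits/symbol")
print(f"extended: {bits_per_symbol(extended):.2f} bits/symbol")
print(f"gain:     {gain:.2f}x")  # about 1.73x under these assumptions
```

Under these idealized assumptions the per-symbol gain is roughly 1.7x, which is in line with the order of magnitude stated in the abstract.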
Owning Collections
Graduate Dissertations and Theses at Illinois PRIMARY
Graduate Theses and Dissertations at Illinois