Exploration of Microbial Genomic Sequences via Comparative Analysis
Badger, Jonathan Harold
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/86743
Description
Title
Exploration of Microbial Genomic Sequences via Comparative Analysis
Author(s)
Badger, Jonathan Harold
Issue Date
1999
Doctoral Committee Chair(s)
Olsen, Gary J.
Department of Study
Microbiology
Discipline
Microbiology
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
Ph.D.
Degree Level
Dissertation
Keyword(s)
Biology, Biostatistics
Language
eng
Abstract
This work presents three new applications of comparative sequence analysis. (1) CRITICA (Coding Region Identification Tool Invoking Comparative Analysis) is a suite of programs for identifying likely protein coding sequences in prokaryotic DNA by combining comparative analysis of DNA sequences with more common noncomparative methods. In the comparative component of the analysis, regions of DNA are aligned with related sequences from the DNA databases; if the translation of the aligned sequences has greater amino acid identity than expected for the observed percentage nucleotide identity, this is interpreted as evidence for coding. CRITICA also incorporates noncomparative information in the form of dicodon bias. The dicodon bias information is derived by iterative analysis of the data so that CRITICA is not dependent upon the existence or accuracy of coding sequence annotations in the databases and hence is well suited for the analysis of novel genomes. CRITICA was tested by analyzing the available Salmonella typhimurium DNA sequences and proved to be more accurate than GenMark. (2) The complete genome of Methanococcus jannaschii, combined with sequence data from its mesophilic relatives, is an excellent resource for the study of thermostability. 115 complete and partial protein sequences from the mesophilic relatives were aligned to M. jannaschii sequence and the differences were noted. Compared to their homologs in its mesophilic relatives, M. jannaschii proteins are more hydrophobic, contain more charged residues, have a higher average residue volume, and have fewer uncharged polar residues. (3) Codon usage varies both between organisms and between different genes in the same organism. This observation has been used as the basis for earlier work in identifying highly expressed and horizontally transferred genes in Escherichia coli. The complete genomes of Escherichia coli , Methanococcus jannaschii, Archaeoglobus fulgidus, Methanobacterium thermoautotrophicum, Pyrococcus horikoshii and Aquifex aeolicus were analyzed for evidence of highly expressed and horizontally transferred genes. The apparent percentage of horizontally transferred genes in the analyzed genomes ranges from 0--14%, and the apparent percentage of highly expressed genes ranges from 0--11 %.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.