Withdraw
Loading…
Novel methods in transcriptome analysis using RNA-seq
Li, Yang
Loading…
Permalink
https://hdl.handle.net/2142/31098
Description
- Title
- Novel methods in transcriptome analysis using RNA-seq
- Author(s)
- Li, Yang
- Issue Date
- 2012-05-22T00:27:43Z
- Director of Research (if dissertation) or Advisor (if thesis)
- Ma, Jian
- Department of Study
- Bioengineering
- Discipline
- Bioengineering
- Degree Granting Institution
- University of Illinois at Urbana-Champaign
- Degree Name
- M.S.
- Degree Level
- Thesis
- Keyword(s)
- RNA-seq
- splice junction
- alternative splicing
- gene fusion
- Abstract
- RNA-seq has proven to be a powerful technique for transcriptome profiling based on next-generation sequencing (NGS) technologies. Using RNA-seq, we want to solve two critical challenges: identifying Splice Junctions (SJs) and annotating gene fusion transcripts. Due to the limited read length of NGS data, it is extremely challenging to accurately map RNA-seq reads to SJs, which is important for the analysis of alternative splicing and isoform construction. In this thesis, we describe a novel method, called TrueSight, that combines information from (i) RNA-seq read mapping quality and (ii) coding potential from the reference genome sequences into a unified model that utilizes semi-supervised self-training to precisely identify SJs. Both simulation and real data evaluations showed that TrueSight achieved higher sensitivity and specificity than existing tools. We also applied TrueSight to discover novel splice forms in honey bee transcriptomes that cannot be detected by other methods and found that 94.6% of honey bee multi-exon genes are alternatively spliced. Utilizing high coverage transcriptome profiling data and a gene model enhanced by TrueSight, our quantitative analysis revealed that the expression ratio of the splice variants from a single gene is significantly correlated with the gene's exon-intron structure, splice site strength, and methylation pattern. We believe this new tool will be highly useful to comprehensively study splice variants based on RNA-seq. Fusion transcripts can be created as a result of genome rearrangement in cancer. Some of them play important roles in carcinogenesis, and can serve as diagnostic and therapeutic targets. With more and more cancer genomes being sequenced by next-generation sequencing technologies, we believe an efficient tool for reliably identifying fusion transcripts will be desirable for many groups. With the alignment tool we developed, we designed and implemented an open-source software tool, called FusionHunter, which reliably identifies fusion transcripts from transcriptional analysis of paired-end RNA-seq. We show that FusionHunter can accurately detect fusions that were previously confirmed by RT-PCR in a publicly available dataset. The purpose of FusionHunter is to identify potential fusions with high sensitivity and specificity and to guide further functional validation in the laboratory.
- Graduation Semester
- 2012-05
- Permalink
- http://hdl.handle.net/2142/31098
- Copyright and License Information
- Copyright 2012 Yang Li
Owning Collections
Graduate Dissertations and Theses at Illinois PRIMARY
Graduate Theses and Dissertations at IllinoisManage Files
Loading…
Edit Collection Membership
Loading…
Edit Metadata
Loading…
Edit Properties
Loading…
Embargoes
Loading…