精品PPT课件----Using works to Analyze Expression Data.ppt


文档分类:外语学习 | 页数:约25页 举报非法文档有奖
1/ 25
下载提示
  • 1.该资料是网友上传的,本站提供全文预览,预览什么样,下载就什么样。
  • 2.下载该文档所得收入归上传者、原创者。
  • 3.下载的文档,不会出现我们的网址水印。
1/ 25
文档列表 文档介绍
A Probabilistic Learning Approach to Whole Genome Operon Prediction
Mark Craven, David Page, Jude Shavlik
Joseph Bockhorst, Jeremy Glasner
B' 00 
Talker: Jinsan Yang
Center for Bioinformation Technology (CBIT)
Abstract
Present putational approach to predict operons in the genomes of anisms.
Machine learning methods to induce predictive models from sequence data, gene expression data, functional annotations of genes.
Use of multiple models to predict promoters, terminators, operons.
Use of dynamic programming method to map every known and putative genes to the most probable operon.
Data analysed: E. Coli K-12 genome.
2
Center for Bioinformation Technology (CBIT)
Introduction
Approach: combining following two steps to predict an operon map for an entire genome.
First step: model to estimate the probability that an arbitrary sequence of genes constitute an operon.
Second step: dynamic programming method to assign every gene in the given genome to its most probable operon.
Multilevel-learning approach
3
Center for Bioinformation Technology (CBIT)
Problem Domain (1)
Primary task: predict operons in the E. coli genome.
E. coli genome: sequenced at the U. of Wisconsin (Blattner et al. 1997), consists of a single circular chromosome of double-stranded DNA, 4,639,221 base pairs, 4,400 genes.
Operon: a sequence of one or more genes that are transcribed as a unit.
4
Center for Bioinformation Technology (CBIT)
Problem Domain (2)
Available data:
Complete DNA sequence of the gene
Beginning and ending positions of 3,033 genes and 1,372 putative genes.
Positions and sequences of 438 known promoters, 289 terminators.
Functional annotation codes characterizing 1,668 genes. (3 level, 123-leaf hierarchy)
Gene expression data for the activity levels of 4,097 genes /putative genes for 39 experiments.
365 known operons
It is estimated that there are several hundred undiscovered operons in E. coli.
Generation of negative examples (non-operons) by the fact that most oper

精品PPT课件----Using works to Analyze Expression Data 来自淘豆网www.taodocs.com转载请标明出处.

相关文档 更多>>
非法内容举报中心
文档信息
  • 页数 25
  • 收藏数 0 收藏
  • 顶次数 0
  • 上传人 wz_198614
  • 文件大小 0 KB
  • 时间2014-10-24
最近更新