Exact Maximum Likelihood Estimation for Word Mixtures.ppt


文档分类:外语学习 | 页数:约13页 举报非法文档有奖
1/13
下载提示
  • 1.该资料是网友上传的,本站提供全文预览,预览什么样,下载就什么样。
  • 2.下载该文档所得收入归上传者、原创者。
  • 3.下载的文档,不会出现我们的网址水印。
1/13
文档列表 文档介绍
Carnegie Mellon Exact Maximum Likelihood Estimation for Word Mixtures Yi Zhang & Jamie Callan Carnegie Mellon University {yiz, callan }@ cs. cmu . edu Wei Xu NEC C&C Research Lab xw@ ccrl . sj. nec .com Carnegie Mellon Outline u Introduction 1. Why this problems? some retrieval applications 2. Traditional solutions: EM algorithm u New algorithm: exact MLE estimation u Experimental Results Carnegie Mellon Query Q D?) ||( DQD?? Document D Results Feedback Docs F={d 1, d 2 , …, d n} ),(' FQQf???? Q? F? Example 1: Model-based Feedback in the Language Modeling Approach to IR Based on Zhai &Lafferty ’ s slides in CIKM 2001 Carnegie Mellon ? F Estimation based on Generative Mixture Model ww F={d 1,…,d n} ))|()|() log(( );()|( logCwpwpdwcFp iw i????????? 1)|( log max arg???Fp F? Maximum Likelihood P(w| ?) P(w| C) ?1-? P(source) Background words Topic words Based on Zhai &Lafferty ’ s slides in CIKM 2001 Given: F, P(w|c) and ? Find: MLE of ? Carnegie Mellon M T : ? Topic M E : ? general English M I : ? new ? E? T? new Example 2: Model-based Approach for Novelty Detection in Adaptive Information Filtering Based on Zhang& Callan ’s paper in SIGIR 2002 Given: ? general English, ? Topic ? E ? T ? new Find: MLE of ? new Carnegie Mellon Problem Setting and Traditional Solution Using EM u Observe : data generated by a mixture multinomial distribution r=(r 1, r 2, r 3, …, r k) u Given: interpolation weights ? and ?, another multinomial distribution p =(p 1, p 2, p 3, …, p k) u Find: the maximum likelihood estimation (MLE) of multinomial distribution q =(q 1, q 2, q 3, …, q k) u Traditional solution: EM algorithm l Iterative process which can putationally expensive l Only provide approximate solution qpr???? Carnegie Mellon Finding q (1) ????? ki iiiqqq qqqqpf LL q k k1 ) ,..., ,( ) ,..., ,() log( max arg max arg 21 21?? Under the constraints: 0 and 1 1???? i ki iqq Where: f i is observed frequency of word i Carnegi

Exact Maximum Likelihood Estimation for Word Mixtures 来自淘豆网www.taodocs.com转载请标明出处.

相关文档 更多>>
非法内容举报中心
文档信息
  • 页数13
  • 收藏数0 收藏
  • 顶次数0
  • 上传人luyinyzha
  • 文件大小0 KB
  • 时间2016-04-16