1/73
文档分类:IT计算机

信息检索与搜索引擎ppt课件.pptx


下载后只包含 1 个 PPTX 格式的文档,里面的视频和音频不保证可以播放,查看文件列表

特别说明:文档预览什么样,下载就是什么样。

下载所得到的文件列表
信息检索与搜索引擎ppt课件.pptx
文档介绍:
信息检索与搜索引擎 Introduction to Information Retrieval GESC1007
Philippe Fournier-Viger
Full professor
School of Natural Sciences and Humanities
******@yahoo.com
Spring 2021
1
Last week
We have discussed:
A complete search system
Today:
Brief review of last week
Evaluation in an information retrieval system
2
Course schedule (日程安排)
3
Week 1
Introduction (Chapter 1)
Boolean retrieval
Week 2
Term vocabulary and posting lists (Chapter 2)
Week 3
Dictionaries and tolerant retrieval (Chapter 3)
Week 4
Index construction (Chapter 4)
Week 5
Scoring, term weighting, the vector space model (Chapter 6)
Week 6
A complete search system (Chapter 7)
Week 7
Evaluation in information retrieval
Week 8
Web search engines, advanced topics, conclusion
Final exam
LAST WEEK
4
5
1) Initially, we have a set of documents.
6
2)Linguistic processing is applied to these documents (tokenization, stemming, language detection…)
Each document is a set of terms.
7
3) The IR System keeps a copy of each document in a cache (缓存).
This is useful to generate snippets (片段)
8
Snippet: a short text that accompany each document in the result list of a search engine
9
4) A copy of each document is given to indexers. These programs will create different kind of indexes: positional indexes, indexes for spell correction, structures for inexact retrieval….
10
5) When a user searches using a free-text query, the query parser transforms the query, and spell-correction is applied.
内容来自淘豆网www.taodocs.com转载请标明出处.
非法内容举报中心
文档信息
  • 页数73
  • 收藏数0 收藏
  • 顶次数0
  • 上传人1017848967
  • 文件大小1.09 MB
  • 时间2021-06-26