信息检索与搜索引擎Introduction to Information RetrievalGESC1007
Philippe Fournier-Viger
Full professor
School of Natural Sciences and Humanities
******@
Spring 2021
1
Last week
We have discussed:
A complete search system
Today:
Brief review of last week
Evaluation in an information retrieval system
2
Course schedule (日程安排)
3
Week 1
Introduction (Chapter 1)
Boolean retrieval
Week 2
Term vocabulary and posting lists (Chapter 2)
Week 3
Dictionaries and tolerant retrieval (Chapter 3)
Week 4
Index construction (Chapter 4)
Week 5
Scoring, term weighting, the vector space model (Chapter 6)
Week 6
A complete search system (Chapter 7)
Week 7
Evaluation in information retrieval
Week 8
Web search engines, advanced topics, conclusion
Final exam
LAST WEEK
4
5
1) Initially, we have a set of documents.
6
2)Linguistic processing is applied to these documents (tokenization, stemming, language detection…)
Each document is a set of terms.
7
3) The IR System keeps a copy of each document in a cache (缓存).
This is useful to generate snippets (片段)
8
Snippet: a short text that accompany each document in the result list of a search engine
9
4) A copy of each document is given to indexers. These programs will create different kind of indexes: positional indexes, indexes for spell correction, structures for inexact retrieval….
10
5) When a user searches using a free-text query, the query parser transforms the query, and spell-correction is applied.
信息检索与搜索引擎ppt课件 来自淘豆网www.taodocs.com转载请标明出处.