Information Retrieval and Search Engines (Introduction to Information Retrieval, GESC1007)
Philippe Fournier-Viger
Full professor
School of Natural Sciences and Humanities
Spring 2021
Last week
We have discussed:
Evaluation in an information retrieval system
Today:
Web search engines
Second assignment
About the final exam
Course schedule (日程安排)
Week 1
Introduction (Chapter 1)
Boolean retrieval
Week 2
Term vocabulary and posting lists (Chapter 2)
Week 3
Dictionaries and tolerant retrieval (Chapter 3)
Week 4
Index construction (Chapter 4)
Week 5
Scoring, term weighting, the vector space model (Chapter 6)
Week 6
A complete search system (Chapter 7)
Week 7
Evaluation in information retrieval
Week 8
Web search engines, advanced topics, conclusion
Final exam (to be announced)
Web search engines
The Web
What is special about the Web?
The very large number of documents
The lack of coordination in the creation of documents
The diversity of backgrounds and motives of content creators

The Web
The Web is a set of webpages (网页)
Webpages are created using a language called HTML
[Figure: a webpage and its HTML; source link truncated: -a-Simple-Web-Page-with-HTML]
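To make the idea of a webpage written in HTML concrete, here is a minimal sketch. The page content (title, heading, paragraph) and the `TagCollector` class are made up for illustration and are not from the course material. The sketch uses Python's standard `html.parser` module to list the tags and the visible text of the page.

```python
from html.parser import HTMLParser

# A hypothetical minimal webpage; the title, heading and text are invented.
PAGE = """<!DOCTYPE html>
<html>
  <head><title>My first webpage</title></head>
  <body>
    <h1>Hello</h1>
    <p>This is a paragraph.</p>
  </body>
</html>"""


class TagCollector(HTMLParser):
    """Record every opening tag and every non-blank piece of text."""

    def __init__(self):
        super().__init__()
        self.tags = []
        self.texts = []

    def handle_starttag(self, tag, attrs):
        # Called once for each opening tag such as <html> or <p>.
        self.tags.append(tag)

    def handle_data(self, data):
        # Called for text between tags; skip whitespace-only pieces.
        if data.strip():
            self.texts.append(data.strip())


collector = TagCollector()
collector.feed(PAGE)
collector.close()
print(collector.tags)
print(collector.texts)
```

The tags describe the structure of the page (head, body, heading, paragraph), while the text between the tags is what a browser displays to the reader.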
The Web
Webpages are stored on servers (服务器)
To access a webpage, one must use software called a Web browser (浏览器)
[Figure: a browser at home connected through the Internet to the server of HITSZ]
Webpages are sent over the internet using the HTTP protocol (HTTP协议)
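The exchange between a browser and a server can be sketched with Python's standard library. This is only an illustration under stated assumptions: the server runs on the local machine instead of a real Web server, and the page content, `PageHandler` and `fetch_from_local_server` are hypothetical names chosen for this example. The client plays the role of the browser and sends an HTTP GET request.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical page stored on the (local) server.
PAGE = b"<html><body><h1>Hello from the server</h1></body></html>"


class PageHandler(BaseHTTPRequestHandler):
    """Answer every GET request with the same small webpage."""

    def do_GET(self):
        self.send_response(200)  # 200 means the request succeeded
        self.send_header("Content-Type", "text/html; charset=utf-8")
        self.send_header("Content-Length", str(len(PAGE)))
        self.end_headers()
        self.wfile.write(PAGE)

    def log_message(self, format, *args):
        pass  # keep the example quiet


def fetch_from_local_server():
    # Port 0 asks the operating system for any free port.
    server = HTTPServer(("127.0.0.1", 0), PageHandler)
    thread = threading.Thread(target=server.serve_forever, daemon=True)
    thread.start()
    try:
        # The client side: what a browser does when it requests a page.
        conn = http.client.HTTPConnection("127.0.0.1", server.server_port, timeout=5)
        conn.request("GET", "/")
        response = conn.getresponse()
        result = (response.status, response.getheader("Content-Type"), response.read())
        conn.close()
        return result
    finally:
        server.shutdown()
        server.server_close()


status, content_type, body = fetch_from_local_server()
print(status, content_type)
print(body.decode("utf-8"))
```

A real browser does the same thing over the Internet, then renders the returned HTML instead of printing it.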
The Web
The idea of the Web: each webpage contains links to other webpages (hyperlinks - 超链接).
Each webpage has an address (URL).
Creating a simple webpage is not very difficult.
Webpages have become one of the best ways to supply and consume information.
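The role of hyperlinks and URLs described above can be illustrated with a short sketch. The page, its address `BASE_URL` and the `LinkExtractor` class are assumptions made up for this example. The code collects the links of a page and turns each one into a full URL, which is roughly the first step a program needs in order to follow links from one webpage to another.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical address and content of a webpage.
BASE_URL = "http://www.example.com/courses/index.html"
PAGE = """<html><body>
<a href="week1.html">Week 1</a>
<a href="/about.html">About</a>
<a href="http://www.example.org/">Another site</a>
</body></html>"""


class LinkExtractor(HTMLParser):
    """Collect the target of every <a href="..."> hyperlink as a full URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Relative links are resolved against the page's own URL.
                self.links.append(urljoin(self.base_url, value))


extractor = LinkExtractor(BASE_URL)
extractor.feed(PAGE)
extractor.close()
for link in extractor.links:
    print(link)
```

The first two links are relative, so they are completed using the address of the page that contains them; the third is already a full URL and is kept unchanged.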
The Web
Billions of webpages containing information.
But if we cannot search this informati
