Building a Terminological Database from Heterogeneous.ppt


文档分类:IT计算机 | 页数:约36页 举报非法文档有奖
1/36
下载提示
  • 1.该资料是网友上传的,本站提供全文预览,预览什么样,下载就什么样。
  • 2.下载该文档所得收入归上传者、原创者。
  • 3.下载的文档,不会出现我们的网址水印。
1/36
文档列表 文档介绍
Building a Terminological Database from Heterogeneous Definitional Sources
Smaranda Muresan, Peter T. Davis
Samuel D. Popper, Judith L. Klavans
Columbia University
May 21, 2003
Why Terminology is Important?
Each agency and each department might have different ways to define the same concept
Working with multiple databases requires understanding the data across multiple agencies and domains
What’s an Employee?
An appointed officer or employee of USDA including special Government employees (collaborators, consultants and panel members). The term excludes independent contractors.
An individual who is engaged pensated by a railroad or by a contractor to a railroad, who is authorized by a railroad to use its munications in connection with railroad operations.
A person who works for wages or salary in the service of an employer.
The term "employee" does not include a director, trustee, or officer.
US Department of Agriculture
Federal Railroad Administration
Mine Safety and Health Administration
US SEC
Desiderata for Terminological Resources
Capture the ongoing evolution of language
Provide consistency, ease of sharing and integration across agencies.
Architecture
Collection
SemanticAnalysis
Use
Heterogeneous Definitional Corpus
GetGloss
Definder
ParseGloss
Database Building
Terminological Database
dynamic sources
relations among concepts and their attributes
fast access, flexibility, sharing
database query
Building the terminological DB
Collection
Motivation
Definitions are rich in terminological knowledge
On-line dictionaries are static and generally plete
Need to capture the evolution of language
Acquisition of Heterogeneous Definitional Corpus
GetGloss
Definder
Solution
GetGloss – identification and extraction of glossaries
Definder - extraction of definitions from online free text
Building the Terminological DB
Motivation
Need to identify relationships among concepts
. synonyms, hypernyms, cross-reference
Need to sto

Building a Terminological Database from Heterogeneous 来自淘豆网www.taodocs.com转载请标明出处.

相关文档 更多>>
非法内容举报中心
文档信息
  • 页数36
  • 收藏数0 收藏
  • 顶次数0
  • 上传人新起点
  • 文件大小1.68 MB
  • 时间2018-10-13