Keywords: vertical search engine, Heritrix, DWR, HtmlParser

Huazhong University of Science and Technology Master's Thesis

Abstract

As the amount of software on the Alisoft Economic Platform (AEP) grows, finding the software one needs by relying solely on directory-style browsing no longer meets users' needs, nor does it improve the Customer Experience Index (CEI). A search engine must therefore be developed in the near future. This thesis establishes a vertical search engine, built on open-source toolkits, that can retrieve software details and can be put to practical use. The major work of this thesis is as follows: analyzing search engine theory, including the information retrieval model; introducing the technologies related to search engines, including the theory and application of Heritrix and DWR; extending the Heritrix spider to crawl information on AEP; and applying DWR to the search module of the AEP search engine, which saves system resources to a large extent. During design and implementation, HtmlParser is used to process documents. It transforms an HTML page into a plain-text (txt) document that includes all