节点文献

搜索引擎现状与发展研究

Study on the Current Situation and Development of Search Engine

【作者】 乔冬梅

【导师】 崔慕岳; 柯平;

【作者基本信息】 郑州大学 , 图书馆学, 2002, 硕士

【摘要】 搜索引擎是WWW上出现最早的网络二次信息组织工具,也是WWW上最有效的信息检索工具。搜索引擎经历了近十年的发展,它一方面极大地改善了人们在网络上进行信息搜索的手段,另一方面由于种种原因在信息收录完备性、检全率、检准率、检索功能和用户检索界面等方面还存在许多不足之处。 本文首先回顾了搜索引擎的发展历程,说明了搜索引擎的基本工作原理、类型与功能。在我们建立的搜索引擎评价指标体系基础上,对搜索引擎进行重新评价。得出搜索引擎现存的最主要问题是:信息完备性差、检准率不高、检索界面不够友好。 本文集中解决如何提高搜索引擎信息收录完备性与。改善搜索引擎检索功能这两个问题。综合运用协作式搜索和分布式检索,通过移动Agents技术来实现搜索引擎之间的信息共享。针对当前搜索引擎将关键词检索和分类主题检索分离的缺陷,借鉴关键词检索、概念词检索和分类主题检索一体化的思想,阐述了一体化的实现对于提高搜索引擎检准率和检全率以及改善检索界面友好性的作用与意义。

【Abstract】 The study revolves around the search engine, which is the earliest and the most efficient tool for information organization and retrieval on the Internet. The methods of searching information on the Internet world have been improved greatly by search engine. However, for some reasons, search engine does not collect information perfectly, and provides unsatisfied recall ratio, precision ratio, retrieval function and the interface.The article looks back on the course of the development of search engine, and illustrates the way by which search engine works, and explains its categories and functions. The article studies the traditional evaluation system, including coverage, recall ratio, precision ratio, response time, and the friendship of the interface.Then, it explains the major difference happened to the search engine on the Internet. On the base of the new principles, the article evaluates the search engine again,, and makes the conclusion that the major problems of the present search engines are poor coverage, low precision ratio, and relatively unfriendly interface.We focus on how to better the coverage and how to improve the functions of search engine. We suggest that search engine should comprehensively use the collaborative search robots and the distributive retrieval system to make the information shared. In order to remove the defects of the retrieval function of the present search engine, the paper make use of the idea that the keyword retrieval, the subject retrieval and the classification retrieval should be integrated, and expounds the importance of the idea to improve the recall ratio, the precision ratio and the friendship of the interface of search engine.

  • 【网络出版投稿人】 郑州大学
  • 【网络出版年期】2002年 02期
  • 【分类号】G254
  • 【被引频次】27
  • 【下载频次】1717
节点文献中: 

本文链接的文献网络图示:

本文的引文网络