节点文献

元搜索引擎技术的研究与应用

【作者】 彭丽

【导师】 赵政文;

【作者基本信息】 西北工业大学 , 计算机科学与技术, 2007, 硕士

【摘要】 元搜索引擎是基于搜索引擎基础之上的搜索引擎,它可以同时检索多个成员搜索引擎,对成员引擎返回的结果信息进行融合、再加工后二次陈列给用户。元搜索引擎是当今学术界研究的热门领域之一。 本文首先对搜索引擎和元搜索引擎的发展和搜索原理等进行了概述,然后分别对元搜索引擎的几个关键技术,包括成员引擎的调度、搜索结果的整合、个性化服务的实现等,进行了研究和分析,并在此基础上提出了本文设计的算法。本文主要的研究工作如下: (1) 成员引擎调度算法的分析,并在此基础上根据本文的成员引擎的特点提出本文使用的成员引擎调度算法。 (2) 跟踪用户的搜索行为(包括隐式的点击浏览和显式的投票),并对用户行为进行分析,动态地修改用户模型。这为成员搜索引擎的调度和搜索结果的整合与排名提供了依据。 (3) 提出了基于用户行为的搜索结果合并算法。它根据对用户行为的分析进行搜索结果的排名值计算,从而获得贴近用户偏好的搜索结果和排名。 最后,本文设计了一个基于用户搜索行为分析基础之上的元搜索引擎。相较于其它的元搜索引擎,该引擎具有友好的用户界面,为用户提供了一个快速查看网页内容的捷径,并且由于是基于用户行为分析进行的成员搜索引擎调度和搜索结果整合,因此更贴近用户对搜索引擎的偏好。

【Abstract】 Meta search engine is base on component search engines. It sends the user query to a number of component search engines simultaneously, then merges the results lists returned from them into a single ranked list and presents the merged results to users. It has become a main prospect of research.First, the state-of-the-art of the traditional search engines and the Meta search engines are overviewed, then analysis of the several main technologies of Meta search engine are proposed, including the scheduling of component search engines, the merging of search results, and personalized service. Based on these researches, the algorithm of the Meta search engine in this paper is proposed. The main work of this paper includes:(1) Analyze the scheduling of component search engines, and select a proper scheduling based on the characteristics of the component search engines in this paper.(2) Track the users’ behavior (including clicking and voting), and upon that, we analyze the behaviors and modify the user model continuously. This user model provides the foundation of component search engine scheduling and results merging.(3) Propose the results merging algorithm base on the users’ behavior. It computes the rank value of a document to a user query, and removes repeated results away, so as to get good search results close to users’ favor.In the end, we designed a Meta Search Engine on the basis of analyzing users’ behavior. Comparing to the existing Meta search engine, it has a friendly user interface, and provide a convenient way of checking the rough content of a webpage quickly. Also, as its component search engine scheduling algorithm and result merging algorithm are based on the users’ behavior, it’s more prone to users’ favor of using search engine.

  • 【分类号】TP391.3
  • 【被引频次】8
  • 【下载频次】450
节点文献中: 

本文链接的文献网络图示:

本文的引文网络