节点文献

基于XML/Java的元搜索引擎的研究

【作者】 何玉菁

【导师】 傅秀芬;

【作者基本信息】 广东工业大学 , 计算机应用技术, 2004, 硕士

【摘要】 元搜索引擎通常被称为搜索引擎之上的搜索引擎。用户只需递交一次检索请求,由元搜索引擎负责转换处理后提交给多个预先选定的独立搜索引擎,并将所有查询结果集中起来以整体统一的格式呈现到用户面前。而Java是由Sun Microsystems公司所开发的一个高级程序语言,Java提供了一个跨平台的方案,可支持分布式处理环境。Java语言成为了结合XML(eXtensible Markup Language)的最佳选择。XML以一种开放的自我描述方式定义了数据结构,在描述数据内容的同时能突出对结构的描述。由于数据显示与内容分开,XML定义的数据允许指定不同的显示方式,使数据更合理地表现出来。 本文介绍了搜索引擎和元搜索引擎的发展历史,讨论了元搜索引擎的基本工作原理并对元搜索引擎进行了分类,比较了元搜索引擎与独立搜索引擎相比的优点,讨论了元搜索引擎的几个关键技术,并分析了元搜索引擎面临的问题和将来的发展趋势。作者提出了一个元搜索引擎模型MySearch,它包括了用户界面代理,检索代理,查询数据库这三个部分。在此基础上,还探讨HTML数据到XML数据的转换;研究了JAVA,XML与JDBC的结合问题,也即与数据库的结合问题。并用JAVA SERVLET和XML建了一个基于XML、JAVA的元搜索引擎。XML作为一种数据表示的形式对Web上的数据检索和挖掘应用将带来巨大的优势。

【Abstract】 Meta search engine is regarded as search engine based on search engines. Users only need to submit search requirements once, it is the responsibility of the meta search engine to transform, process and hand over the requirements to multiple pre-selected independent search engines, then present the search results in a uniform format to users. Java is a kind of advanced programming language developed by Sun Microsystems, and it provides a scheme independent of platforms, and it also can sustain distributing processing environment. Java is the best choice to be combined with XML. XML uses an open, self-described mode to define data structure; it can describe data content as well as structure. Due to the separation of data display and data content, it is allowed to show XML data with different method.This thesis introduces the developing history of search engine and meta search engine; discusses the working principle of meta search engines and classify them; compares the strong points of meta search engine with search engine; it also discusses several key technology of meta search engine, and analyses the problems and trend of meta search engine in the future. The author bring forward meta search engine model MySearch, it mainly comprises user interface agent, search agent and search database. Based on MySearch model, the author probes into the transform of HTML to XML, the combination of Java, XML and JDBC, and builds a meta-search engine based on XML, Java using Java Servlet and XML techniques. XML will bring great superiority to Web searching and mining as a data expressing forms.

【关键词】 XMLJava元搜索Web挖掘MySearch模型
【Key words】 XMLJavaMeta SearchWeb MiningMySearch Model
  • 【分类号】TP393.09
  • 【被引频次】5
  • 【下载频次】392
节点文献中: 

本文链接的文献网络图示:

本文的引文网络