节点文献

基于Deep Web的图书信息集成与查询系统

Based on the Deep Web of Books Information Integration and Inquires the System

【作者】 邓丽君

【导师】 伏玉琛;

【作者基本信息】 苏州大学 , 计算机技术, 2011, 硕士

【摘要】 该系统是使用在手机上的图书搜索系统,通过搜索,可以为用户提供基本的图书信息查询,并将查询结果显示在手机屏幕上,方便手机用户查阅。在本文中,笔者提供了一种基于Deep Web(深网)技术的网络爬虫以实现对特定主题的网络信息的收集、整合,该爬虫被设计成一个基于JAVA语言的多线程的多级队列爬虫,在这个队列中采用HTMLParser工具和正则表达式技术对抓取的URL进行处理和存储。在URL队列的设计上引入了Berkeley DB,实现了队列的高效存取,并将抓取到的数据存入MySql数据库。笔者采用基于Lucene技术对处理后的信息建立索引,在成功收集资源并建立索引后,笔者基于软件Android,当今最重要的手机开发平台建立了一个搜索界面,以方便用户使用手机通过Web搜索到与特定主题相关的资源。该系统可以为手机使用者提供方便快捷的信息服务,用户可以随时获取所查询的图书的各类信息,为用户带来了便利。

【Abstract】 The system is used in cellphones for searching books. Through searching, it can provide phone users with basic information of books, and show the result on the phone’s screen. That is convenient for phone users. In this paper, the author present a web crawler based on the technology of Deep Web to complete the network collection and integration on a particular theme. The reptile is designed to be a multi-thread multilevel queued reptile based on the JAVA language, and in the queue, the HTMLParser tool and regular expressions technology are used to process and store the grabbed URL. The Berkeley DB is introduced in the design of the URL queue. That realizes the efficient access of the queue and deposits the grabbed data in the MySql databases. The author uses the technology based on Lucece to establish the index of the processed information. After successfully collecting resources and establishing index, based on the Android software, nowadays’most important cellphones development platform, the author sets up a searching interface as a convenience to users so that through the Web, they can use cellphones to find the resources related to the specific topics.The system can provide phone users with convenient and quick information service. The users can access all kinds of information on the books they are searching in any time. That’s so convenient for users.

【关键词】 网络爬虫Deep WebAndroid平台
【Key words】 CrawlerDeep WebAndroid platform
  • 【网络出版投稿人】 苏州大学
  • 【网络出版年期】2012年 06期
节点文献中: 

本文链接的文献网络图示:

本文的引文网络