节点文献

基于Lucene教学资源搜索引擎的设计与实现

Design and Implementation of the Teaching Resources Search Engine Based on Lucene

【作者】 闫毛毛

【导师】 周荣辉;

【作者基本信息】 西南交通大学 , 计算机软件与理论, 2011, 硕士

【摘要】 20世纪最后20年至今,互联网的发展和普及为人们提供了一个全新的信息存储、加工、传递和使用的载体,网络信息也迅速成为社会成员获得知识和信息的主要渠道之一。传统的信息检索技术与网络应用实际相结合的搜索引擎就应运而生。该系统源于北京某公司针对特定用户群体设计的一个主题搜索引擎系统。主要解决目前通用搜索引擎信息量广、搜索专业知识时结果不集中,用户很难从浩瀚的网络信息中检索到自己领域的知识等问题,基于此目的,为满足公司特定客户群体对于教学资源主题搜索引擎的需要,开发和设计此系统,并投入测试和应用。论文以用户对教学资源信息需求为研究背景,采用主题搜索理念与技术,通过分析将国内互联网上与教学资源相关的信息作为采集对象,基于开源软件包Lucene,研究和实现了对教学资源信息搜索引擎的搜索引擎。首先,介绍搜索引擎的基本概念以及搜索引擎的基本运行原理,并对搜索引擎的排序技术和评价指标进行了探讨。同时对开源项目Lucene进行概述,包括Lucene的系统结构、包结构、索引机制、搜索机制进行了论述。其次,在对公司业务分析的基础上,运用UML用例分析并确定了系统的开发目标和基本功能需求,对系统的关键功能进行了详细的设计,同时对系统的运行架构、组织结构以及系统部署作了设计说明。最后,对搜索引擎的相关开发技术进行介绍,基于Lucene开源框架,运用Spring、Struts 2等技术开发并实现了教学资源搜索引擎系统。

【Abstract】 Since the 20th century for the last 20 years, the development and popularity of the Internet provides a new carrier for people who take as information storage, processing, transmission and making use of, Network information also quickly became one of the main channel which takes as an access to knowledge and information for the member of society. Thus the traditional information retrieval techniques combined with the practical application of network made the search engine.The system which originated from a Beijing company for a project-specific user groups, is a subject search engine. It is mainly to solve the problems that information more widely for general search engine, search results not focusing on professional knowledge and the user is very difficult to retrieve their field of knowledge from the vast network of information, On this basis, the groups develop a search engine on teaching resources in order to meet company-specific customer, and put into testing and application.The thesis takes the information needs of users for teaching resources as the study background, using the concepts and techniques of the theme search.By analysis, the thesis takes the teaching resources information on the domestic Internet as a collection object based on the open source software package-Lucene, studies and achieves the search engines for the teaching resources information.Firstly, this thesis introduces the basic concepts and basic operating principles of search engine, and discusses ranking techniques and evaluation indexes of the Search engine, then an overview of the open source Lucene project, including the Lucene system structure, including structure, indexing mechanism, the search mechanism, was discussed.Secondly, the thesis determine the system’s development objectives and basic functional requirements based on the use case analysis-UML, and carries out a detailed design for the critical functions of the system, and the operation of the system architecture, the organizational structure and the system deployment are made a design specification.Finally, introducing the key technology of the search engine, and developing and achieving the search engine system for teaching resources by using Spring, Struts 2 and other technology based on the Lucene open-source framework.

【关键词】 主题搜索Lucene教学资源SpringStruts 2
【Key words】 topic searchLuceneteaching resourcesSpringStruts 2
节点文献中: 

本文链接的文献网络图示:

本文的引文网络