节点文献

融合本体和用户兴趣的专利信息检索系统的研究与实现

Research and Implementation of Fusing Ontology and Users’ Interests in Patent Information Retrieval System

【作者】 缪涵琴

【导师】 孙涌;

【作者基本信息】 苏州大学 , 计算机应用技术, 2007, 硕士

【摘要】 专利文献集技术、法律与经济等信息于一体,反映了最新的科学技术研究情况。自增强自主创新能力在全国科技大会上被提出以来,作为对科技创新有重要指导作用的专利信息服务受到越来越多地关注,专利检索作为专利信息服务的核心,为专利文献的获取提供了一个有效的途径。本文通过分析和比较国内外著名的专利信息检索系统,了解到目前广泛使用的专利检索系统普遍存在着检索智能化程度不高,无法获取检索条件的语义信息以及不能体现用户的个性化需求等不足。随着1998年Tim Berners-Lee提出语义网的概念,本体作为一种能在语义和知识层次上描述信息系统的概念模型的建模工具得到越来越多的应用。本文在介绍本体相关技术的基础上,设计了专利检索领域本体、国际专利分类表和用户兴趣模型的本体表示,实现了融合本体和用户兴趣模型的专利信息检索系统。本文的主要工作包括以下四点:(1)通过分析专利检索的常用概念,设计了专利检索领域本体和国际专利分类表(International Patent Classification,IPC)的本体表示;(2)根据用户的需求以及历史检索记录,从用户主动提供和系统被动学习两方面着手,设计了用户兴趣模型的本体表示;(3)为满足不同层次、不同需求的用户,设计并实现了三种专利检索方式:快速检索、表格检索、高级检索,并将本体、用户兴趣模型、用户输入的检索条件三者相结合,实现满足用户需求的个性化检索;(4)通过设计将检索结果按IPC号分类,由用户自主选择需要浏览的类别来为用户提供个性化服务。由此避免了当用户输入条件范围过大时,检索的查准率大大降低,用户看到过多的不相关信息。经过对所设计系统的测试,证实通过本体和用户兴趣模型在专利检索中的应用,在一定程度上提高了专利信息检索的查全率和查准率。最后,本文对所做工作进行了总结,并给出了进一步完善该系统需要解决的若干问题。

【Abstract】 Patent literatures gather informations from domains including technique, law and economy and reflect latest technology research status. Since enhancing independence and innovate ability was proposed, patent information service gets increasing attention because of showing technique developing direction. Patent retrieval as the core of patent information service proposes an effective approach to gain patent literatures.Firstly, analyze and compare some famous patent retrieval systems both here and abroad. Their intelligentization hierarchy of present extensively employed patent retrieval systems is not high. They are unable to get semantic information from the search conditions and can’t express users’personalized demands. Semantic web was proposed by Tim Berners-Lee in 1998. Ontology as a modeling tool describes concepts of information term system in semantic and knowledge hierarchy is applied ever-increasinly. This paper designs the patent retrieval domain ontology based on the introducing ontology technique, then uses the ontology to express the international patent classification table and users’interests, and also implements the patent retrieval system fusing ontology and users’interests.The main works of this paper include four points as follows:(1) Patent retrieval domain ontology is designed through analyzing the common concepts in searching patent information. And the international patent classification table is expressed using ontology;(2) Design users’interests model from the information actively provided by users and passively provided by system based on users’requirements and historical searching record;(3) Design and implement three patent retrieval modes: quick search, table search, and advance search to meet various users’demands. And combine ontology, user’s interest model and the inputed retrieval condition when users search for patent informations. This realizes individualized retrieval which meets the demands of users;(4) When users’s search term is inaccurate, the precision will drop markedly and users will receive many disrelated informations. This paper designs the classification of results according to the IPC. Users can choose any sort to browse by their own needs.Through testing the system, this paper validates that the application of ontology and user’s interest model improves the recall and the precision of patent information retrieval to a certain extent. Finally, this paper summarizes all of the work and raises further work to make the system more perfect.

  • 【网络出版投稿人】 苏州大学
  • 【网络出版年期】2008年 03期
  • 【分类号】TP391.3
  • 【被引频次】7
  • 【下载频次】335
节点文献中: 

本文链接的文献网络图示:

本文的引文网络