节点文献

基于Ontology的非结构化信息访问机制研究

Research of Access Mechanism of Unstructured Information Based on Ontology

【作者】 杨玲贤

【导师】 陈和平;

【作者基本信息】 武汉科技大学 , 计算机应用技术, 2004, 硕士

【摘要】 随着网络技术日新月异的发展,网络上的数据量呈指数级激增,大多数信息已不再局限于传统的结构化形式,而是以诸如电子邮件、图像、网页、工作流等非结构化形式存在。如何采用统一的方法表示和访问这些非结构化信息,并从中归纳及获取知识是各组织机构实施信息化建设的核心,也是目前一个新兴的研究方向。 XML作为数据表示和数据交换的新标准,具有统一的非结构化信息描述机制,但其在语义表达能力上存在不足,限制了语义异构环境下信息的表示、交换和共享。Ontology技术通过建立领域知识的概念模型,解决了XML语义差异问题,减少或消除概念及术语上的混乱,使得获取那些隐含或不明确的信息成为现实。利用Ontology可以给XML所表示的非结构化信息增加丰富的语义知识背景。 本文提出了一套通用的基于Ontology的访问策略和实现方案作为对非结构化信息访问机制研究特别是语义层次访问的探索和尝试,主要包括下列理论及技术: 1.结合Frame-Logic和SQL语言特点,提出一种新型类SQL数据操作语言Fl-Plus,支持各种数据访问操作; 2.初步设计和实现推理引擎,用以完成推理规则和语义词典的解析映射工作,实现了语义级别信息访问的核心技术,推理引擎的引入帮助计算机识别文档信息的语义,完成智能访问; 3.基于Schema生成的模式约束信息,约束各类信息访问操作,以最大程度保证底层数据的有效性和完整性; 4.针对处理XML应用的瓶颈问题,结合路径优化、Ontology集合访问和JDOM缓冲三大技术,在一定程度上提高了系统处理性能; 5.借鉴JDBC技术,设计了JXSC服务接口,为三层模式的信息访问方式提供支持。 最后,笔者在参与湖北省教育厅科研资助项目——“基于XML的WEB存储系统研究”的研究开发过程中,以上述理论为指导,结合JAVA及XML技术,初步实现了本文所提出的OBSA-AM(Ontology-based Storage Architecture—Access Mechanism)访问系统模型。

【Abstract】 Along with the fast development of the Network technology, information on the Network increases rapidly at the speed of exponential, and most information now presents as unstructured form such as emails, graphics, Web pages, workflows etc. instead of traditional structured form. How to utilize a uniform method to express and access this unstructured information, then acquire knowledge from it is each organization’s information construct kernel and also is a new research direction at present.As a new standard for data express and exchange, XML has a uniform describing mechanism for unstructured information. But the shortage of semantic expression for XML restricts information show, exchange and share in the different environments. Fortunately, Ontology technique brings concept model in the domain knowledge to resolve XML semantic difference problems, reduce or eliminate confusion for concepts and terms, so as to get those hidden or ambiguous information becomes realization. There are adding really abundant semantic background knowledge for unstructured information based on XML with ontology.Supported by the relative theories above-mentioned, this thesis provides a uniform accessing strategy and realizing scheme based on ontology as access mechanism research for unstructured information. Mainly includes techniques and theories as follows:1. Combines Frame-Logic and SQL language’s characteristics, bring forward a new type language (Fl-Plus) to support each data operation;2. Designs and realizes the inference engine concisely, in order to complete the parse mapping for logic inferential rules and semantic dictionary and to realize the kernel technique for semantic level information access, inference engine helps computer distinguish semantic from XML documents and complete intelligent access;3. Pattern restriction information based on schema can restrict all kinds of access operations, and hope to guarantee bottom data’s validity as furthest as possible;4. Aim at the bottleneck problem in applications that is deal with XML, this thesis combines three techniques such as path optimizing, ontology set access and creating buffer based on JDOM to improve system performance at a certain extent;5. Using JDBC for reference, designing JXSC service interface provides support for three levels model access way.Finally, guided by the techniques and theories above-mentioned, this thesis brings forward the model during the project-Research of Web Storage Based on XML, which staked by the Hubei Provincial Department of Education organically, and primarily realizes the OBSA-AM (Ontology-based Storage Architecture - Access Mechanism) during the research by using JAVA and XML techniques.

  • 【分类号】TP393
  • 【被引频次】1
  • 【下载频次】195
节点文献中: 

本文链接的文献网络图示:

本文的引文网络