节点文献

XML数据库的规范化理论研究

The Theory’s Research on Normalization of XML Database

【作者】 李高仕

【导师】 刘先锋;

【作者基本信息】 湖南师范大学 , 计算机应用技术, 2008, 硕士

【摘要】 在网络数据交换日益增多的今天,XML作为一种半结构化数据以其简单易标记和跨平台等优点被越来越广泛的应用到数据存储和数据传输领域,成为Internet上的主要的数据表示和交换标准之一,应用范围非常广泛。XML数据库是一项在最近几年发展起来的新技术。和关系数据库类似,在XML数据文档中由其模式定义形成的树型结构可能包含数据冗余,从而引起数据的更新、插入和删除异常。引起数据冗余的根本原因是其中包含异常数据依赖,包括部分函数依赖、传递函数依赖和多值依赖。由于Web的开放性,XML数据异常的危害性要远远大于关系数据异常的危害性。研究XML数据依赖是进行XML数据库技术中其他相关研究如XML数据的存储与发布技术、XML数据查询与优化技术等的基础。目前对于XML数据依赖的研究主要集中在XML文档的强函数依赖这一领域,对于XML树型结构中引入空值后的弱函数依赖方面的研究就更少。本文的主要工作是在已有的DTD规范基础上,采用路径表达式和树元组的表示方法对XML数据进行规范化研究,主要内容如下:1、基于路径表达式和树元组给出XML函数依赖、部分函数依赖、传递函数依赖和多值依赖的概念,给出了部分函数依赖、传递函数依赖和多值依赖的推理规则。2、基于XML数据依赖形式化定义,给出XML不同级别范式的定义,提出XML文档规范化规则——元素提升规则、元素创建规则和元素上移规则。在规范化基础上给出XML文档规范化算法,并分析了算法的正确性、可终止性、时间复杂性、无损联接性和函数依赖保持性等。3、在XML树型结构中引入空值概念,提出XML弱函数依赖的逻辑蕴含问题,给出一组适合XML空值模型的函数依赖推理规则集;定义了单依赖集合,证明了单依赖集合判定定理和单依赖集合判定可终止定理。

【Abstract】 Nowadays, network data exchanging is increasing day by day, as a half- structured data, XML is widely applied in data storage and data transmission fields because of its simplicity, easily marked and running in various platforms. XML database which is sprung up in recent years is a new technology. Like the relational database, the tree structure formed by schema definition may contain data redundancy in the XML data documents, which leads exception in data updating, data inserting and data deleting. The radical reason that brings data redundancy is exception of data dependency, such as partial functional dependency , transitive functional dependency and multiple value dependency.In fact, studying the XML functional dependency plays a fundamental role in other related studies in the XML database technology such as storage and distribution of data, XML data querying and data optimization. Currently, the research on XML functional dependency is focused on strong functional dependency of XML document. The research on weak XML functional dependency—the functional dependency when introducing null in XML tree structure is even seldom.This paper studies the XML data standardization based on the existing DTD rules, according to path expressions and tree component present, the main achievements are listed as follow: 1.The conceptions and inference rules of XML partial functional dependency, transitive functional dependency and multiple value dependency are given, basing on path expressions and tree component.2. Different levels of XML paradigms are defined according to XML formalized definition. It also deduces the XML document standardization rules----element upgrading rules, element establishing rules and element upper moving rules. It gives the XML document standardization algorithm, analyzing the correctness, term- inability, time complexity and nondestructive reliance and maintenance functional dependency of the algorithm .3. It introduces the conception of null in XML tree structure, puts forward XML weak functional dependency containment issue and a set of functional dependency inferences rules suit for XML null model, defines single dependency set, testifies single dependency theorem and single dependency term- inability theorem.

  • 【分类号】TP311.13
  • 【被引频次】2
  • 【下载频次】178
节点文献中: 

本文链接的文献网络图示:

本文的引文网络