节点文献

嵌入式主动实时数据库ARTs-EDB的索引技术

Index Technologies for the Embedded Active Real-Time Database ARTs-EDB

【作者】 蔡博克

【导师】 许贵平;

【作者基本信息】 华中科技大学 , 计算机软件与理论, 2007, 硕士

【摘要】 实时数据库(Real-Time Database, RTDB)中的数据和事务都具有显式的定时限制,系统的正确性不仅依赖于逻辑结果,更依赖于逻辑结果产生的时间。RTDB的高性能要求以内存数据库(Main-Memory Database, MMDB)作为低层支持。索引是提高数据库系统执行效率的一种有效工具,索引选择问题是数据库物理设计中一个重要的优化问题,在内存数据库上建立索引必然要受内存的快速存取以及高效利用影响。ARTs-EDB是自行研发、拥有自主知识产权的嵌入式主动实时数据库系统。以内存数据库作为其底层支持。大量的数据放到内存中,使得在内存中找到目标数据所用的时间不可忽略,因此,在内存数据库中,除了要使用传统的索引机制外,还要引入用于查找内存中的数据的索引。而非主键索引是一个比主键索引使用频率更高、设计更为复杂的索引机制。在分析了用于查找内存数据的非主键索引的特性之后,设计并实现了一个适合于内存数据库的非主键索引机制。此索引机制使用倒排表实现,支持区间查找,并使用一个高效算法实现联合查找,而索引机制本身不需要创建联合索引。实验证明,此索引结构比传统的散列表结构有更优的查找性能和更高的内存利用率。此索引结构的缺点在于创建的代价高,因此,系统引入了热度评价模块来支持此索引机制。热度评价模块根据属性在当前一段时间内使用的频率等因素对其进行合理的分类。系统依据属性的类型决定是否要基于此属性创建非主键索引、是否要保存此非主键索引。使用有限的内存空间保存“热门”数据的非主键索引,使得系统在不频繁的创建非主键索引的前提下,有较高的“索引命中率”。在自主研制的嵌入式主动实时数据库管理系统ARTs-EDB上,实验结果表明上述索引机制较之传统的索引机制,在时空性能上有大幅度的提高。

【Abstract】 Real-time database (RTDB) needs to face the challenge of simultaneously satisfying data integrity and timing requirements. The transactions and data in real-time database systems have explicit time constraints, and the correctness of transaction execution relies on not only the logical results but also the time constraint. In general, RTDB requires the support of main-memory database system. Index is an efficient tool that can improve the DBS’executing performance. Establishing index in MMDB must be influenced by quick access and efficient use of the memory.ARTs-EDB is an embedded active real-time database system, which is researched and developed independently and possesses proprietary intellectual property rights. It is based on the support of main-memory database system. A large number of data reserves in the main-memory, leading to the result that the time spending on searching the aiming data in the main-memory can not be ignored. Therefore, in the MMDB, except for the use of traditional index, another index that supports to search data in the main-memory should be introduced. And the non-primary index is used more frequently and designed more complicatedly than the primary index.After analyzing the characteristics of the non-primary index used to search the data in the main memory, a new non-primary index system is designed and implemented which fits to MMDB. The index is implemented as a inverted table index that supports range and united search, and the united search is implemented with a quick algorithm rather than a united index. The experiment demonstrates that the new index structure has better performance of searching and needs less main memory than the hashing table. The defect of the index structure is that it needs a more complicated establishing algorithm. Therefore, the system introduces popularity diagnose sub-system to support the new index structure.Popularity diagnose sub-system classifies the attribute by the use frequency during a latest period of time at the present and some other elements. The system determines whether it needs establishing and reserving the non-primary index by the classification of the attribute. Reserving the popular index with limited main memory supports high index shooting average, on the premise that the system needn’t establish non-primary index frequently.The experiments’result based on ATRs-EDB shows that the above index strategy can greatly improve the time and spatial performance than the traditional index strategy.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络