节点文献

数据仓库数据集成处理中的异构数据接口的设计与实现

The Design and Implement of ETL about Heterogeneous Data in Data Warehouse

【作者】 李素军

【导师】 胡建华;

【作者基本信息】 昆明理工大学 , 计算机软件与理论, 2008, 硕士

【摘要】 数据仓库系统是随着分析型应用的兴起而发展而来,旨在帮助企业充分利用宝贵的信息资源,做出正确的决策。数据仓库虽然为决策支持系统的数据存储和管理提供了解决方案,但是原始数据还存储在文本文件、XML文档、电子表格和关系数据库等各种数据源中,如何将这些数据加载到数据仓库中成了新的问题。因为数据仓库的数据存储和组织形式与文件、电子表格相去甚远,即使是与操作型关系数据库相比也有相当大的差异,所以把各种原始数据直接导入数据仓库是不切实际的。而本论文通过实现一个集成转换工具,为数据仓库提供清洁、规范的数据。本论文首先讨论分析一些本系统中所采用的相关技术,Web Services技术、元数据技术和数据仓库ETL技术,然后分析了如何通过Web Services来封装各个远程的异构数据源,这些异构数据源的元数据信息通过元数据库来统一管理,并对数据仓库中关键技术ETL进行了深入的研究,最终实现了一种灵活、操作方便、可扩展的数据集成转换工具。本系统基于微软的.NET平台开发,从异构数据源的析取、转换、装载到元数据管理提供了一整套的解决方案。

【Abstract】 The data warehouse system is emerges along with the analysis application develops comes, to be for the purpose of helping the enterprise to fully utilize the valuable information resources and making the correct decisions. Although the data warehouse has provided the solution for decision support system’s data storage and the management, but the raw data also saves in the text document, the XML documents, the electronic forms, the relational database and other data sources, how these data will be loaded into the data Warehouse has become a new problem. Because data warehouse’s data storage, the configuration of organization and the document are very different with the electronic forms, even if compares with the operation relational database also has the quite big difference, therefore directly load different kind of law data into the data warehouse is impractical. But this article we through realize a general ETL tool, provides a general solution for data warehouse’s policy-maker.This article first analyzes some and system-related technology, the Web Services technology, the metadata technology and the data warehouse ETL technology, then discussed the integration and application of Web Services in different data sources, and have an in-depth study for ETL, the key technology of data warehouse.This system based on Microsoft’s .NET platform, Provide a whole solution for the course of the different data source’s extract, transform, load and the metadata’s management.

【关键词】 数据仓库Web Services元数据ETL
【Key words】 Data warehouseWeb ServicesMetadataETL
  • 【分类号】TP311.13
  • 【下载频次】263
节点文献中: 

本文链接的文献网络图示:

本文的引文网络