节点文献

决策树分类算法的研究及应用

Research and Application of Decision Tree Classification Algorithms

【作者】 卜亚杰

【导师】 胡朝举;

【作者基本信息】 华北电力大学(河北) , 计算机应用技术, 2008, 硕士

【摘要】 分类是数据挖掘领域研究的重要课题。常用的分类模型有决策树、神经网络、遗传算法、粗糙集等。本文主要研究决策树ID3算法及其改进算法。首先阐述了决策树的相关理论,并对几种典型的决策树算法进行了分析比较。然后,针对ID3算法存在的不足,提出了基于属性优先关联度的ID3算法(AID3),实验证明AID3算法加快了决策树的构建速度,同时也克服了ID3算法往往偏向于选择取值较多的属性的缺点,随着数据规模的增大,决策树的分类性能也越来越好。最后,探讨了AID3算法在人力资源管理中的实际应用,结果分析进一步表明AID3是有效的。

【Abstract】 Classification is the important topic in the research field of data mining. There are many classification models such as decision tree, neural networks, genetic algorithms, rough sets, and so on. The thesis mainly research ID3 decision tree algorithm and its improved algorithm. First of all, the thesis introduced relative theories of decision tree, and compared several kinds of typical decision tree algorithms. Then a new algorithm based on attribute priority associate ID3(AID3) was proposed with advantages of ID3. The results of experiments proved that AID3 could raise the speed of constructing decision tree, at the same time, and overcame the ID3’s shortcoming which was often partial to select some attributes with more value. Furthermore, the performance of classification of decision tree was also getting better and better with the enlarging of dataset scale. At last, the thesis discussed the application of AID3 algorithm to human resources management, the results had proved that AID3 algorithm was effective.

【关键词】 决策树分类ID3算法AID3算法
【Key words】 decision treeclassificationID3 algorithmAID3 algorithm
  • 【分类号】TP301.6
  • 【被引频次】7
  • 【下载频次】544
节点文献中: 

本文链接的文献网络图示:

本文的引文网络