节点文献

数据挖掘模型的创建及其在中医药文献中的应用研究

Establishment of Data Mining Model and Its Application in Literatures of Chinese Medicine

【作者】 麦乔智

【导师】 蔡宝昌;

【作者基本信息】 南京中医药大学 , 中药学, 2009, 博士

【摘要】 五千年中华民族的文化底蕴是中医药发生、发展的基础。中医药领域的无数临床实践与理论研究积累了大量的科学知识,这些知识包含在中医药古籍文献以及当前的研究文献中。面对如此海量的中医药数据,如何有效地利用这些宝贵资源就成了发展中医药必须面对的一个问题。中医药学有其自身的思维模式,具有系统性、整体性、复杂性、不确定性等特点,不适宜运用传统的还原论的方法研究。数据挖掘可以从海量的数据中寻找潜在的规律,完成普通人不能完成的任务。目前,数据挖掘相关技术和方法已经较为成熟,且存在着一套行之有效的方法。因此,应用数据挖掘技术进行有效模式、知识的获取研究,必将加速推进中医药国际化、现代化、规范化和知识化进程,对中医药学的长期稳定发展具有重要意义。数据挖掘(DM)是近20年来随着人工智能和数据库技术发展起来的,是一门涉及人工智能与数据库、统计学、机器学习等不同学科和领域的交叉学科。本文中数据挖掘采用广义观点,即等同于KDD,为从存放在数据库、数据仓库或其他信息库中的大量数据中挖掘有趣知识的过程。面对中医药数据描述多样化且不完备等现象,在标准化处理的同时,还必须对现有的数据挖掘技术进行改进和发展。本文以KDD方法为基础,创建了一种人机互动的数据挖掘模型。人工作业仅为编排及指定,最小化人为建档的工作量,并留下原始文本数据的换行断词噪声,作为操作标的,分析其产生结果。本程序可直接由文本数据作为处理标的。值得注意的是,基本辨认语料库必须正确,方有正确的结论。而数据资料的标准化则是可做可不做,重点在于我们对结论精度范围的要求。将此模型应用于选定的中医药文献资料进行挖掘研究,结果表明:(1)可以按照中医学理、法、方、药顺序做出标示及索引,能够揭示六名医家常用的相同或相似药物。(2)通过对《傅青主女科》方药规律的数据挖掘,发现当归、人参、川芎、酒、白芍、茯苓等药物及其配伍药对、药团最为常用,生化汤使用频率最高,提示补血调血及补气健脾的重要性。其中,对酒的普遍运用非常例外,这在之前的文献研究中很少述及。(3)通过对487首治噎膈病方剂的较为深入地研究,发现理气药所占频次最高,陈皮、木香、甘草、肉桂、人参等药物及其配伍药对或药团最为常用,而对于温里药及诃子的重视与现代临床用药有较大差异。还有,在剂型方面多选用散剂和丸剂,寓缓消渐散之意;在服法方面多选用不拘时候、内服、噙服,意在延长了药物与病灶局部的接触时间以提高药效。

【Abstract】 The occurrence and development of traditional Chinese medicine is based on the Five-thousand year’s Chinese cultural deposits.The numerous clinical practices and theoretical studies in the field of traditional Chinese medicine accumulate a large amount of scientific knowledge,which contained in ancient Chinese medicine literatures and current research literatures.Facing with such massive traditional Chinese medicine data,how to use of these valuable resources effectively has become a problem to develop traditional Chinese medicine.Traditional Chinese medicine has its own mode of thinking,such as systematicness,entirety,complexity,uncertainty and so on,which is not appropriate to be researched in the method of traditional reductionism.Data mining can seek for the potential law from the massive data and complete the tasks that ordinary people can’t do.At present,the correlative technologies and methods of data mining have been mature,which also exist a kind of good-effective methods.Therefore,applying data mining technology to research on acquisition of the effective mode and knowledge,will accelerates the internationalization,modernization,standardization and knowledgeization processes of traditional Chinese medicine,which has great significance on the long-term stable development of traditional Chinese medicine.Data Mining(DM),developing with artificial intelligence and technique of database nearly 20 years,is a cross discipline of different subjects and interdisciplinary areas,involving artificial intelligence and databases,statistics, machine learning,etc.In this paper,Data mining is used by the broad point of view (KDD), Namely the process of mining interesting knowledge from massive data stored in the database,data warehouse or other information base.Facing with the diversification and imperfection of traditional Chinese medicine data description,it is necessary to improve and develop existing data mining techniques.In this paper,we use KDD method as a basis to create a data mining mode of man-machine interaction,to minimize the workload of people performance and directly use textual data as handling purpose.This method is used in the traditional Chinese medicine literatures,which could extract,unite and discover knowledge from a large number of literatures and could make the data mining mode quickly handle a large number of traditional Chinese medicine literatures and excavate knowledge in specific areas.The results about the data mining model in the application and research of traditional Chinese medicine literatures show that:(1) it can make the mark and index in accordance with the principle,law,formula and drugs of traditional Chinese medicine,reveal the same or similar drugs used by the six famous doctors.(2) Mining the data from drug laws of "Fu Qing-zhu gynecology",it found that Radix Angelicae Sinensis,Radix Ginseng,Rhizoma Chuanxiong,liquor,Radix Paeoniae Alba,poria,etc and their compatibility drugs are most commonly used,which suggest the importance of nourishing and adjusting blood,invigorating vital energy and spleen.Among,the widespread use of alcohol is an extraordinary exception, which is rarely mentioned in the previous literatures.(3) Through deeply researching 487 prescriptions for cardiac spasm,it shows that Qi regulating agents share the highest frequency,seasoned orange peel,Radix Aucklandiae,Radix Glycyrrhizae, Cortex Cinnamomi Cassiae,Radix Ginseng,etc and their compatibility drugs are most commonly used.However,there is a major difference of clinical medication of Fructus Chebulae and drugs for dispelling internal cold.Additional,prescriptions for cardiac spasm are more chosen in the form of powder and pills,the method of drug administration are irrespective of time,oral and hold in the mouth,which could prolong contact time of the drug and partial lesions so as to enhance efficacy.

  • 【分类号】R2-03
  • 【被引频次】4
  • 【下载频次】953
节点文献中: 

本文链接的文献网络图示:

本文的引文网络