节点文献

基于三维人脸特征的计算机辅助疾病诊断技术研究

Technical Research on Computer Aided Diagnosis with 3D Facial Features

【作者】 朱文浩

【导师】 魏宝刚;

【作者基本信息】 浙江大学 , 计算机科学与技术, 2009, 博士

【摘要】 近年来,随着计算机技术和医学研究的不断发展,计算机辅助诊断(ComputerAided Diagnosis,CAD)已经成为医学临床诊断的一个重要工具,在某种程度上已经成为现代化医疗的行业标准。目前,CAD已经从诊断的辅助参考逐渐向计算机自动诊断(Automated Computer Diagnosis,ACD)发展。现代医学研究表明,基因遗传综合症(Genetic Syndrome,GS)是导致儿童精神发育迟滞(Mental Retardation)的主要原因之一,而多数综合症都会导致人的脸部形态发生微妙变化,这使得利用计算机进行GS诊断成为可能和必要。本文目标是解决GS计算机辅助诊断中的若干关键技术问题,并构造面向计算机和医学领域交叉的体系结构框架。从三维人脸模型中提取特征是GS诊断的一个重要问题,由于本文研究背景的特殊性,特征提取算法面临两个主要问题:1)由于难以获取正对的三维人脸模型,需要在非正对条件下定位模型;2)GS通常会导致脸部特征形态变化,需要研究同时适用于正常人和GS患者的提取算法。针对上述问题,本文提出了NIFE人脸特征提取算法。算法首先根据鼻尖的几何形状和对称性筛选并获取鼻尖及相应的对称平面位置;随后,利用鼻尖和对称平面校准模型至基本正对;最后,根据其他特征点所在区域的曲率及其相对于鼻尖的位置关系,分割特征区域并提取特征点。NIFE算法利用鼻尖在几何形态上的稳定性,通过分布处理确保了对正常人和GS患者都能有效提取特征。实验结果表明,NIFE算法能在模型朝向和位置未知的情况下提取人脸特征,具有较强的鲁棒性,运行速度较快,准确率较高。智能推理模型是诊断的核心,模型的选择需要在保证准确率的基础上提高泛化能力。由于实验数据是正常人样本和已确诊的病例,因此采用监督学习算法是一个较好的选择。第三章介绍了几种典型的机器学习算法,针对诊断的特点和要求,从样本质量、精度要求以及先验知识的引入三个方面分析对比各种算法,提出以支持向量机(Support Vector Machine,SVM)作为诊断系统的推理核心。随后,在详细介绍SVM算法及其理论依据的基础上,进一步阐明了该算法作为智能推理模型的优势,并讨论了利用SVM解决多类分类问题的方法。为了提高诊断推理的效果,需要充分利用医学诊断先验知识,将其引入学习过程。针对这一目标,需要研究确定ACD的体系结构。第四章首先介绍了CAD/ACD体系结构的研究现状,指出现有的体系结构无法灵活的适应外部知识的变化。针对该问题,论文提出了一个面向计算机和医学领域交叉的ACD体系结构,其主要特点是引入自然语言处理技术,使得医学专家能够利用诊断指令,自主的将诊断知识结合到诊断体系结构中。论文阐述了自然语言处理的相关技术,分析构造了面向医学诊断指令的产生式,并实现了一个诊断指令解析过程。在此基础上,提出并实现了一种将先验知识引入机器学习的方法,该方法的特点体现在样本筛选和输入数据调整两个方面。通过对诊断指令的处理,将不符合语义描述的样本去除(样本筛选),减少噪音数据对学习的影响;增加与语义相关的维度(数据调整),增强分类器对样本语义的敏感。实验结果证明,该方法能有效提高学习效率,同时也证明了所提出体系结构的可行性。由于论文中的数据来自不同的单位,样本标注缺乏严谨和一致性,因此存在同类样本分属不同子类的情况,即同类样本中存在一定的“子差异”。针对这一问题,论文提出了一种基于子空间划分的分类算法GBSVM。该算法的最大特点在于通过聚类将同类样本中具有不同“子差异”的样本预先分组,并利用样本筛选和分类器的构造避免这种“子差异”影响分类结果。实验结果表明,相对于普通的SVM算法和不对称打包SVM算法,GBSVM算法有效的提高了多种样本组成混合分类情况下的学习效率。在论文的最后,介绍了一个儿童遗传综合症诊断原型系统。该原型系统采用了分层次、模块化的结构设计,融合了本文理论和技术研究的成果,实现从数据获取到智能推理的整个过程,具有实用性和参考价值。

【Abstract】 With the rapid development of computer science and medical research, Computer Aided Diagnosis (CAD) has become an important or even clinical standard tool for disease diagnosis. Now, the research of CAD is targeting the developing of Automated Computer Diagnosis (ACD).Genetic research shows that Genetic Syndromes (GS) are one of the main reasons, which lead to mental retardation. The syndromes will usually cause the change of facial morphology, and this makes it possible and necessary to diagnosis genetic syndromes with computer. The aim of this paper is to look into some important issues of CAD research and develop the system architecture with respect to this interdisciplinary topic.To extract facial features from facial 3D model is an important issue in CAD research. On considering of the application domain, the extraction needs to solve two problems: i) it is difficult to acquire frontal images, so that face orientation is required to be extracted on non-frontal 3D images; ii) normal extraction algorithms are not fit for the change of facial morphology, a solution is needed to deal with both normal and abnormal faces. To address the problems, this thesis presents a Nose Identification-based Feature Extraction (NIFE). The extraction algorithm starts by identifying the position of nose tip and its corresponding symmetry plane according to geometry characteristics. After that, 3D models are adjusted to be frontal based on the results of nose identification. Finally, facial regions are segmented by curvature parameters and the position to the nose tip, and then, feature points are extracted in their corresponding facial regions. NIFE is applicable on both normal and abnormal faces by using nose tip, which is stable in shape, as a reference point. The experiment results show that NIFE is fast and efficient to extract facial features on non-frontal 3D modelsIntelligent reasoning is the core of an ACD system. It must be ensured that the reasoning algorithm is accurate and has the ability of generalization. Because the experiment data is labeled, a supervised learning algorithm is a proper choice. With respect to the special requirements of ACD, chapter 3 introduces some typical machine learning algorithms. By analyzing these algorithms from the angles of the sample quality, algorithm accuracy and the incorporating of prior knowledge, the thesis chooses Support Vector Machine (SVM) as the engine of intelligent reasoning. The static learning theory and the implementation of multi-classification are discussed to show the validness of the choice.Incorporating prior knowledge is an efficient way to improve the accuracy of machine learning. To incorporate prior knowledge, we have to develop a proper diagnosis schema. Chapter 4 first gives a brief overview on state-of-art CAD/ACD schemas, and then arguing that current schemas are not able to meet the interdisciplinary requirement of CAD/ACD, and cannot accommodate the change of diagnosis knowledge. To address the problem, this thesis proposes an interdisciplinary-oriented ACD schema. The main idea is to incorporate medical expert in diagnosis schema by employing Natural Language Processing (NLP) so that prior knowledge can be incorporated as medical diagnosis instructions. To deal with the medical diagnosis instructions, production rules are proposed and implemented based on the discussion of NLP. At the end of this chapter, the thesis proposes an approach to incorporate prior knowledge. The incorporation acts on both sample selection and feature vector generation. With the interpreted prior knowledge, irrelevant samples are eliminated and feature vectors can be expanded with relevant dimensions. According to the experiment results, both of the operations are effective and the feasibility of the schema is proved.Because the experiment data comes from different institutions, the labels of training samples are not consistent. The inconsistent labeling brings a sub-separable problem. That is, samples with the same label may represent different sub classes. To address the problem, a Grouped Bagging SVM (GBSVM) is proposed. The GBSVM uses cluster algorithms to regroup the sub-separable samples and then construct sub classification SVM with an even sample selection procedure, which makes sure that the sub-separable samples with the same label will not be separated. On compared with ABSVM and normal SVM, GBSVM is more accurate and effective on the experimentdata.The last chapter of the thesis presents an ACD prototype system. The system isdesigned in a multi-layer manner with the theories and techniques applied. The entirediagnosis process is implemented including data capture and intelligent reasoning, andthe system is highly practical.

  • 【网络出版投稿人】 浙江大学
  • 【网络出版年期】2010年 07期
节点文献中: 

本文链接的文献网络图示:

本文的引文网络