节点文献

基于半监督学习的遥感影像分类技术研究

The Study of Semi-supervised Classification Technology in Remote Sensing Image Processing

【作者】 任广波

【导师】 郑荣儿; 张杰;

【作者基本信息】 中国海洋大学 , 海洋信息探测与处理, 2010, 博士

【摘要】 基于统计模式识别的分类是遥感影像应用中最主要的信息提取方法。由于遥感影像信息统计分布的高度复杂性和随机性、人为选择样本时对待分类影像认知的有限性以及选择时的盲目性等因素,常导致得到的样本数量少且代表性不好,从而无法保证取得理想的分类效果。但目前传统的遥感影像分类方法研究和应用中都忽视了该问题,常以“样本选择无任何问题”为出发点评价方法的适用性和参数设置,这不利于遥感影像信息提取技术研究和应用的深入发展。本文的工作主要针对遥感影像分类中人为选择样本少且代表性不好的问题,研究如何充分挖掘分类器和待分类影像的潜力,使影像分类结果最大限度的得到改善。论文主要包括以下两个部分:第一,在深入分析遥感影像分类时经常出现的人为选取样本代表性不好问题的基础上,针对遥感影像数据特点和分类特点,引入半监督学习分类方法,开展了基于半监督学习的遥感影像分类技术研究;第二,面向遥感应用中全自动分类和对无法获取有效样本区域的分类问题,基于发展的适用于遥感影像的半监督学习分类技术,分别提出了有效的解决方法。具体内容如下:(1)发展了基于半监督学习的遥感影像分类方法在分析遥感影像数据特点的基础上,基于半监督学习方法中与遥感影像数据分布特点一致的两种假设,研究并发展了基于生成模型和直推式思想的遥感影像半监督学习分类技术。基于生成模型的技术研究中,推导并修正了EM算法的递归公式,从遥感影像分类应用的角度给出了修正依据;面向遥感影像分类特点和需求,给出在一个类别对应一个概率混合成分和一个类别对应多个概率混合成分时的分类方法,并分别给出相应的算法流程;发现了休斯现象在半监督学习分类中同样存在;根据设计的分类实验,从经验角度给出了已标记样本集和未标记样本的用量比例参考。针对现有基于直推式思想的遥感影像方法研究中,因未考虑遥感数据特点而导致未标记样本的标注效率低下、分类精度不高的问题,提出了一种适用于遥感影像的未标记样本标注方法;分别应用到目前主流的中分辨率遥感影像和高分辨率遥感影像分类中,给出了基于象元的分类方法流程,和基于分割对象的高分辨率遥感影像半监督学习分类方法流程框架,并指出发展的基于分割对象的分类方法体现了真正意义上的直推式学习。为更全面的评价和分析所发展的两种方法,开展了基于生成模型的方法和基于直推式方法的分类效果对比,并分析了两种方法的适用性。(2)基于发展的半监督分类方法解决了遥感应用中的两个技术难题面向遥感应用中对全自动分类技术和训练样本拓展应用技术的需求,在分析半监督学习方法在上述应用领域中的适用性的基础上,开展了相应的研究。遥感影像的全自动分类是今后大规模、高频度、重复性区域遥感监测技术发展的重要方向。提出一种以建立预设样本集的方式,基于半监督学习的遥感影像全自动分类技术。已标记样本不一定来自待分类影像本身,但却在一定程度上代表影像覆盖的区域。通过在区域性样本集建立策略下的全自动分类实验,给出了可行的基于半监督学习的遥感影像全自动分类方法。面向目前国家海域使用监测中对高自动化信息提取的需求,提出一种针对海岸带滩涂围垦信息的半监督学习全自动提取方法。遥感影像分类样本的时空拓展应用在灾情的应急监测、跨区域的遥感监测和跨国界的军事监测等方面意义重大。代表性不好是样本难以在覆盖不同时空的影像分类中有效应用的主要障碍。为应对该问题,提出了一种遥感影像半监督学习分类样本拓展技术。已标记样本完全不来自待分类影像,甚至无法确定对影像覆盖的区域的代表性。分别针对近距离、应急监测中的样本拓展应用,和远距离、跨国界科学研究和军事意义上的分类样本拓展,开展了实例应用。并根据实验结果对比分析了提出方法和一般半监督学习的异同。

【Abstract】 The classification method based on statistical pattern recognition is one of the most important methods in remote sensing image information extraction field. Because of the high degree of complexity and randomness on the statistical distribution which remote sensing information holds, as well as the operator’s limited knowledge of the images and the blind choice of training samples, the number of samples which we obtained usually uninsufficient and had poor representativeness. We cannot image that a good classification result can be obtained with these samples. But people often ignored this problem both in classification methods research and application, which is not conducive to remote sensing image classification and information extraction technology research development. This thesis mainly aims at dealing with the poor sample representativeness problem and studying how to mining the potentialities of the classification methods and the classified images, so that the classification results can be improved.The thesis begins with an overview of the basic theories and recent research on semi-supervised learning field and remote sensed image classification field. The bulk of author’s contribution, which form the main part of this thesis, are the semi-supervised classification methods developing for remote sensed images, and its application to dealing with two important problems in remote sensing application field.(1) Semi-supervised classification methods development for remote senseing imagesIn analyzing the characteristics of remote sensing image data, based on the two kinds of assumptions which the semi-supervised learning and remote sensing image data distribution share, the methods based on generative model and transductive model of semi-supervised learning for remote sensing image classification are proposed and developed.In generative model-based method studying, we re-derivate the EM algorithm and amendment a recursive formula, and give the reason why we do so. According to the characteristics and needs of remote sensing image classification, a one class corresponds to one component and a one class corresponds to more components models are given out, and the corresponding algorithm flows are developed respectively. We also find out that the Hughes phenomenon exists in semi-supervised learning category too. Based on the designed classification experiments, we give our suggestion that the usage ratio of unlabeled and labeled samples must not be too large or small.In transductive-based methods studying, we propose a new unlabelled sample labeling method according to the remote sensing data and application characteristics, then apply it to the current mainly used medium-low resolution remote sensing images and high-resolution remote sensing image classification. At last we give out the pixel-based and object-based semi-supervised classification process which has great difference with the traditional pixel-based classification method.In the end of this part, we compare the generative model method and the transductive model method by apply them to the same image classification.(2) Automatic classification and training samples space-time expanding methods development for remote sesing image based on the proposed semi-supervised methodsAutomatic classification is the important direction of technology development for large-scale, high frequency, repetitive regional remote sensing monitoring in the future. We propose a semi-supervised remotely sensed image classification method in the way of establishing training samples data sets. Here labeled samples are no longer come from the image to be classified itself, but to some extent on behalf of the image area covered. For the high demand of automatic information extraction from remote sensed images in state sea used monitoring, we proposed a coastal zone usage information automatic extraction method based on the semi-supervised classification method we proposed in the first part.Remote sensing image classification sample space-time expansion plays an important role in the emergency disaster monitoring, cross-regional and cross-border monitoring and other aspects of military. Poor representative is the main problem for training samples applied in the images which cover different space and time. We proposed a semi-supervised based method to deal with this problem. Labeled samples are completely not from the image to be classified and even cannot cover the place the image covers. The classification experiments on regional expanding and border expanding are carried out, and the result shows that our method works well.In the end, the methods which we proposed to dealing with the poor representative of man-picked training samples problem are summarized and discussed, and further research directions and goals are given out.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络