节点文献

基于半监督学习的遥感图像分类研究

【作者】 徐庆伶

【导师】 汪西莉;

【作者基本信息】 陕西师范大学 , 计算机软件与理论, 2010, 硕士

【摘要】 随着卫星遥感技术的迅速发展,我们能够利用遥感技术动态、快速、准确地获得大量的遥感数据,尤其是高分辨率的遥感卫星图像。在遥感技术的应用中,通过遥感图像处理和判读来识别各种地表物体是一个非常主要的工作。无论是地物信息提取、土地动态变化检测、还是专题地图制作和遥感图像库的建立等都离不开分类。特别是随着遥感图像分辨率的不断提高,人们迫切要求从遥感图像中获得更多有用的数据和信息,遥感图像的分类在社会生活和经济建设中发挥着越来越重要的作用。分类的目的是从图像中识别实际地物,进而提取地物信息。近年来随着计算机技术的快速发展,计算机识别分类已经成了遥感技术应用研究中的一个重要组成部分。遥感图像的分类主要有两个方法,一是目视判读,即根据工作人员的经验和知识与多种非遥感信息资料结合,按照一定的规律和方法对图像上的地表物体进行识别;另一种是遥感数字图像的计算机分类,遥感图像的计算机分类,是对遥感图像上的地物进行属性的识别和分类,是模式识别技术在遥感技术领域中的具体运用。半监督学习是模式识别和机器学习中的重要研究领域。半监督学习方法结合了监督与非监督学习方法的优点,它同时利用有标记样本和大量的无标记样本,相比较传统的监督与非监督学习方法,半监督学习方法能够得到比较好的结果。本文将半监督学习应用到遥感图像的分类中,通过查阅相关文献,分析和总结前人的工作经验,采用了两种半监督学习方法对遥感图像进行分类,通过实验验证了应用半监督方法进行遥感图像的分类具有非常重要的理论和现实意义。这不仅能减少由于对样本进行人工标记带来的时间和劳力上的浪费,又能有效提高分类的精度。论文主要进行了以下几个方面的工作:(1)讨论了遥感图像分类技术的研究现状及其发展趋势,并简要介绍了几种传统的遥感图像监督与非监督分类方法;(2)介绍了半监督学习方法并对几种半监督算法进行了描述:包括基于生成式模型的半监督学习算法,自训练算法,基于判别式模型的半监督学习算法,基于图正则化框架的半监督学习算法,协同训练(Co-training)算法以及半监督聚类算法;(3)详细讨论了协同训练(Co-training)算法,该方法是一种非常重要的半监督算法,但由于其要求样本的属性集要能够分割为两个相互冗余的子集,在很多的实际应用中,样本的属性集并不能满足此种分割,因此该方法不能够很好地被直接应用。本文以该算法为基础,将其进行改进,得到一个简化的基于多分类器的半监督分类算法,通过实验比较,该方法优于传统的监督式分类方法,并由于该算法对样本的属性集没有强制性要求,使得其比较适合实际的应用,最后将其应用于对遥感图像进行分类;(4)介绍了直推式学习,讨论了直推式支持向量机(TSVM)的原理,TSVM方法是一种能同时利用有标记样本和无标记样本从而提高学习效果的方法,在遥感图像的分类中,我们能够利用到的已标记的样本个数较少,但却有大量的无标记样本,因此采用TSVM对遥感图像进行地物分类,通过实验可以得出TSVM方法能够有效地利用到大量的无标记样本的信息,并且随着无标记样本个数的增加,分类的精度能够得到一定的提高。

【Abstract】 With the rapid development of remote sensing technology, we can obtain a large number of remote sensing images which play an important role in our practical applications, especially high-resolution remote sensing images. In the application of remote sensing technology, recognizing various ground objects through remote sensing disposal is one very important work. No matter acquiring information of ground objects、detecting dynamic change of grounds、or special map making and establishing image database can’t depart from classification. Especially with the improving the resolution of remote sensing images, people press for more useful data and information from remote sensing images, the classification of remote sensing images is playing a more and more important action in social life and economic construction.The aim of classification is recognizing practical ground objects from images, sequentially, extracting information of ground objects. With the fast development of computer technology nowadays, computer recognition classification has been a most important part of remote sensing technology. The classification of remote sensing images has two methods, one is visual interpretation, by combining workers’ experience, knowledge and manifold non-remote sensing information together, according to stated rules and methods, recognize ground objects of images; the other is computer classification of remote sensing images. The computer classification of remote sensing images is a classification of recognizing ground objects’ attributes, is pattern recognition applied in remote sensing technology.Semi-supervised learning is an important research field of pattern recognition and machine learning. The semi-supervised learning combines advantage of supervised and unsupervised learning, it use labeled samples and abundant unlabeled samples, with the help of labeled samples, semi-supervised learning can achieve a better result.This paper apply semi-supervised learning to the classification of remote sensing images, Through consulting some relevant documents, analyzing and summarizing forerunners’ working experience, the paper adopts two kinds of semi-supervised learning methods to class remote sensing images, validates semi-supervised classification of remote sensing images is meaningful in theory and reality. It can reduce the waste of time and work force brought by labeling samples, and it can improve classification accuracy.The main works are summarized as follows:(1) Discussing the background of research and the development trend of remote sensing images classification, and introducing some traditional supervise and unsupervised classification of remote sensing images briefly.(2) Introducing semi-supervised learning and describing a few classical semi-supervised algorithm: Including generative models, self-training, discriminant model, graph based methods, co-training and semi-supervised clustering.(3) Giving a detailed introduction of Co-training, the method is a very important semi-supervised algorithm, but it requires the attributes of the samples must be able to split into two mutually redundant subsets, and in many practical applications, the attributes of samples can not meet such a split, so this method can not be applied well directly. This paper bases on the algorithm, to make improvements, gets a simplified Semi-supervised classification algorithm based on multi-classifier, through experiments, we can see the method is superior to the traditional supervised classification methods and more suitable for practical application, and applying this method to remote sensing image classification.(4) Introducing transductive learning, and discussing the principle of Transductive Support Vector Machine (TSVM), TSVM can use labeled and unlabeled samples at the same time to improve the learning result.in remote sensing image classification, we can use only a few labeled samples, but a large number of unlabeled samples, so we use TSVM into the classification of remote sensing image, through experiment, we can see TSVM can utilize the information of the large number of unlabeled samples, and as the number of unlabeled samples increasing, the classification accuracy can be improved.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络