节点文献

基于文本图像纹理特征的文种识别技术研究

Research on Script Identification Based on Texture Feature of Document Images

【作者】 顾立娟

【导师】 平西建;

【作者基本信息】 解放军信息工程大学 , 信号与信息处理, 2009, 硕士

【摘要】 随着网络通信技术和信息处理技术的迅速发展,文本图像成为人们获取信息的重要来源。随着国家之间交流的日益频繁,多种语言文字需要识别和处理,文本图像的文种自动识别技术对于有效提取文本图像中的信息具有重要意义。本文主要研究基于文本图像纹理特征的文种识别技术,所做的主要工作如下:1.分析了文本图像的特征尤其是纹理特征,介绍了文本图像文种识别技术的发展历史和研究现状,明确了文种识别技术现有的成果和面临的问题。2.提出了一种基于多小波变换的文种识别算法。将文本图像多小波分解的子图能量作为特征,用SVM实现文种分类。实验结果表明,该算法的识别性能明显优于基于小波变换的方法,尤其提高了对文字字体格式变化的鲁棒性。3.针对目前文种识别中纹理特征描述子对文字行倾斜缺乏鲁棒性,通过研究组成文字的纹理基元可控金字塔子带能量的分布特点,对文本图像的可控金字塔能量统计特征空间重新进行排序,提出一种对文字行倾斜具有鲁棒性的文种识别算法。对十种文字的文本图像进行不同倾斜角度的文种识别实验,结果表明该算法具有较高的识别率且对文字行倾斜具有较强的鲁棒性。4.针对文字笔划具有较强的方向性和文字边缘包含重要的纹理信息,提出了基于多尺度几何分析的文种识别算法。采用Contourlet及复数Contourlet变换对文本图像进行分解,提取子带能量特征;同时对图像Contourlet变换子带系数的边缘分布进行广义高斯建模,提取模型参数特征。采用SVM作为分类器。对十五种文字的文本图像进行实验,结果表明所提出的算法提高了对视觉特征相近的文种的识别能力。5.提出一种分级识别文种的方法。对十四种文字分两级识别,第一级采用文本行灰度投影法对文种粗分类,第二级采用基于纹理特征的算法对文种进行细分类。该方法识别效率高,错误积累小,可以根据文字特征选择识别算法,根据应用需求确定识别层次,具有较高的实用价值。

【Abstract】 With the rapid development of network communication technology and information processing technology, document images have become important source for attaining information. For the intercommunications among countries are more frequent, many languages or scripts need to be identified and processed. Script identification is significant for attaining information from document images effectively. This dissertation mainly works on script identification based on texture feature of document images. The main work is as following:1. The features especially texture features of document images are deeply studied. The development history and researching state of script identification are introduced. The fruits that have got and difficulties that are faced are pointed out.2. A script identification algorithm based on multi-wavelet transform is proposed. The energies of sub images after multi-wavelet decomposition are used as features and SVM is used as classifier. Experimental results confirm the proposed algorithm is more excellent than the one based on wavelet. It’s especially robust to the changes of font and format of characters.3. Most algorithms on texture feature extraction for script identification are unadaptable to the skew of text line presently. To obtain features robust to rotation, texture units consisting of characters are decomposed by Steerable Pyramid and the energy features of sub bands are studied deeply. An algorithm robust to the skew of text line is proposed through realigning the energy statistical features. The experiments are performed on the image database containing ten scripts with different skew angles. The results confirm that the algorithm can identify scripts accurately and is robust to the skew of text line at the same time.4. Aiming at the orientation of characters and the abundant texture features of character edges, algorithms based on multi-scale geometric analysis are proposed. Document images are decomposed by Contourlet and complex Contourlet transform. Energy features of sub bands are extracted. At the same time, sub bands of Contourlet transform are modeled by Generalized Gaussian Model and model parameters are used as features. SVM is used as classifier. The experiments done on image database containing fifteen scripts confirm the proposed algorithms improve the identification performance on the scripts whose vision features are similar.5. A script identification method identifying scripts by steps is proposed. Fourteen scripts are identified by two steps. The text line projection algorithm is used in the first step for coarse identification and the algorithm based on texture feature is used in the second step for fine identification. This method is efficient with small error accumulation. It is very practical for it can select algorithm according as characteristic of script and select step according as application requirement.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络