多区域图像的分割和倾斜检测方法研究

Research on Segmentation and Skew Detection of Multi-region Document Images

[Author] Yue [given name illegible in source]

[Author information] Shandong Normal University, Computer Software and Theory, 2008, Master's degree

[Abstract (translated from the Chinese)] In today's information society, computers have entered every field, the Internet is increasingly widespread, people rely more and more on computers to obtain information, and a large share of processing work has moved onto computers. How to convert traditional paper documents into electronic text has therefore become a topic of interest. Daily life and work involve processing large volumes of documents, which include not only text-only documents but also mixed text-and-graphics documents and image documents, so the need to enter documents into a computer quickly and accurately has become pressing.

This thesis studies segmentation and skew detection methods for multi-region images. It surveys commonly used document image segmentation algorithms and describes the strengths and weaknesses of each. Document image processing algorithms fall broadly into two classes: geometric analysis and texture analysis. Geometric analysis in turn divides into top-down, bottom-up, and hybrid methods. The thesis describes in detail two top-down segmentation algorithms, run-length smoothing and projection profile, and two bottom-up methods, neighborhood line density and connected component analysis. It also lists several common image segmentation algorithms.

Building on these basic methods, the thesis proposes an improved projection profile algorithm for multi-region images. The algorithm addresses the fact that the ordinary projection profile algorithm cannot segment complex multi-region images that contain skew. The image is first binarized, and morphological erosion followed by dilation reduces the noise. The improved projection profile algorithm is then applied to the result: even when there are no valley points along the X and Y axes, it can find cut points from the distribution of image pixels, cut the image into small blocks, and run projection analysis on each block, repeating the process until every region of the image has been separated.

Methods for detecting the skew angle of a document fall broadly into five classes: Hough transform methods, cross-correlation methods, projection methods, Fourier transform methods, and K-nearest-neighbor clustering. Fourier transform methods are computationally very expensive and are therefore rarely used. Document images usually suffer some loss when scanned into a computer, and their edges are quite irregular. Finding the image contour with ordinary edge extraction adds a great deal of unnecessary computation. To address the high computational cost of typical skew detection algorithms, the thesis proposes a simple way of finding the edges: the edge contour of the document image need not be found precisely; it is enough to find the region that contains the image, which is its circumscribed rectangle, or bounding box. The thesis introduces a GA method to detect the skew angle. The method uses the bounding box area as the fitness value and only needs the top, bottom, left, and right coordinate values of the image, which greatly reduces computation. Experimental results show that the algorithm detects skew angles with fairly high accuracy.
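
For orientation, the ordinary projection profile cut that the thesis sets out to improve can be sketched roughly as follows. This is a minimal illustration of the classic recursive scheme under stated assumptions, not the improved algorithm proposed in the thesis: the function names (`profile`, `runs`, `xy_cut`), the `min_gap` parameter, and the list-of-lists binary image format are all hypothetical choices made for the example.

```python
def profile(img, box, axis):
    """Foreground count per row (axis=0) or per column (axis=1) inside box."""
    top, left, bottom, right = box
    if axis == 0:
        return [sum(img[r][left:right]) for r in range(top, bottom)]
    return [sum(img[r][c] for r in range(top, bottom)) for c in range(left, right)]


def runs(prof, min_gap):
    """Split a profile into (start, end) content runs separated by valleys.

    A valley is a stretch of at least `min_gap` empty bins; shorter gaps
    (for example between text lines) stay inside one run. `end` is exclusive.
    """
    out, start, gap = [], None, 0
    for i, value in enumerate(prof):
        if value > 0:
            if start is None:
                start = i
            elif gap >= min_gap:
                out.append((start, i - gap))
                start = i
            gap = 0
        elif start is not None:
            gap += 1
    if start is not None:
        out.append((start, len(prof) - gap))
    return out


def xy_cut(img, box=None, min_gap=2):
    """Recursively split a binary image (rows of 0/1) at empty profile valleys.

    Returns boxes as (top, left, bottom, right) with exclusive bottom/right.
    """
    if box is None:
        box = (0, 0, len(img), len(img[0]))
    top, left, bottom, right = box
    for axis in (0, 1):  # try the row profile first, then the column profile
        parts = runs(profile(img, box, axis), min_gap)
        if len(parts) > 1:
            regions = []
            for start, end in parts:
                if axis == 0:
                    sub = (top + start, left, top + end, right)
                else:
                    sub = (top, left + start, bottom, left + end)
                regions.extend(xy_cut(img, sub, min_gap))
            return regions
    # No valley in either direction: trim the box to its content and stop.
    rows = runs(profile(img, box, 0), min_gap)
    cols = runs(profile(img, box, 1), min_gap)
    if not rows:
        return []  # blank area
    return [(top + rows[0][0], left + cols[0][0], top + rows[0][1], left + cols[0][1])]
```

Because this sketch only cuts where a profile drops to zero, a skewed multi-region page whose X and Y profiles have no valleys comes back as a single box, which is the limitation the improved algorithm is meant to address.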

[Abstract] In the modern information society, computer technology reaches into every field of our lives. The Internet is increasingly popular, we depend on computers for information more than ever before, and a great deal of work has shifted onto computers. How to convert traditional paper documents into electronic text has therefore become a topic of concern. Daily life involves handling a large number of documents, including not only text documents but also images and mixed text-and-image documents, so entering them into a computer efficiently and accurately has become an urgent requirement.

The main purpose of this thesis is to study algorithms for page segmentation and skew detection of multi-region document images. The thesis summarizes the common page segmentation algorithms and gives the advantages and disadvantages of each. Generally, page segmentation methods can be classified into two types: structural analysis and texture analysis. Structural analysis includes top-down methods, bottom-up methods, and hybrids of the two. The thesis presents two top-down methods, run-length smoothing and projection profile cut, and two bottom-up methods, neighborhood line density and connected component analysis. In addition, it describes several algorithms commonly used in image segmentation.

Building on these algorithms, the thesis presents an improved projection profile cut algorithm. It solves the problem that the ordinary projection profile cut algorithm cannot handle complicated documents containing multiple skewed regions. First, the image is binarized and then denoised with the erosion and dilation operations of mathematical morphology. Applied to a document image, the improved algorithm can find cut-off points even when the image has no valley points along the X-axis and Y-axis. With these cut-off points the image is cut into small pieces, and the same operation is repeated until the regions are separated.

Skew estimation methods can be classified into five general categories: Hough transform, cross-correlation, projection profile, Fourier transform, and nearest-neighbor clustering, of which the Fourier transform is rarely used because of its high computational cost. During scanning, a document image inevitably loses some information, and its edges are not smooth. Using ordinary edge detection to find the outline adds a large amount of unnecessary computation. The thesis proposes a simple method for finding the outline of the image: there is no need to locate the edges accurately, only to find the area that contains the image. This area is called the bounding box. The thesis uses a genetic algorithm (GA) to detect the skew angle of the image. The method uses the area of the bounding box as its fitness function, for which only the four extreme coordinate values (top, bottom, left, and right) need to be found. This greatly reduces the amount of computation. Experimental results show that the proposed algorithm detects skew angles with good accuracy.
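
The bounding-box fitness idea can be illustrated with a short sketch. It assumes the foreground pixels are available as (x, y) points and, for brevity and repeatability, swaps the genetic algorithm for an exhaustive scan over candidate angles, so it shows only the fitness function, not the thesis's search procedure. The names `bbox_area`, `estimate_skew`, `max_angle`, and `step` are hypothetical and not taken from the thesis.

```python
import math


def bbox_area(points, angle_deg):
    """Area of the axis-aligned bounding box after rotating points by angle_deg.

    Only the four extremes (min/max x and y) of the rotated points are needed.
    """
    a = math.radians(angle_deg)
    cos_a, sin_a = math.cos(a), math.sin(a)
    xs = [x * cos_a - y * sin_a for x, y in points]
    ys = [x * sin_a + y * cos_a for x, y in points]
    return (max(xs) - min(xs)) * (max(ys) - min(ys))


def estimate_skew(points, max_angle=15.0, step=0.1):
    """Return the correction angle in degrees that minimises the box area.

    Assumption: a plain grid scan stands in for the GA; a GA would search
    the same angle range using the same fitness value.
    """
    n = int(round(2 * max_angle / step))
    candidates = [-max_angle + i * step for i in range(n + 1)]
    return min(candidates, key=lambda ang: bbox_area(points, ang))
```

The returned value is the rotation that deskews the points, so a rectangular block skewed by +5 degrees should yield roughly -5. The criterion relies on the content being roughly rectangular, since the bounding box of a rectangle is smallest when its sides are axis-aligned.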

[Keywords] multi-region document images; document segmentation; projection profile cut; skew detection; genetic algorithm (GA); bounding box

[Online publication submitter] Shandong Normal University

[Classification number] TP391.41
[Times cited] 3
[Downloads] 184
