èŠ‚ç‚¹æ–‡çŒ®

è§†é¢‘ä¸çš„æ–‡æœ¬æå–åŠå…¶åº”ç”¨

Text Extraction on Video and Its Application

åˆ†é¡µä¸‹è½½
åˆ†ç« ä¸‹è½½
æ•´æœ¬ä¸‹è½½
åœ¨çº¿é˜…è¯»
ä¸æ”¯æŒè¿…é›·ç‰ä¸‹è½½å·¥å…·ï¼Œè¯·å–æ¶ˆåŠ é€Ÿå·¥å…·åŽä¸‹è½½ã€‚

ã€ä½œè€…ã€‘ é™†å…µï¼›

ã€ä½œè€…åŸºæœ¬ä¿¡æ¯ã€‘ æ²³æµ·å¤§å¦ ï¼Œ è®¡ç®—æœºåº”ç”¨æŠ€æœ¯ï¼Œ 2007ï¼Œ ç¡•å£«

ã€æ‘˜è¦ã€‘ æ–‡æœ¬æ˜¯è§†é¢‘ä¸é‡è¦çš„å†…å®¹ä¿¡æ¯ã€‚è§†é¢‘ä¸æ–‡æœ¬çš„æ£€æµ‹å’Œè¯†åˆ«åœ¨è§†é¢‘åˆ†æžè¿‡ç¨‹ä¸èµ·åˆ°å¾ˆå¤§çš„ä½œç”¨ã€‚æ–‡æœ¬å¯ä»¥ä½œä¸ºè§†é¢‘ç‰‡æ–çš„å†…å®¹æ ‡è¯†å’Œç´¢å¼•ï¼Œä¾‹å¦‚åœ¨æ–°é—»è§†é¢‘ä¸å‡ºçŽ°çš„æ–°é—»æ‘˜è¦ï¼Œå¯ä»¥ä½œä¸ºè¯¥æ®µæ–°é—»å†…å®¹çš„æè¿°ï¼Œç”¨äºŽæ–°é—»è§†é¢‘èµ„æ–™çš„æ£€ç´¢ã€‚æ‰€ä»¥å¯¹è§†é¢‘æ–‡å—çš„æ£€æµ‹å’Œåˆ†æžæ˜¯è§†é¢‘åˆ†æžçš„é‡è¦å†…å®¹ã€‚è€Œæ£€æµ‹è§†é¢‘ä¸æ–‡å—çš„å‡ºçŽ°åŠå…¶å‡†ç¡®ä½ç½®ï¼Œå¹¶å°†æ–‡å—ä»Žå¤æ‚å¤šå˜çš„èƒŒæ™¯ä¸åˆ†å‰²å‡ºæ¥ï¼Œæ˜¯è§†é¢‘æ–‡å—åˆ†æžå¤„ç†çš„åŸºç¡€ã€‚æ–‡æœ¬ä¿¡æ¯æå–ç³»ç»Ÿä¸»è¦åŒ…æ‹¬æ–‡æœ¬æ£€æµ‹ï¼Œæ–‡æœ¬å®šä½ï¼Œæ–‡æœ¬è·Ÿè¸ªï¼Œæ–‡æœ¬æå–ï¼Œæ–‡æœ¬å¢žå¼ºå’ŒOCRè¯†åˆ«å…ä¸ªéƒ¨åˆ†ã€‚æœ¬æ–‡é‡ç‚¹ç ”ç©¶äº†æ–‡æœ¬å®šä½çš„æ–¹æ³•ï¼Œæå‡ºäº†ä¸€ç§åŸºäºŽæŠ•å½±åˆ†æžä¸Žæ”¯æŒå‘é‡æœºå¦ä¹ ç›¸ç»“åˆçš„æ–‡æœ¬å®šä½æ–¹æ³•ï¼Œè¯•éªŒè¡¨æ˜Žè¯¥æ–¹æ³•æ¯”å•çº¯çš„åŸºäºŽè¾¹ç¼˜çš„æ–¹æ³•æˆ–æ˜¯å¦ä¹ çš„æ–¹æ³•éƒ½è¦å¥½ã€‚é¦–å…ˆé‡‡ç”¨æŠ•å½±åˆ†æžçš„æ–¹æ³•å°†å¯èƒ½çš„æ–‡æœ¬åŒºåŸŸæå–å‡ºæ¥ï¼Œç„¶åŽå†é‡‡ç”¨åŸºäºŽæ”¯æŒå‘é‡æœºå¦ä¹ çš„æ–¹æ³•å°†æå–å‡ºæ¥çš„æ–‡æœ¬åŒºåŸŸä¸çš„è™šå‡æ–‡æœ¬åŒºåŸŸæŽ’é™¤æŽ‰ã€‚è¯¥æ–¹æ³•è™½ç„¶æ¯”åŸºäºŽè¾¹ç¼˜çš„æ–¹æ³•å¤šäº†ä¸€æ¥ï¼Œä½†æ–‡æœ¬åŒºåŸŸçš„æ£€å‡†çŽ‡æœ‰äº†è¾ƒå¤§çš„æé«˜ã€‚ä¸Žä¸€èˆ¬çš„åŸºäºŽå¦ä¹ çš„æ–¹æ³•ç›¸æ¯”ï¼Œè¯¥æ–¹æ³•ä¸å¿…å¯¹æ•´ä¸ªå›¾åƒåŒºåŸŸè¿›è¡Œç‰¹å¾è®¡ç®—ï¼Œå‡å°äº†è®¡ç®—çš„æ—¶é—´å¤æ‚åº¦ã€‚åœ¨ä½¿ç”¨æ”¯æŒå‘é‡æœºè¿›è¡Œæ–‡æœ¬åˆ†ç±»æ—¶æœ¬æ–‡é‡‡ç”¨äº†å°æ³¢ï¼Œè§’ç‚¹ï¼Œæ‰«æçº¿å’ŒåŒºåŸŸå†…è¾¹ç¼˜ç‚¹çš„é‡å¿ƒä½ç½®ç‰ç‰¹å¾ã€‚è®ºæ–‡æœ€åŽç”¨è¯¥æ–¹æ³•ç”¨äºŽå¹¿å‘Šè§†é¢‘æ–‡æœ¬çš„æ£€æµ‹ï¼Œé‡‡ç”¨åŸºäºŽå¤šåˆ†è¾¨çŽ‡åˆ†æžçš„æ–¹æ³•å®šä½å¹¿å‘Šæ–‡æœ¬ã€‚é€šè¿‡æ¯”è¾ƒå‘çŽ°ï¼Œæ–°é—»ä¸çš„æ–‡æœ¬å‡ºçŽ°ä½ç½®æ¯”è¾ƒå›ºå®šè€Œä¸”å„ä¸ªç”µè§†å°çš„æ–‡æœ¬éƒ½æœ‰å„è‡ªå›ºå®šçš„æ ¼å¼ï¼Œä½†å¹¿å‘Šä¸çš„æ–‡æœ¬æ— è®ºæ˜¯å¤§å°ï¼Œå—ä½“éƒ½æ˜¯ä¸ä¸€æ ·çš„ï¼Œåˆ©ç”¨è¿™ä¸€å·®åˆ«å¯ä»¥å¯¹å¹¿å‘Šç‰‡æ–çš„èµ·å§‹ä½ç½®æœ‰ä¸€ä¸ªæ›´åŠ ç²¾ç¡®çš„å®šä½ã€‚å®žéªŒç»“æžœè¡¨æ˜Žè¯¥æ–¹æ³•å¯ä»¥å¾ˆå¥½çš„å®šä½å‡ºå¹¿å‘Šæ–‡æœ¬ã€‚æ›´å¤š è¿˜åŽŸ

ã€Abstractã€‘ Text is part of the important information in videos. Text detection and recognition in videos can help a lot in video content analysis and understanding, since text can provide concise and direct description of the stories presented in the videos. In digital news videos, the superimposed captions usually present the involved personâ€™s name and the summary of the news event. Hence, the recognized text can become a part of index in a video retrieval system. The importance of video can be estimated by the recognized text. So text detection and analysis is important in video analysis. Detecting the accurate position of text in the video and segmenting text from the complex background are the foundation of video text analysis.The text information extraction system can be divided into the following six parts: text detection, text localization, text tracking, text extraction, text enhancement and text recognition. This thesis focuses on the research in text localization. The projection analysis of edge based method and the learning of support vector machine based method are combined to localize text on videos. It has shown good results in the experiments compared to the simple edge based method and the learning based method. The text localization can be divided into two steps. In the first step, the potentially text area are extracted by the edge method. In the second step, support vector machine is used to classify the actual text areas and the false text areas. The false text areas are removed in this step. This method improves the precision rate of text areas. Compared to the learning based method, this method doesnâ€™t need to compute the texture of the whole image. Instead, it only computes the texture of the text areas. This algorithm can reduce the time complexity. The textures used in the support vector machine are wavelet, corner, line and the center of gravity of the text areas.This method is applied in localizing text in advertisements. A multi-resolution based method is used to localize text in advertisements. It is a part of the advertisements detection system. It is obvious that texts in the news are more formal and its positions of texts are in a certain areas. But texts in the advertisements are different from each other in size and style. The method can give out a more accurate position of advertisements. And it has shown good results in the experiments.æ›´å¤š è¿˜åŽŸ

ã€å…³é”®è¯ã€‘ è§†é¢‘æ£€ç´¢ï¼› è§†é¢‘æ–‡æœ¬å®šä½ï¼› æŠ•å½±ï¼› æ”¯æŒå‘é‡æœºï¼› å¹¿å‘Šè§†é¢‘æŽ¢æµ‹ï¼›
ã€Key wordsã€‘ Video Retrievalï¼› Video Text Localizationï¼› Projectionï¼› Support Vector Machineï¼› Advertisement Detectionï¼›

ã€ç½‘ç»œå‡ºç‰ˆæŠ•ç¨¿äººã€‘ æ²³æµ·å¤§å¦

ã€åˆ†ç±»å·ã€‘TP391.41
ã€è¢«å¼•é¢‘æ¬¡ã€‘4
ã€ä¸‹è½½é¢‘æ¬¡ã€‘334

çŸ¥ç½‘èŠ‚ä¸‹è½½

èŠ‚ç‚¹æ–‡çŒ®ä¸ï¼š

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

æœ¬æ–‡çš„å¼•æ–‡ç½‘ç»œ

èŠ‚ç‚¹æ–‡çŒ®

èŠ‚ç‚¹æ–‡çŒ®

è§†é¢‘ä¸­çš„æ–‡æœ¬æå–åŠå…¶åº”ç”¨

Text Extraction on Video and Its Application

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

è§†é¢‘ä¸çš„æ–‡æœ¬æå–åŠå…¶åº”ç”¨