èŠ‚ç‚¹æ–‡çŒ®

å›¾åƒæ•°æ®çš„è§†è§‰æ˜¾è‘—æ€§æ£€æµ‹æŠ€æœ¯åŠå…¶åº”ç”¨

Technologies and Applications of Visual Saliency Detection for Image Datum

åˆ†é¡µä¸‹è½½
åˆ†ç« ä¸‹è½½
æ•´æœ¬ä¸‹è½½
åœ¨çº¿é˜…è¯»
ä¸æ”¯æŒè¿…é›·ç‰ä¸‹è½½å·¥å…·ï¼Œè¯·å–æ¶ˆåŠ é€Ÿå·¥å…·åŽä¸‹è½½ã€‚

ã€ä½œè€…ã€‘ æ¨ä¿Šï¼›

ã€ä½œè€…åŸºæœ¬ä¿¡æ¯ã€‘ å›½é˜²ç§‘å¦æŠ€æœ¯å¤§å¦ ï¼Œ ä¿¡æ¯ä¸Žé€šä¿¡å·¥ç¨‹ï¼Œ 2007ï¼Œ åšå£«

ã€æ‘˜è¦ã€‘ å›¾åƒæ˜¯ä¿¡æ¯ç¤¾ä¼šçš„ä¸»è¦æ•°æ®èµ„æº,æµ·é‡çš„å›¾åƒæ•°æ®ç»™é«˜æ•ˆæ™ºèƒ½ä¿¡æ¯å¤„ç†å¸¦æ¥äº†æŒ‘æˆ˜ã€‚æˆ‘ä»¬æ³¨æ„åˆ°,äººä»¬å…³å¿ƒçš„å†…å®¹é€šå¸¸åªæ˜¯æ•´å¹…å›¾åƒæˆ–æ•´æ®µè§†é¢‘ä¸å¾ˆå°çš„ä¸€éƒ¨åˆ†,å› æ¤,æœ‰å¿…è¦ç›´æŽ¥æ£€æµ‹å‡ºå®ƒä»¬,ä»¥èŽ·å¾—é«˜æ•ˆçš„å¤„ç†ç»“æžœã€‚è¿™ç§å¤„ç†æ€æƒ³æºè‡ªäºŽäººç±»è§†è§‰çš„é€‰æ‹©æ€§æ³¨æ„æœºåˆ¶å’Œæ„ŸçŸ¥ç»„ç»‡åŽŸåˆ™ã€‚ç”±æ¤,æˆ‘ä»¬éœ€è¦é¢å¯¹å¦‚ä¸‹é—®é¢˜:å¦‚ä½•åˆ©ç”¨è§†è§‰æ˜¾è‘—æ€§çš„æ„ŸçŸ¥åŽŸç†?å¦‚ä½•æè¿°å’ŒåŒºåˆ†å›¾åƒä¿¡æ¯ä¸å¯èƒ½å˜åœ¨çš„å¤šç§æ˜¾è‘—æ€§äº‹ä»¶?å¦‚ä½•å°†è¿™äº›å¿ƒç†å¦åŽŸç†æœ‰æ•ˆåœ°å¼•å…¥å›¾åƒåˆ†æžè¿›ç¨‹?å¦‚ä½•ä»Žé™æ€å›¾åƒæˆ–è§†é¢‘åºåˆ—ä¸å¿«é€Ÿæ£€æµ‹ç”¨æˆ·å…³å¿ƒçš„æ˜¾è‘—åŒºåŸŸæˆ–äº‹ä»¶?æœ¬è®ºæ–‡å›´ç»•å…¶å±•å¼€äº†ç ”ç©¶ã€‚è®ºæ–‡ç¬¬ä¸€éƒ¨åˆ†é›†ä¸è®¨è®ºäº†è§†è§‰æ˜¾è‘—æ€§æ£€æµ‹çš„åŸºæœ¬å¤„ç†æ€æƒ³ã€‚é¦–å…ˆ,å›žé¡¾äº†è®¤çŸ¥å¿ƒç†å¦çš„ç›¸å…³ç†è®º,è®¨è®ºäº†è§†è§‰æ˜¾è‘—æ€§å’Œå›¾åƒå†…å®¹ä¹‹é—´çš„å¯¹åº”å…³ç³»,æå‡ºäº†ä¸€ç§åŸºäºŽå†…å®¹ç›¸å…³åº¦çš„è§†è§‰æ˜¾è‘—æ€§è¡¨è¿°ç–ç•¥,å°†å›¾åƒæ˜¾è‘—äº‹ä»¶åˆ†ä¸ºå¼±ç›¸å…³äº‹ä»¶å’Œå¼ºç›¸å…³äº‹ä»¶ä¸¤ç±»;ç»§è€Œ,åˆ†æžäº†æ³¨æ„ä¸Žç»„ç»‡çš„å±‚æ¬¡åä½œå…³ç³»,æå‡ºäº†ä¸€ç§å›¾åƒæ˜¾è‘—å†…å®¹çš„å±‚æ¬¡æè¿°ä¸Žç†è§£æ¡†æž¶;æŽ¥ç€,æå‡ºäº†ä¸€ç§åŸºäºŽæ³›åŒ–æ³¨æ„çš„å›¾åƒè§†è§‰æ˜¾è‘—æ€§æ£€æµ‹æ¨¡åž‹,ç”¨ä»¥å°†é€‰æ‹©æ€§æ³¨æ„æœºåˆ¶èžå…¥åˆ°æ•´ä¸ªå›¾åƒå¤„ç†è¿‡ç¨‹ä¸ã€‚è®ºæ–‡ç¬¬äºŒéƒ¨åˆ†é›†ä¸ç ”ç©¶äº†é¢å‘å›¾åƒæ•°æ®çš„è§†è§‰æ˜¾è‘—æ€§æ£€æµ‹æ–¹æ³•ã€‚é¦–å…ˆ,æå‡ºäº†ä¸€ç§åŸºäºŽæ³¨æ„çš„æ˜¾è‘—åŒºåŸŸåˆ†å‰²åŠå…¶ç‰¹å¾å¦ä¹ æ”¹è¿›ç®—æ³•,ç”¨ä»¥è§£å†³åŒºåŸŸå›¾åƒæ£€ç´¢ä¸çš„æ˜¾è‘—åŸºå…ƒæå–ä¸Žæè¿°é—®é¢˜ã€‚å…¶åŽ,ç ”ç©¶äº†é¥æ„Ÿå›¾åƒç›®æ ‡è¯†åˆ«çš„åº”ç”¨é—®é¢˜,(1)æå‡ºäº†ä¸€ç§äººé€ ç›®æ ‡æ£€æµ‹æ¨¡åž‹å’Œä¸€ç§åŒºåŸŸåˆ†å‰²ç®—æ³•,ç”¨ä»¥è§£å†³äººé€ ç›®æ ‡å€™é€‰åŒºçš„èšç„¦é—®é¢˜ã€‚è¯¥æ¨¡åž‹æ˜¯å±‚æ¬¡åŒ–ç»“æž„æ„ŸçŸ¥çš„,åŒºåŸŸåˆ†å‰²æ˜¯æ°´å¹³é›†æ¼”åŒ–;(2)æž„å»ºäº†ä¸€ç§åŸºäºŽç»“æž„ç¼–ç»„çš„äººé€ ç›®æ ‡åˆ†æžæ¡†æž¶ã€çº¿ç»“æž„åŸºå…ƒçš„æå–å’Œç¼–ç»„æ–¹æ³•,ç”¨ä»¥è§£å†³äººé€ ç»“æž„çš„æ„ŸçŸ¥ç»„ç»‡é—®é¢˜;(3)æå‡ºäº†ä¸€ç§åŸºäºŽæ˜¾è‘—åŸºå…ƒåˆ†ç±»æ„ŸçŸ¥ä¸Žç¼–ç»„çš„é¥æ„Ÿé“è·¯æ£€æµ‹å’Œæå–ç®—æ³•ã€‚éšåŽ,æå‡ºäº†ä¸€ç§åŸºäºŽç©ºæ—¶æ³¨æ„çš„è§†é¢‘æ˜¾è‘—äº‹ä»¶æ£€æµ‹æ¨¡åž‹,å¹¶ç”¨äºŽè§†é¢‘ç«ç„°äº‹ä»¶æ£€æµ‹å’Œç«ç„°æ˜¾è‘—åŒºåŸŸçš„æå–ã€‚è®ºæ–‡æœ€åŽæå‡ºäº†ä¸€ç§å›¾åƒæ•°æ®çš„è§†è§‰æ˜¾è‘—æ€§æ£€æµ‹æŠ€æœ¯å®žéªŒç³»ç»Ÿçš„è®¾è®¡æ–¹æ³•,è®¨è®ºäº†å…¶å¯èƒ½çš„æ½œåœ¨åº”ç”¨å’Œæ‰©å±•é—®é¢˜ã€‚è®ºæ–‡ä¸æå‡ºçš„å„ç§æ¨¡åž‹å’Œæ–¹æ³•åº”ç”¨äºŽå¤šç§ç±»åž‹çš„çœŸå®žå›¾åƒå’Œè§†é¢‘,èŽ·å¾—äº†é¢„æœŸçš„è¯•éªŒç»“æžœ,ä½“çŽ°å‡ºä¸€å®šçš„å¯è¡Œæ€§å’Œé€‚åº”æ€§ã€‚æ›´å¤š è¿˜åŽŸ

ã€Abstractã€‘ Images are the primary data resource in information society. Voluminous image datum results in the critical challenge for the high efficient information processing intelligently. We notice that the content that a person is interested in is often occupied a small part of an image or a period of video. It is necessary directly to detect the interested areas for high efficient processing results. The processing idea stems from the selective attention mechanism and the perceptual organization principle in the human vision system. Thus, the following items should be dealt with: How to utilize the perception principles of visual saliency? How to describe and distinguish the various saliency events contained in images? How to introduce above psychological theories into the procedure of image analysis effectively? How to extract the salient regions or events rapidly from an image or a video period, which are interested by almost users? This dissertation focuses on these aspects.The first part of this thesis emphasizes on the framework design for visual saliency detection. Firstly, after discussing the relation between visual saliency and image contents based on the theories of cognitive psychology, a new strategy for representing visual saliency is proposed based on content-correlation, by which image salient events can be divided into two classes, low correlative and high. Secondly, a hierarchical framework for describing and understanding image saliency is presented by analyzing the cooperation between attention and organization. Thirdly, an image saliency detection model is developed based on the general attention in order to put selective attention mechanism into the whole procedure of image processing.The second part of this thesis studies on the methods of visual saliency detection for image datum. Firstly, an improved attention driven algorithm for salient region segmentation and feature learning is proposed to obtain salient elements and description for region-based image retrieval. Secondly, the applications on target recognition in remote sensing images are researched. (1) A hierarchical model on man-made object detection is built up and a level set evolution algorithm for man-made region segmentation is developed to focus on salient man-made candidate areas. (2) a man-made object analysis framework based on structure grouping and a method for extracting and grouping line-like structural elements are adopted in order to implement perceptual grouping of man-made configuration. (3) A road detection and extraction method based on classified salient element perceptual grouping is developed. Then a video event detection model based on spatial-temporal attention is presented, and is applied to detect fire events from video images by extracting fire-like salient regions.The final part of this thesis offers a general design method of an experimental system on visual saliency detection in image data, and discusses some potential applications and other relative extend items.The models and algorithms developed in the thesis are applied to various real images and video and the expected results are obtained. It has some feasibility and adaptability.æ›´å¤š è¿˜åŽŸ

ã€å…³é”®è¯ã€‘ è§†è§‰æ˜¾è‘—æ€§ï¼› é€‰æ‹©æ€§æ³¨æ„æœºåˆ¶ï¼› æ³¨æ„ç„¦ç‚¹ï¼› æ„Ÿå…´è¶£åŒºåŸŸï¼› æ„ŸçŸ¥ç»„ç»‡ï¼› å±‚æ¬¡æŽ§åˆ¶ï¼› ç©ºæ—¶æ³¨æ„ï¼› ç‰¹å¾é›†æˆç†è®ºï¼› å›¾åƒå†…å®¹æ£€ç´¢ï¼› äººé€ ç›®æ ‡æ£€æµ‹ï¼› è§†é¢‘äº‹ä»¶æ£€æµ‹ï¼› è§†é¢‘ç«ç„°æ£€æµ‹ï¼›
ã€Key wordsã€‘ Visual Saliencyï¼› Selective Attention Mechanismï¼› Focus of attentionï¼› Region of Interestï¼› Perceptual Organizationï¼› Hierarchical Controlï¼› Spatio-temporal Attentionï¼› Feature Integration Theoryï¼› Content-based Image Retrievalï¼› Man-made Object Detectionï¼› Video Event Detectionï¼› Video Fire Detectionï¼›

ã€ç½‘ç»œå‡ºç‰ˆæŠ•ç¨¿äººã€‘ å›½é˜²ç§‘å¦æŠ€æœ¯å¤§å¦

ã€åˆ†ç±»å·ã€‘TP391.41
ã€è¢«å¼•é¢‘æ¬¡ã€‘26
ã€ä¸‹è½½é¢‘æ¬¡ã€‘2547
æ”»è¯»æœŸæˆæžœ

çŸ¥ç½‘èŠ‚ä¸‹è½½

èŠ‚ç‚¹æ–‡çŒ®ä¸ï¼š

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

æœ¬æ–‡çš„å¼•æ–‡ç½‘ç»œ

èŠ‚ç‚¹æ–‡çŒ®

èŠ‚ç‚¹æ–‡çŒ®

å›¾åƒæ•°æ®çš„è§†è§‰æ˜¾è‘—æ€§æ£€æµ‹æŠ€æœ¯åŠå…¶åº”ç”¨

Technologies and Applications of Visual Saliency Detection for Image Datum

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

å›¾åƒæ•°æ®çš„è§†è§‰æ˜¾è‘—æ€§æ£€æµ‹æŠ€æœ¯åŠå…¶åº”ç”¨