A Study of Image Classification and Semantic Image Annotation

【Author】 张磊

【Supervisor】 马军

【Author Information】 山东大学 (Shandong University), Computer System Architecture, 2008, Master's thesis

【Abstract】 With the development of multimedia technology and the spread of the Internet, it has become increasingly easy to acquire multimedia information of all kinds, of which images are the most numerous. How to retrieve the desired images quickly and effectively from large-scale image databases has therefore become a problem of growing concern. Content-Based Image Retrieval (CBIR) represents the content of an image by its low-level visual features (color, texture, shape, etc.). Because of the "semantic gap" between low-level visual features and the semantics of an image, traditional CBIR techniques cannot satisfy users who want to retrieve images by meaning. Classifying or annotating an image collection by semantics in advance can greatly improve the performance of a CBIR system. This thesis studies semantic image classification and automatic semantic annotation based on low-level visual features. The main contributions are as follows:

1. A rotation-invariant texture classification algorithm based on the Gabor transform and the Support Vector Machine (SVM) is proposed. To ensure that the classifier knows nothing about the features of rotated images, the training set is drawn from sub-images of the top halves of unrotated images and the test set from sub-images of the bottom halves of rotated images, so the experiment is a genuinely rotation-invariant one. Experiments on the Brodatz and UIUCTex datasets show that the method is effective and feasible; classification accuracy reaches 100% for some classes, and both accuracy and time complexity are better than those of the kNN (k-Nearest Neighbors) algorithm.

2. An image classification algorithm that combines MPEG-7 visual descriptors with an SVM is proposed. Since the image collection contains several semantic categories, a multi-class SVM is built using a multi-class classification strategy. Image features are extracted with the MPEG-7 Experimentation Model software. Experiments on the Corel 1K dataset use several color and texture descriptors and compare the classification accuracy and time complexity of each descriptor combined with the SVM classifier. The results also show that properly fusing several visual descriptors yields higher classification accuracy.

3. An automatic semantic image annotation algorithm based on SVM classifiers is proposed. The image features are global features based on MPEG-7 color and texture descriptors. Each annotation word corresponds to a binary SVM classifier, and a multi-class classifier over all the words is built with a multi-class classification strategy, establishing the link between low-level image features and semantic words. The SVM output takes the form of posterior probabilities, so the likelihood that an image belongs to each word class can be compared directly. Experiments are carried out on the Corel 5000 dataset: all keywords are first stemmed with the Porter stemming algorithm, words attached to too few images are discarded, and 82 words remain for building the classifiers. Two strategies for selecting annotation words are tried and their results compared. Annotation quality is evaluated with both per-word and per-image precision and recall, which makes the evaluation more objective and comprehensive.
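
A rough sketch of the Gabor + SVM texture-classification pipeline from contribution 1 is given below. It is a minimal illustration, not the thesis implementation: scikit-image's gabor filter bank and scikit-learn's SVC stand in for the author's Gabor transform and SVM, the pooling of filter-response statistics over orientations is only a simple stand-in for the rotation-invariant features actually used, and the variables train_images / test_images are hypothetical placeholders for the Brodatz / UIUCTex splits (top halves of unrotated images for training, bottom halves of rotated images for testing).

import numpy as np
from skimage.filters import gabor
from sklearn.svm import SVC

def gabor_features(image, frequencies=(0.1, 0.2, 0.3, 0.4), n_orientations=8):
    """Per-frequency statistics of Gabor magnitude responses, pooled over
    orientations (a crude stand-in for the rotation-invariant Gabor features
    used in the thesis)."""
    feats = []
    for f in frequencies:
        energies = []
        for k in range(n_orientations):
            theta = k * np.pi / n_orientations
            real, imag = gabor(image, frequency=f, theta=theta)
            energies.append(np.hypot(real, imag).mean())
        energies = np.asarray(energies)
        # pooling across orientations makes these per-frequency statistics
        # approximately insensitive to texture rotation
        feats.extend([energies.mean(), energies.std(), energies.max()])
    return np.asarray(feats)

def classify(train_images, train_labels, test_images):
    """Train an RBF-kernel SVM on Gabor features and predict test labels."""
    X_train = np.vstack([gabor_features(im) for im in train_images])
    X_test = np.vstack([gabor_features(im) for im in test_images])
    clf = SVC(kernel="rbf", C=10.0, gamma="scale")
    clf.fit(X_train, train_labels)
    return clf.predict(X_test)

The per-word annotation scheme of contribution 3 can be sketched in the same spirit: one binary SVM per keyword, trained one-vs-rest, with Platt-scaled probability outputs playing the role of the posterior-probability form mentioned above. The feature matrix is assumed to hold pre-extracted global color/texture descriptors (the thesis extracts them with the MPEG-7 Experimentation Model software); MIN_IMAGES, TOP_K and the helper names are illustrative assumptions rather than the thesis settings. For contribution 2, fusing several descriptors would simply amount to concatenating their vectors into such a feature matrix before training a multi-class SVM.

import numpy as np
from collections import Counter
from nltk.stem import PorterStemmer
from sklearn.multiclass import OneVsRestClassifier
from sklearn.preprocessing import MultiLabelBinarizer
from sklearn.svm import SVC

MIN_IMAGES = 20   # discard words attached to too few training images (assumed threshold)
TOP_K = 5         # number of annotation words kept per image (assumed)

def prepare_vocabulary(keyword_lists):
    """Porter-stem every keyword and keep only sufficiently frequent stems."""
    stemmer = PorterStemmer()
    stemmed = [[stemmer.stem(w) for w in words] for words in keyword_lists]
    counts = Counter(w for words in stemmed for w in set(words))
    vocab = sorted(w for w, c in counts.items() if c >= MIN_IMAGES)
    return stemmed, vocab

def train_annotator(features, stemmed_keywords, vocab):
    """One Platt-calibrated binary SVM per word (one-vs-rest), so the
    classifier outputs a posterior probability estimate for every word."""
    labels = MultiLabelBinarizer(classes=vocab).fit_transform(stemmed_keywords)
    clf = OneVsRestClassifier(SVC(kernel="rbf", probability=True))
    clf.fit(features, labels)
    return clf

def annotate(clf, vocab, features):
    """Return the TOP_K most probable annotation words for each image."""
    proba = clf.predict_proba(features)          # per-word posterior estimates
    top = np.argsort(-proba, axis=1)[:, :TOP_K]
    return [[vocab[j] for j in row] for row in top]

Annotation quality would then be scored with mean per-word and mean per-image precision and recall over the predicted word sets, as described in the abstract.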

  • 【Online Publication Contributor】 山东大学
  • 【Online Publication Year/Issue】 2009, Issue 01