èŠ‚ç‚¹æ–‡çŒ®

å›¾åƒå±€éƒ¨ä¸å˜é‡ç‰¹å¾æè¿°æ–¹æ³•ç ”ç©¶

Study on Method of Image Local Feature Description

åˆ†é¡µä¸‹è½½
åˆ†ç« ä¸‹è½½
æ•´æœ¬ä¸‹è½½
åœ¨çº¿é˜…è¯»
ä¸æ”¯æŒè¿…é›·ç‰ä¸‹è½½å·¥å…·ï¼Œè¯·å–æ¶ˆåŠ é€Ÿå·¥å…·åŽä¸‹è½½ã€‚

ã€ä½œè€…ã€‘ æ¢èƒ¤ç¨‹ï¼›

ã€ä½œè€…åŸºæœ¬ä¿¡æ¯ã€‘ é²ä¸œå¤§å¦ ï¼Œ è®¡ç®—æœºåº”ç”¨æŠ€æœ¯ï¼Œ 2012ï¼Œ ç¡•å£«

ã€æ‘˜è¦ã€‘ ç”¨æœºå™¨æ¥æ„ŸçŸ¥å’Œè¯†åˆ«è‡ªç„¶ç•Œçš„ç‰©ä½“å’Œåœºæ™¯ï¼Œå³ä½¿æ˜¯å¾ˆç®€å•çš„ç‰©ä½“ï¼Œå¯¹äºŽè®¡ç®—æœºæ¥è¯´ä¹Ÿæ˜¯å¾ˆå›°éš¾çš„äº‹æƒ…ã€‚éš¾ç‚¹æ˜¯å¦‚ä½•æ¥è¡¨è¾¾è‡ªç„¶ç•Œçš„ç‰©ä½“ï¼Œæ—¢è¦åŒºåˆ†å…¶ä»–ç‰©ä½“ï¼Œè¿˜è¦å…‹æœç”±äºŽå°ºåº¦å˜åŒ–ï¼Œç¼©æ”¾ï¼Œå¹³ç§»å¸¦æ¥çš„å·®å¼‚æ€§ã€‚é€‰æ‹©ä»€ä¹ˆæ ·çš„ç‰¹å¾æ¥æè¿°å¾…è¯†åˆ«çš„ç‰©ä½“æ˜¯è®¡ç®—æœºè§†è§‰çš„å…³é”®ã€‚è¿‘å‡ å¹´ï¼Œå›¾åƒå±€éƒ¨ç‰¹å¾çš„å‡ºçŽ°ä½¿è®¡ç®—æœºè§†è§‰çš„ç ”ç©¶å–å¾—äº†é‡å¤§è¿›å±•ã€‚å±€éƒ¨ç‰¹å¾æ ¹æ®å›¾åƒå±€éƒ¨ä¿¡æ¯é‡‡ç”¨å¤šå°ºåº¦åˆ†æžï¼Œç»Ÿè®¡å¦ç‰ç›¸å…³æŠ€æœ¯å½¢æˆç‰¹å¾å‘é‡ï¼Œå¯¹å›¾åƒå½¢æˆäº†æ›´å¥½çš„è¡¨è¾¾ï¼Œå¹¿æ³›åº”ç”¨äºŽç‰©ä½“è¯†åˆ«ã€é…å‡†ã€å…¨æ™¯å›¾åƒæ‹¼æŽ¥å’Œæœºå™¨äººè§†è§‰ç‰é¢†åŸŸã€‚æœ¬æ–‡å¯¹å½“å‰çš„å„ç§å›¾åƒå±€éƒ¨ç‰¹å¾è¿›è¡Œäº†åˆ†æžï¼Œé€šè¿‡å¯¹ä¸»æµçš„å±€éƒ¨ç‰¹å¾Harrisè§’ç‚¹æ£€æµ‹ã€å°ºåº¦ä¸å˜ç‰¹å¾è½¬æ¢(SIFT)ã€åŠ é€Ÿé²æ£’ç‰¹å¾(SURF)ã€æœ€å¤§ç¨³å®šæžè‡´åŒºåŸŸ(MSER)è¿›è¡Œå¯¹æ¯”åˆ†æžï¼Œé€‰æ‹©å½“å‰æœ€æµè¡Œçš„å°ºåº¦ä¸å˜ç‰¹å¾è½¬æ¢ç®—æ³•ä¸ºç€æ‰‹ç‚¹ï¼Œé’ˆå¯¹å½“å‰ç®—æ³•å˜åœ¨çš„ä¸è¶³æå‡ºäº†æ”¹è¿›ï¼Œå¹¶å°†æ”¹è¿›çš„ç®—æ³•åº”ç”¨äºŽè¯åŒ…æ¨¡åž‹çš„åœºæ™¯å›¾åƒåˆ†ç±»ã€‚å…·ä½“å†…å®¹å¦‚ä¸‹ï¼š1. Loweæå‡ºçš„å°ºåº¦ä¸å˜è½¬æ¢ç®—æ³•æ•ˆçŽ‡æ¯”è¾ƒä½Žï¼Œæ— æ³•æ»¡è¶³å®žæ—¶æ€§çš„éœ€è¦ã€‚è®ºæ–‡æå‡ºäº†ä¸€ç§åŸºäºŽåœ†æŠ•å½±çš„å°ºåº¦ä¸å˜è½¬æ¢ç®—æ³•ï¼Œé€šè¿‡å¯¹æŠ•å½±åŽçš„å±€éƒ¨åŒºåŸŸçš„å¿«é€Ÿå‚…é‡Œå¶å˜æ¢åŽè®¡ç®—ä¸€æ¬¡è°æ³¢åˆ†é‡ï¼Œå¯¹å°ºåº¦ä¸å˜è½¬æ¢ç®—æ³•æå–çš„ç‰¹å¾ç‚¹è¿›è¡Œé¢„ç›é€‰ã€‚é€šè¿‡å¯¹ç›é€‰åŽçš„ç‰¹å¾ç‚¹è®¡ç®—å±€éƒ¨åŒºåŸŸæè¿°åè¿›è¡Œå›¾åƒçš„åŒ¹é…ã€‚å®žéªŒç»“æžœè¡¨æ˜Žï¼šç»è¿‡é¢„ç›é€‰ï¼Œè¯¥ç®—æ³•å¯ä»¥æœ‰æ•ˆçš„å‡å°‘å¾…åŒ¹é…ç‰¹å¾ç‚¹çš„ä¸ªæ•°ï¼Œæé«˜ç®—æ³•çš„æ‰§è¡Œæ•ˆçŽ‡å’Œé…å‡†çŽ‡ã€‚2.è¯åŒ…æ¨¡åž‹é€šè¿‡å¯¹SIFTç®—æ³•æ£€æµ‹çš„ç‰¹å¾ç‚¹åœ¨ç‰¹å¾ç©ºé—´èšç±»æ¥æž„é€ è§†è§‰å•è¯ã€‚æœ¬æ–‡æå‡ºä¸€ç§åŸºäºŽFan-SIFTçš„è¯åŒ…æ¨¡åž‹ï¼Œåˆ©ç”¨Fan-SIFTå¯¹ä¸åŒè§’åº¦çš„LOGç®—åå“åº”å€¼ï¼Œæ£€æµ‹å‡ºå›¾åƒä¸çš„æ‰‡å½¢æ–‘ç‚¹å’Œåœ†å½¢æ–‘ç‚¹ï¼Œå¹¶åˆ©ç”¨æ‰‡å½¢åŒºåŸŸæž„é€ çš„ç‰¹å¾æè¿°ç¬¦æ¥æž„é€ è§†è§‰å•è¯ã€‚ç›¸æ¯”äºŽSIFTç®—æ³•åªæ£€æµ‹å›¾åƒä¸çš„åœ†å½¢æ–‘ç‚¹æž„é€ çš„å•è¯ï¼Œæœ¬æ–‡ç®—æ³•æž„é€ çš„è§†è§‰å•è¯æ›´åŠ å…·æœ‰é’ˆå¯¹æ€§ã€‚åœ¨13ç±»åœºæ™¯å›¾åƒå’ŒCaltech101æ•°æ®é›†ä¸Šè¿›è¡Œå®žéªŒè¡¨æ˜Žï¼ŒåŸºäºŽFan-SIFTç®—æ³•ç”Ÿæˆçš„è¯åŒ…æ¨¡åž‹å¯¹åœºæ™¯å›¾åƒçš„åˆ†ç±»å…·æœ‰æ›´é«˜çš„å‡†ç¡®çŽ‡ã€‚å¦å¤–ï¼Œæœ¬æ–‡å¯¹ä¸»æµçš„å›¾åƒæ–‘ç‚¹å±€éƒ¨ç‰¹å¾è¿›è¡Œäº†å¯¹æ¯”å®žéªŒï¼Œé‡ç‚¹å…³æ³¨äº†å„ç§ç‰¹å¾åœ¨å°ºåº¦ç¼©æ”¾ã€è§†è§’å˜åŒ–ã€å…‰ç…§å˜åŒ–ã€å›¾åƒæ¨¡ç³Šæƒ…å†µä¸‹çš„åŒ¹é…ç»“æžœã€‚å¯¹ä¸»æµæ–‘ç‚¹ç‰¹å¾çš„æè¿°æ€§èƒ½æœ‰äº†ç›´è§‚çš„è¡¨ç¤ºã€‚æ›´å¤š è¿˜åŽŸ

ã€Abstractã€‘ Itâ€™s very complicated for computer when it comes to the ability of perception andrecognition, even itâ€™s a very simple object to be recognized. The most difficult point forcomputer recognition is how to express the object. It needs the ability to distinguish oneobject from another no matter its different size, different view and different position. Featureselection is the key process in computer vision which can greatly affects the results. Duringthe past decade, the progresses of local feature prompt computer vision research. With thehelp of multi-scale analysis technology and statistics technology, people draw various kind oflocal image feature from each block of image which has a better express of image. Itâ€™s widelyused in the area of object recognition, registration, image stitch and robot vision etc.We have a deep research on various local image features. A comparison study hasdeveloped on Harris, SIFT, SURF and MSER. SIFT algorithm is selected as the start point forits good effect. We put forward some improvement according the shortcomings. Whatâ€™s more,the improved feature is used on scene image classification and the experiments demonstrateits good effect on image classification. The details and the innovation are as follows:1. The scale-invariant feature transform algorithm proposed by Lowe has a low efficiencyand restricts its application. The algorithm based on rounded projection proposed in our paperapplies Fast Fourier Transform algorithm (FFT) on the projection function to compute thefirst harmonic components which are used to pre-screen the feature points that extracted bySIFT algorithm. After the pre-screening, we get the descriptors according to the local areafeatures of left points. The experiments shows that it has a smaller number of feature pointsthan the original SIFT algorithm, so it improves the efficiency and has a better performance.2. The model of word bags use the SIFT descriptors to formulate the image vision word bycluster method. SIFT algorithm is a detector of blob region of image by LOG kernel function.Instead, we substitute the SIFT detector for Fan-SIFT algorithm. Fan-SIFT not only detectsthe blob region in image, but also the fan region. Accordingly, we use a feature descriptor offan shapes. Fan-SIFT can find different kinds of blob region in image and form the descriptorswith a smaller dimension. Experiments are processed on the data set of13scene images anddata set of Caltech101. The results show a better effect on image classification.We also process the comparison experiments on the blob image features which focus on the match results of different scale, different size and different position, analyze the repeatabilityof different feature detectors. We give an intuitive description on the quality of blob imagefeatures.æ›´å¤š è¿˜åŽŸ

ã€å…³é”®è¯ã€‘ å›¾åƒå±€éƒ¨ä¸å˜é‡ç‰¹å¾ï¼› å°ºåº¦ä¸å˜ç‰¹å¾ï¼› åœ†æŠ•å½±ï¼› è¯åŒ…æ¨¡åž‹ï¼› åœºæ™¯å›¾åƒåˆ†ç±»ï¼›
ã€Key wordsã€‘ image local featureï¼› SIFTï¼› rounded projectionï¼› word bagsï¼› scene imageclassificationï¼›

ã€ç½‘ç»œå‡ºç‰ˆæŠ•ç¨¿äººã€‘ é²ä¸œå¤§å¦

ã€åˆ†ç±»å·ã€‘TP391.41
ã€è¢«å¼•é¢‘æ¬¡ã€‘4
ã€ä¸‹è½½é¢‘æ¬¡ã€‘174
æ”»è¯»æœŸæˆæžœ

çŸ¥ç½‘èŠ‚ä¸‹è½½

èŠ‚ç‚¹æ–‡çŒ®ä¸ï¼š

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

æœ¬æ–‡çš„å¼•æ–‡ç½‘ç»œ

èŠ‚ç‚¹æ–‡çŒ®

èŠ‚ç‚¹æ–‡çŒ®

å›¾åƒå±€éƒ¨ä¸å˜é‡ç‰¹å¾æè¿°æ–¹æ³•ç ”ç©¶

Study on Method of Image Local Feature Description

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

å›¾åƒå±€éƒ¨ä¸å˜é‡ç‰¹å¾æè¿°æ–¹æ³•ç ”ç©¶