èŠ‚ç‚¹æ–‡çŒ®

äººè„¸ç²¾ç¡®æ£€æµ‹ä¸Žå¤šåˆ†è¾¨çŽ‡ä¸‹è¯†åˆ«æ–¹æ³•ç ”ç©¶

Study on Accurate Face Detection and Multi-Resolution Face Recognition

åˆ†é¡µä¸‹è½½
åˆ†ç« ä¸‹è½½
æ•´æœ¬ä¸‹è½½
åœ¨çº¿é˜…è¯»
ä¸æ”¯æŒè¿…é›·ç‰ä¸‹è½½å·¥å…·ï¼Œè¯·å–æ¶ˆåŠ é€Ÿå·¥å…·åŽä¸‹è½½ã€‚

ã€ä½œè€…ã€‘ å¼ ç«‹åˆšï¼›

ã€ä½œè€…åŸºæœ¬ä¿¡æ¯ã€‘ è¥¿åŒ—å†œæž—ç§‘æŠ€å¤§å¦ ï¼Œ è®¡ç®—æœºåº”ç”¨æŠ€æœ¯ï¼Œ 2008ï¼Œ ç¡•å£«

ã€æ‘˜è¦ã€‘ äººè„¸æ£€æµ‹ä¸Žè¯†åˆ«æŠ€æœ¯æ˜¯ç”Ÿç‰©ç‰¹å¾é‰´åˆ«æŠ€æœ¯ä¸ç ”ç©¶æœ€å¤šå’Œæœ€çƒé—¨çš„æŠ€æœ¯ä¹‹ä¸€,å®ƒå·²ç»åœ¨èº«ä»½è®¤è¯ã€å®‰å…¨æ£€æŸ¥ã€ç½ªçŠ¯æŸ¥è¯¢ã€äººæœºäº¤äº’ç‰å¹¿æ³›é¢†åŸŸå¾—åˆ°äº†åˆæ¥åº”ç”¨ã€‚åœ¨äººè„¸æ£€æµ‹ç ”ç©¶ä¸,æž„å»ºå¿«é€Ÿè€Œç²¾ç¡®çš„æ£€æµ‹æ–¹æ³•ä¸€ç›´æ˜¯è¯¥é¢†åŸŸçš„ç ”ç©¶çƒç‚¹;åœ¨äººè„¸è¯†åˆ«ç ”ç©¶ä¸,å¦‚ä½•å…‹æœèŽ·å–å›¾åƒå…‰çº¿ã€è¡¨æƒ…ã€è§†è§’ç‰å˜åŒ–çš„å½±å“,æé«˜è¯†åˆ«çŽ‡åˆ™æ˜¯è¿«åˆ‡éœ€è¦ç ”ç©¶çš„é—®é¢˜ã€‚é’ˆå¯¹è¿™ä¸¤ä¸ªé—®é¢˜,æœ¬æ–‡ä»¥å½©è‰²å’Œç°è‰²æ£é¢äººè„¸é™æ€å›¾åƒä¸ºç ”ç©¶å¯¹è±¡,å°†æ¨¡å¼è¯†åˆ«ç†è®ºå’Œå›¾åƒå¤„ç†æŠ€æœ¯ç›¸ç»“åˆ,é‡ç‚¹ç ”ç©¶åŸºäºŽLVQäººå·¥ç¥žç»ç½‘ç»œ(ANN)çš„è‚¤è‰²åƒç´ æ£€æµ‹å’ŒåŸºäºŽæ¨¡æ¿åŒ¹é…çš„äººè„¸ç²¾ç¡®æ£€æµ‹æ–¹æ³•,ä»¥åŠåŸºäºŽå°æ³¢åŒ…åˆ†è§£(WPD)å’Œ(2D)2PCAçš„ä¸åŒå˜åŒ–æ¡ä»¶äººè„¸å›¾åƒçš„è¯†åˆ«æ–¹æ³•,ä¸ºå»ºç«‹å¿«é€Ÿç²¾ç¡®çš„äººè„¸è¯†åˆ«ç³»ç»Ÿæä¾›æŠ€æœ¯ä¾æ®ã€‚æœ¬æ–‡çš„ä¸»è¦ç ”ç©¶å·¥ä½œå¦‚ä¸‹:(1)é’ˆå¯¹çŽ°æœ‰äººè„¸æ£€æµ‹ç³»ç»Ÿæ£€æµ‹ç²¾åº¦å’Œé€Ÿåº¦ä¸å¹³è¡¡çš„é—®é¢˜,æå‡ºäº†ä¸€ç§åŸºäºŽLVQ ANNçš„è‚¤è‰²æ£€æµ‹ä¸ŽåŸºäºŽæ¨¡æ¿åŒ¹é…çš„ç²¾ç¡®äººè„¸æ£€æµ‹ç›¸ç»“åˆçš„æ–¹æ³•ã€‚è¯¥æ–¹æ³•åœ¨èŽ·å–è‚¤è‰²åƒç´ åŸºç¡€ä¸Š,é‡‡ç”¨åŸºäºŽå…¨å±€æœç´¢çš„Mosaicæ–¹æ³•é¢„å®šä½äººè„¸åŒºåŸŸã€‚ä»¥CVLäººè„¸åº“å›¾åƒå®žéªŒç»“æžœè¡¨æ˜Ž,LVQ ANNå®žçŽ°äº†è¾ƒæ»¡æ„çš„è‚¤è‰²åƒç´ æ£€æµ‹æ•ˆæžœ,åˆèƒ½æé«˜æ£€æµ‹é€Ÿåº¦;Mosaicæ–¹æ³•æˆåŠŸåœ°å®žçŽ°äº†äººè„¸åŒºåŸŸçš„é¢„å®šä½ã€‚(2)ä¸ºåœ¨é¢„å®šä½äººè„¸åŒºåŸŸä¸å®žçŽ°ç²¾ç¡®çš„äººè„¸æ£€æµ‹,é‡‡ç”¨ä¸€ç§åŸºäºŽæ¨¡æ¿åŒ¹é…çš„äººè„¸æ£€æµ‹æ–¹æ³•ã€‚è¯¥æ–¹æ³•é¦–å…ˆæž„å»ºåŸºäºŽRåˆ†é‡çš„æ ‡å‡†ç°åº¦äººè„¸æ¨¡æ¿,ç„¶åŽä»¥ç›¸å…³æ€§ç³»æ•°ä¸ºåŒ¹é…å‡†åˆ™,ä½¿ç”¨å¤šå°ºå¯¸äººè„¸æ¨¡æ¿å®žçŽ°ä¸åŒå°ºå¯¸äººè„¸çš„æ£€æµ‹ã€‚å®žéªŒç»“æžœè¡¨æ˜Ž,CVLäººè„¸åº“ä¸å¸¸æ€ç»„ã€å¾®ç¬‘ç»„å’Œå¤§ç¬‘ç»„çš„æ£ç¡®æ£€æµ‹çŽ‡åˆ†åˆ«ä¸º100%ã€100%å’Œ93.6%;ä¸Žä»…é‡‡ç”¨æ¨¡æ¿åŒ¹é…æ³•ç›¸æ¯”,æ£€æµ‹é€Ÿåº¦ä»Ž1870.6s/å¹…æé«˜åˆ°208.4s/å¹…ã€‚(3)ä¸ºè§£å†³ä»Žå›¾åƒå°æ³¢åŒ…åˆ†è§£å¾—åˆ°èŠ‚ç‚¹å›¾åƒä¸é€‰å–æ˜¾è‘—èŠ‚ç‚¹å›°éš¾çš„é—®é¢˜,æå‡ºäº†é‡‡ç”¨(2D)2 PCAå’Œæœ€é‚»è¿‘åˆ†ç±»å™¨æµ‹è¯•æ‰€æœ‰èŠ‚ç‚¹å›¾åƒçš„æ£ç¡®è¯†åˆ«çŽ‡(CRR),å¹¶ä¾æ®è¯†åˆ«çŽ‡é€‰å–å‡ºâ€œæˆåŠŸâ€èŠ‚ç‚¹å›¾åƒçš„æ–¹æ³•ã€‚(4)ä¸ºäº†æœ‰æ•ˆç»„åˆâ€œæˆåŠŸâ€èŠ‚ç‚¹çš„ç‰¹å¾çŸ©é˜µ,æå‡ºäº†ä¸€ç§æµ‹é‡æµ‹è¯•å›¾åƒå’Œåº“å›¾åƒè·ç¦»çš„æ–¹æ³•ã€‚è¯¥æ–¹æ³•ä»¥â€œæˆåŠŸâ€èŠ‚ç‚¹å›¾åƒç‰¹å¾çŸ©é˜µçš„åŠ æƒè·ç¦»å’Œ,åšä¸ºæµ‹è¯•å›¾åƒå’Œåº“å›¾åƒçš„è·ç¦»,æ—¢è€ƒè™‘äº†å…¨å±€å’Œå±€éƒ¨ç‰¹å¾,åˆè€ƒè™‘äº†ä¸åŒèŠ‚ç‚¹å›¾åƒçš„è¯†åˆ«è´¡çŒ®çŽ‡,äººè„¸è¯†åˆ«å®žéªŒç»“æžœè¡¨æ˜Žè¯¥æµ‹é‡æ–¹æ³•æœ‰æ•ˆåœ°æé«˜äº†è¯†åˆ«çŽ‡ã€‚(5)é’ˆå¯¹å˜åŒ–äººè„¸å›¾åƒè¯†åˆ«å›°éš¾çš„é—®é¢˜,æå‡ºäº†ä¸€ç§åŸºäºŽWPDå’Œ(2D)2PCAçš„äººè„¸è¯†åˆ«æ–¹æ³•ã€‚é¦–å…ˆ,å¯¹å›¾åƒè¿›è¡Œå°æ³¢åŒ…åˆ†è§£,é‡‡ç”¨(2D)2PCAå’Œæœ€é‚»è¿‘åˆ†ç±»å™¨å¾—åˆ°åèŠ‚ç‚¹çš„æ£ç¡®è¯†åˆ«çŽ‡,é€‰å–å…·æœ‰è¾ƒå¤§è¯†åˆ«çŽ‡çš„èŠ‚ç‚¹ä½œä¸ºâ€œæˆåŠŸâ€èŠ‚ç‚¹,ç„¶åŽ,ç»„åˆâ€œæˆåŠŸâ€èŠ‚ç‚¹çš„ç‰¹å¾çŸ©é˜µ,è®¡ç®—æµ‹è¯•å›¾åƒä¸Žåº“å›¾åƒçš„è·ç¦»,æœ€åŽ,é‡‡ç”¨æœ€é‚»è¿‘åˆ†ç±»å™¨å®žçŽ°è¯†åˆ«ã€‚(6)ä»¥MATLAB 7.0ä¸ºå·¥å…·ç¼–ç¨‹å®žçŽ°åŸºäºŽWPDå’Œ(2D)2PCAçš„äººè„¸è¯†åˆ«æ–¹æ³•,å¹¶ä»¥CMU PIEã€Yaleå’ŒUMISTäººè„¸åº“å›¾åƒä¸ºæµ‹è¯•å¯¹è±¡,åˆ†åˆ«è¿›è¡Œå…‰ç…§ã€è¡¨æƒ…å’Œè§†è§’å˜åŒ–å›¾åƒçš„è¯†åˆ«æ€§èƒ½å®žéªŒ,ä»¥åŽŸå›¾åƒé‡‡ç”¨(2D)2PCAå’Œæœ€é‚»è¿‘åˆ†ç±»å™¨çš„è¯†åˆ«çŽ‡ä¸ºå¯¹æ¯”æ ‡å‡†,ç»“æžœè¡¨æ˜Ž,æœ¬æ–‡æ–¹æ³•åœ¨3ä¸ªå®žéªŒä¸çš„è¯†åˆ«çŽ‡å‡é«˜äºŽæ ‡å‡†è¯†åˆ«çŽ‡,å…¶ä¸,å…‰ç…§å˜åŒ–æ—¶è¯†åˆ«èƒ½åŠ›æœ€å¥½,æœ€é«˜è¯†åˆ«çŽ‡ä¸º98.795%;è¡¨æƒ…å˜åŒ–å…¶æ¬¡,æœ€é«˜ä¸º89.796%,è§†è§’å˜åŒ–æœ€å·®,æœ€é«˜ä¸º36.047%ã€‚(7)å®žéªŒè¡¨æ˜Ž,è·ç¦»å°ºåº¦å’Œå°æ³¢å‡½æ•°çš„é€‰å–å¯¹å¤šåˆ†è¾¨çŽ‡ä¸‹èŠ‚ç‚¹çš„è¯†åˆ«çŽ‡æœ‰è¾ƒå¤§å½±å“ã€‚L1åœ¨ä¸»ä½“èŠ‚ç‚¹ä¸Šçš„è¯†åˆ«çŽ‡é«˜,è€ŒL2åœ¨ç»†èŠ‚èŠ‚ç‚¹ä¸Šçš„è¯†åˆ«çŽ‡é«˜;å°æ³¢å‡½æ•°å¯¹ä¸åŒæ¡ä»¶å›¾åƒè¯†åˆ«æ•ˆæžœä¹Ÿå„ä¸ç›¸åŒã€‚å› æ¤,è¦æ ¹æ®å›¾åƒå˜åŒ–æ¡ä»¶é€‰å–èŠ‚ç‚¹ã€è·ç¦»å°ºåº¦å’Œå°æ³¢å‡½æ•°ã€‚ç”±è¯•éªŒæå‡ºäº†å¦‚ä¸‹é€‰å–è§„åˆ™:å…‰ç…§å˜åŒ–æ—¶,é‡‡ç”¨L1å’ŒDaubechies4ä¸‹çš„A1ã€A2ã€H2ã€V2ã€HH2ç»„åˆ;è¡¨æƒ…å˜åŒ–æ—¶,é‡‡ç”¨L1å’ŒHaarä¸‹çš„A2ã€‚(8)æœ¬æ–‡æå‡ºçš„æ–¹æ³•åœ¨è§†è§’å˜åŒ–æ—¶æ•ˆæžœå¹¶ä¸ç†æƒ³,å°šéœ€ç ”ç©¶å¹¶å¯»æ±‚å…¶å®ƒç‰¹å¾æå–æ–¹æ³•ã€‚æ›´å¤š è¿˜åŽŸ

ã€Abstractã€‘ The technology of face detection and recognition is one of the most widely investigated technologies in the filed of Biometric Identification, and it has been used in such areas as identity authentication, security check-up, criminal enquiry, human-computer interaction etc.In regard to face detection, proposing a detection method with high speed and accuracy remains a research hot spot. As to face recognition, due to the great variations of illumination, expression, viewpoint, age, etc. of face images, obtaining high recognition rates under these conditions still is a difficult task and research focus point. With respect to these two problems, this dissertation takes colour and gray static frontal facial images as research objects, and studies face detection and recognition methods based on combination of pattern recognition theory and image processing technology. The main content includes skin pixel extraction method on the basis of LVQ artificial neural network, face detection method using template matching technology, and a novel face recognition method employing (2D)2PCA and WPD under varying illumination, expression and pose conditions. This research targets for providing technology supports for a high-speed and accurate face recognition system. The main contributions of this research include:(1) In order to solve the problem that detection speed and accuracy of current face detection system is unbalanced, a method that extracts skin pixels using LVQ ANN and detects face based on template matching is proposed. Firstly, An LVQ ANN is used to extract skin pixels. Then, a Mosaic method is prompted to primarily locate the face region through searching within the whole image. Experiments on images from CVL indicate that the LVQ ANN gains satisfactory extraction accuracy as well as high speed, and the Mosaic method could successfully pre-locate the face region.(2) A method using template matching is adopted to detect face in the pre-located face region. First of all, a gray standard face template is gained by using only R channel of RGB images.Then, face is detected in the pre-located face region using template matching by taking relativity coefficient as the matching rule. In the end,the location and size of this face are obtained. Experimental results of three testing sets ( normal, smile and big smile sets) from CVL database show that the adopted method obtains good detection accuracy as well as speed. In the concrete, it gains 100%,100% and 93.6% correct detection rates respectively. Meanwhile, its detection speed increases from 1870.6 second/image to 208.4 second/image comparied with only adopting template matching on the original image.(3) To address the difficult problem of choosing remarkable plots from all plots gained via WPD on the original image, a method that selectsâ€œsuccessfulâ€plots according to the correct recognition rates (CRRs) of plots is proposed. These CRRs are obtained by combining (2D)2PCA with the nearest neighborhood classifier.(4) Aiming at efficiently fusing the feature matrixes ofâ€œsuccessfulâ€plots, a distance measurement between testing image and database image is presented. The L1 or L2 distances between feature matrixes of selectedâ€œsuccessfulâ€plots of testing image and each database image are calculated, and then taking the weighted sum of these distances as final distance. This measurement preserves both the local and global features of image, meanwhile, it also takes the CRR contribution differences of different plots into consideration. Experimental results show this measurement improves recognition performance significantly.(5) Viewing the difficulty to recognize face in images taken under different conditions, a novel recognition method employing WPD and (2D)2PCA is developed. Firstly, 20 plots are obtained via two-level WPD on the original image. Secondly, the CRRs of these plots are gained by (2D)2PCA and the nearest neighborhood classifier, andâ€˜successfulâ€™plots are selected based on these CRRs. Thirdly, the distance between testing image and each database image is calcualted using the proposed distance measurement. Finally, the nearest neighborhood classifier is adopted for recognition on the basis of this distance.(6) The proposed recognition mehod is accomplished by MATLAB 7.0 and images from CMU PIE, Yale or UMIST databases are selected to test the recognition improvement of the proposed method under different illumination, expressions and poses respectively. The performance of (2D)2PCA on the original image is defined asâ€˜standardâ€™method. As the experimental results suggest, the proposed method obtains better performance thanâ€˜standardâ€™method under these three conditions. It performs best under different illumination whereas its performance decreases slightly under different expressions and is worst when poses change, and its highest CRRs are 98.795%, 89.796%, 36.047% respectively.(7) Observed from experimental results, the choice of distance metric has a significant effect on face recognition. In general, L1 shows higher CRRs on approximation plots, whilst L2 performs better on detailed plots. Similarly, the filters also show different performances under three different conditions. Therefore, distance metrics and filters should be selected according to these conditions. In the concrete, L1, Daubechies4, and A1, A2, H2, V2, HH2 are recommended to form the proposed method under different illumination, and L1, Haar and A2 are recommended to form the proposed method under different expressions. (8) The proposed method fails to gain satisfactory CRRs under different poses, and the highest record is 36.047%. Thus, it is necessary to seek other methods to extract facial features more efficiently.æ›´å¤š è¿˜åŽŸ

ã€å…³é”®è¯ã€‘ äººè„¸æ£€æµ‹ï¼› äººè„¸è¯†åˆ«ï¼› LVQäººå·¥ç¥žç»ç½‘ç»œï¼› æ¨¡æ¿åŒ¹é…ï¼› å°æ³¢åŒ…åˆ†è§£ï¼›
ã€Key wordsã€‘ face detectionï¼› face recognitionï¼› LVQ artificial neural networkï¼› template matchingï¼› wavelet packet decompositionï¼›

ã€ç½‘ç»œå‡ºç‰ˆæŠ•ç¨¿äººã€‘ è¥¿åŒ—å†œæž—ç§‘æŠ€å¤§å¦

ã€åˆ†ç±»å·ã€‘TP391.41
ã€è¢«å¼•é¢‘æ¬¡ã€‘3
ã€ä¸‹è½½é¢‘æ¬¡ã€‘236

æ‰“å°æœ¬é¡µ

èŠ‚ç‚¹æ–‡çŒ®ä¸ï¼š