èŠ‚ç‚¹æ–‡çŒ®

åŸºäºŽå¦ä¹ çš„äººè„¸è¯†åˆ«ç ”ç©¶

Research on Learning-Based Face Recognition

åˆ†é¡µä¸‹è½½
åˆ†ç« ä¸‹è½½
æ•´æœ¬ä¸‹è½½
åœ¨çº¿é˜…è¯»
ä¸æ”¯æŒè¿…é›·ç‰ä¸‹è½½å·¥å…·ï¼Œè¯·å–æ¶ˆåŠ é€Ÿå·¥å…·åŽä¸‹è½½ã€‚

ã€ä½œè€…ã€‘ å”ä¸‡å¢žï¼›

ã€ä½œè€…åŸºæœ¬ä¿¡æ¯ã€‘ æµ™æ±Ÿå¤§å¦ ï¼Œ æŽ§åˆ¶ç†è®ºä¸ŽæŽ§åˆ¶å·¥ç¨‹ï¼Œ 2008ï¼Œ åšå£«

ã€æ‘˜è¦ã€‘ äººè„¸è¯†åˆ«æ˜¯ç”Ÿç‰©ç‰¹å¾è¯†åˆ«çš„å…³é”®æŠ€æœ¯ä¹‹ä¸€,å…¶æœ€æ ¸å¿ƒçš„ä¸¤å¤§æ¥éª¤ä¸ºäººè„¸æ£€æµ‹ä¸Žè¯†åˆ«ã€‚å®ƒçš„ä¸»è¦ä»»åŠ¡å°±æ˜¯ä»Žå›¾åƒæˆ–è§†é¢‘ä¸å‡†ç¡®åœ°æ‰¾å‡ºäººè„¸å¹¶ç¡®å®šå…¶èº«ä»½ã€‚æœ¬æ–‡ä»ŽåŸºäºŽå¦ä¹ çš„è§’åº¦å‡ºå‘,åœ¨èšç±»ã€æµå½¢ã€åç©ºé—´å¦ä¹ ç‰æœºå™¨å¦ä¹ æ–¹æ³•çš„åŸºç¡€ä¸Š,æå‡ºäº†æ”¹è¿›æˆ–æ–°çš„äººè„¸è¯†åˆ«æ–¹æ³•,å¹¶å’Œå…¶ä»–äººè„¸è¯†åˆ«æ–¹æ³•ä½œæ¯”è¾ƒ,å–å¾—äº†è¾ƒå¥½çš„æ•ˆæžœã€‚æœ¬æ–‡ä¸»è¦ç ”ç©¶äº†ä»¥ä¸‹å‡ æ–¹é¢çš„å†…å®¹:1)é’ˆå¯¹äººè„¸æ£€æµ‹é—®é¢˜,é€šè¿‡è‚¤è‰²æ³•åˆ†ç¦»äººè„¸ç›®æ ‡ä¸ŽèƒŒæ™¯åŽ,æå‡ºä¸¤ç§æ–¹æ³•å®šä½äººè„¸å³:ç§¯åˆ†æŠ•å½±â€”é«˜æ–¯æ›²çº¿æ³•å’Œæ”¹è¿›å‡æ³•èšç±»æ³•,åˆ†åˆ«å¯¹åº”å•äººè„¸å’Œå¤šäººè„¸æ£€æµ‹ã€‚ç§¯åˆ†æŠ•å½±â€”é«˜æ–¯æ›²çº¿æ³•å°†äººè„¸äºŒå€¼å›¾åƒåˆ†åˆ«åœ¨Xã€Yè½´ç§¯åˆ†æŠ•å½±,æ ¹æ®æŠ•å½±æ›²çº¿åˆ†åˆ«è®¡ç®—ç›¸åº”çš„é«˜æ–¯æ›²çº¿,é€šè¿‡æ±‚è§£é«˜æ–¯æ–¹ç¨‹å¿«é€Ÿå¾—åˆ°è¾ƒä¸ºå‡†ç¡®çš„äººè„¸åŒºåŸŸã€‚æ”¹è¿›å‡æ³•èšç±»æ³•è¿ç”¨ä¸€ç§æ–°çš„è·ç¦»å®šä¹‰,é€šè¿‡å›¾åƒä¸äººè„¸ç›®æ ‡çš„ç»Ÿè®¡ä¿¡æ¯å¯¹ç®—æ³•çš„å…³é”®å‚æ•°è¿›è¡Œé¢„ä¼°è®¡,èƒ½è‡ªåŠ¨ç»“æŸäººè„¸ç›®æ ‡æœç´¢ã€‚ä¸‹é‡‡æ ·é™ä½Žå‡æ³•èšç±»çš„è¿ç®—é‡,æé«˜äº†ç®—æ³•çš„è¿è¡Œé€Ÿåº¦,åŒæ—¶éªŒè¯äº†å‡æ³•èšç±»åœ¨è§†é¢‘è¿åŠ¨ç›®æ ‡å®šä½ä¸çš„è‰¯å¥½åº”ç”¨æ•ˆæžœã€‚ç²¾ç¡®åœ°æ£€æµ‹äººè„¸éœ€è¦å¯¹ä¸åŒå§¿æ€çš„äººè„¸è¿›è¡Œå§¿æ€è§’ä¼°è®¡,æœ¬æ–‡åœ¨è‚¤è‰²åŒºåŸŸæå–çš„åŸºç¡€ä¸Š,æå‡ºå§¿æ€è§’åº¦ä¼°è®¡ç›®æ ‡å‡½æ•°,å¹¶è®¨è®ºäº†ä¸¤ç§å¯»ä¼˜æ–¹æ³•,å³æ¢¯åº¦ä¸‹é™æ³•å’Œæ¬¡å…¨å±€æžšä¸¾å¯»ä¼˜æ³•,æ¥ä¼°è®¡å§¿æ€è§’åº¦å€¼ã€‚æ ¹æ®ä¼°è®¡çš„å§¿æ€è§’åº¦ä½œç›¸åº”çš„æ—‹è½¬æ ¡æ£,åœ¨æ ¡æ£åŽçš„åŒºåŸŸåˆ©ç”¨çœ¼ç›å’Œå˜´çš„è‰²åº¦å’Œäº®åº¦ç‰¹ç‚¹åˆ†åˆ«æž„é€ æ˜ å°„å›¾,æå–å‡ºçœ¼ç›å’Œå˜´,å¹¶éªŒè¯äººè„¸ã€‚2)é’ˆå¯¹æµå½¢å¦ä¹ çš„äººè„¸è¯†åˆ«é—®é¢˜,å›´ç»•æµå½¢å¦ä¹ æ–¹æ³•çš„æœ¬è´¨è¦ç´ ,å³:(1)å¦‚ä½•æž„é€ è¿‘é‚»ç»“æž„å›¾;(2)ä»¥ä»€ä¹ˆæ ·çš„è·ç¦»æµ‹åº¦æ¥è¡¡é‡äººè„¸æ ·æœ¬çš„è¿‘é‚»;(3)éµå¾ªä»€ä¹ˆæ ·çš„ç›®æ ‡å‡†åˆ™æ¥æž„é€ ä½Žç»´åµŒå…¥,ä»Žä¸‰æ–¹é¢å…¥æ‰‹,è¡ç”Ÿäº†ä¸å¿ƒè¿‘é‚»åµŒå…¥å¦ä¹ å’Œé‰´åˆ«çŸ¢é‡è§’åµŒå…¥å¦ä¹ ä¸¤ç§æ–°çš„æµå½¢å¦ä¹ æ–¹æ³•ã€‚ä¸å¿ƒè¿‘é‚»åµŒå…¥çš„å¦ä¹ ç®—æ³•,ä¸Žç»å…¸çš„å±€éƒ¨çº¿æ€§åµŒå…¥å’Œä¿å±€æ˜ å°„ä¸åŒ,å®ƒæ˜¯ä¸€ç§æœ‰ç›‘ç£çš„çº¿æ€§é™ç»´æ–¹æ³•ã€‚è¯¥æ–¹æ³•é¦–å…ˆé€šè¿‡è®¡ç®—å„ç±»æ ·æœ¬ä¸å¿ƒ,å¹¶å¼•å…¥ä¸å¿ƒè¿‘é‚»è·ç¦»ä»£æ›¿ä¸¤æ ·æœ¬ç‚¹ä¹‹é—´çš„ç›´æŽ¥è·ç¦»ä½œä¸ºæƒç³»æ•°å‡½æ•°çš„è¾“å…¥;ç„¶åŽåœ¨ä¿æŒä¸å¿ƒè¿‘é‚»å‡ ä½•ç»“æž„ä¸å˜çš„æƒ…å†µä¸‹æŠŠé«˜ç»´æ•°æ®åµŒå…¥åˆ°ä½Žç»´åæ ‡ç³»ä¸ã€‚é‰´åˆ«çŸ¢é‡è§’åµŒå…¥çš„è¯†åˆ«æ–¹æ³•,æž„é€ äº†ä¸€å¹…æœ‰æ£/è´Ÿè¿žæŽ¥è¾¹çš„é‚»æŽ¥å›¾,ç®—æ³•ä¸è¿žæŽ¥è¾¹æƒç³»æ•°çš„æµ‹åº¦é‡‡ç”¨çŸ¢é‡è§’ä»£æ›¿çŸ¢é‡æ¨¡,ä¸ä½†çœåŽ»äº†ä¼ ç»Ÿæ–¹æ³•ä¸å¯¹çƒæ ¸æƒå‡½æ•°tå‚æ•°çš„ä¼°è®¡,è€Œä¸”é™ä½Žç”±äºŽå›¾åƒæ ·æœ¬é—´çš„äº®åº¦å·®å¼‚å¯¹è¯†åˆ«çŽ‡é€ æˆçš„å½±å“ã€‚3)ä¸ºäº†å®žçŽ°äººè„¸è¯†åˆ«å…äºŽç‰¹å¾æå–,æå‡ºäº†ä¸€ç§åŸºäºŽæ£äº¤è¡¥è„¸çš„äººè„¸è¯†åˆ«æ–¹æ³•ã€‚è¯¥æ–¹æ³•åŸºäºŽç©ºé—´æ£äº¤åˆ†è§£ç†è®º,é¦–å…ˆå¯¹ä¸åŒç±»çš„åŽŸå§‹è®ç»ƒæ ·æœ¬è¿›è¡ŒGram-Schmidtæ£äº¤åŒ–,ä»¥æ£äº¤åŒ–åŽçš„åŸºå¼ æˆå„ä¸ªä¸åŒçš„åç©ºé—´,ç„¶åŽæŠŠæµ‹è¯•æ ·æœ¬åˆ†è§£ä¸ºåç©ºé—´æŠ•å½±åŠåç©ºé—´æ£äº¤è¡¥ä¸¤éƒ¨åˆ†ã€‚æ£äº¤è¡¥çš„èŒƒæ•°ä½“çŽ°äº†æµ‹è¯•æ ·æœ¬åˆ°å„ç±»åç©ºé—´çš„è·ç¦»,å¹¶ä»¥æ¤ä½œä¸ºåˆ†ç±»çš„ä¾æ®ã€‚4)é’ˆå¯¹å•æ ·æœ¬äººè„¸è¯†åˆ«é—®é¢˜,æœ¬æ–‡æå‡ºäº†ä¸€ç§åŸºäºŽå•æ ·æœ¬åˆ‡å‰²çš„åæ¨¡å—ä¸»æˆåˆ†åˆ†æžæ–¹æ³•ã€‚è¯¥æ–¹æ³•å°†å•æ ·æœ¬äººè„¸å›¾ç‰‡åˆ‡å‰²æˆå¤§å°ç›¸åŒã€äº’ä¸é‡å çš„å¤šä¸ªåæ¨¡å—,æž„æˆæ–°çš„æ ·æœ¬é›†ã€‚å¯¹æ‰€æœ‰åæ¨¡å—ä½œä¸»æˆåˆ†åˆ†æž(PCA)å¹¶æå–ç‰¹å¾,åŒä¸€äººè„¸çš„åæ¨¡å—ç‰¹å¾ç³»æ•°ä½œä¸ºåˆ†ç±»è¯†åˆ«çš„ä¾æ®ã€‚æ›´å¤š è¿˜åŽŸ

ã€Abstractã€‘ Face recognition is a key technique among biometric identification technologies, and its most important components are face detection and recognition. The aim of face recognition is to detect faces from images or videos accurately and recognize their identities. This dissertation focuses on learning-based face recognition, including machine learning methods such as clustering, manifold and subspace learning. The contributions of the dissertation are:1) To deal with the problem of face detection, two methods are proposed based on skin detection, which are called integral projectionâ€”Gaussian curves and modified substractive clustering, corresponding to single face and muti-face detection. In the approach of integral projectionâ€”Gaussian curves, two curves are obtained by integral projecting the binary-image to X and Y axes respectively, from which Gaussian curves are calculated and then, an accurate face region is found rapidly through the solution of Gaussian equation. The modified clustering algorithm proposes a new definition of distance for multi-face detection, and its key parameters can be predetermined adaptively by statistical information of face objects in the image. Downsampling is employed to reduce the computation of clustering and speed up the process of the proposed method. Meanwhile, the proposed approach also implements well in location of moving objects in video sequence. In order to estimate the angle of pose accurately, a cost function is proposed. The methods of gradient descent and sub-global enumerating are employed to search for the angle of pose. By rotating the image with the estimated angle, the pose is calibrated. And then, the eye map and mouth map are constructed by their characteristics of chroma and lum in the candidate region. Consequently, eyes and mouth are extracted for face validation.2) Focusing on the 3 essentials of manifold learning in face recognition, namely, (1) how to construct the neighborhood graph; (2) which measure can be used to estimate the true distance between two face samples; (3) what is the suitable cost function for embedding into subspace, two novel learning algorithms are derived from the manifold learning, which is called center based neighborhood embedding(CNE) and discriminant vector angle embedding(DVAE). Unlike the classical methods such as local linear embedding(LLE) and local preserving projection(LPP), CNE is a supervised linear dimensionality reduction method. It first computes centers of all sample classes. The input of the weight function between two samples is replaced by center based neighborhood(CN) distance. Then, the high-dimensional data are embedded into a low-dimensional space with preserving the CN geometric structure. On the other hand, DVAE constructs a graph with both positive and negative edges. The measure in DVAE is the angle between two vectors instead of modulus in traditional methods. It can be exempted from the estimation of the parameter in heat weight function. When test sample is embedded into low-dimensional space, a classification called angle nearest neighbor is used for face recognition.3) In ordrer to free face recognition from feature extraction, a method called orthogonal complement faces (OC-faces) is presented. The method is based on the orthogonal decomposition theorem. Firstly, the Gram-Schmidt orthogonal transformation is performed on the original training data of each class. Secondly, the orthogonal basis of each class spans a corresponding subspace. Therefore, the query sample can be decomposed into the sum of two components which are the orthogonal projection of query sample onto the corresponding subspace and the orthogonal complement of the subspace, respectively. Furthermore, the norm of the orthogonal complement indicates the distance between the query sample and the subspace of each class, so it can be used for classification.4) In order to deal with the problem of face recognition with one sample per person, a method called sub-block principle component analysis (PCA) based on partitions of the sample is presented in this disstertation. It first divides the sample into a few sub-blocks which have equal size and are non-overlapping, and then treats all the sub-blocks as a new sample set. Finally, PCA is performed on all the sub-blocks so as to extract features. Classification is done according to the projection coefficients of sub-blocks of a person.æ›´å¤š è¿˜åŽŸ

ã€å…³é”®è¯ã€‘ äººè„¸æ£€æµ‹ï¼› äººè„¸è¯†åˆ«ï¼› èšç±»ï¼› æµå½¢å¦ä¹ ï¼› åç©ºé—´ï¼› å•æ ·æœ¬ï¼›
ã€Key wordsã€‘ face detectionï¼› face recognitionï¼› clusteringï¼› manifold learningï¼› subspaceï¼› one training sampleï¼›

ã€ç½‘ç»œå‡ºç‰ˆæŠ•ç¨¿äººã€‘ æµ™æ±Ÿå¤§å¦

ã€åˆ†ç±»å·ã€‘TP391.41
ã€è¢«å¼•é¢‘æ¬¡ã€‘12
ã€ä¸‹è½½é¢‘æ¬¡ã€‘1323
æ”»è¯»æœŸæˆæžœ

çŸ¥ç½‘èŠ‚ä¸‹è½½

èŠ‚ç‚¹æ–‡çŒ®ä¸ï¼š

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

æœ¬æ–‡çš„å¼•æ–‡ç½‘ç»œ

èŠ‚ç‚¹æ–‡çŒ®

èŠ‚ç‚¹æ–‡çŒ®

åŸºäºŽå­¦ä¹ çš„äººè„¸è¯†åˆ«ç ”ç©¶

Research on Learning-Based Face Recognition

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

åŸºäºŽå¦ä¹ çš„äººè„¸è¯†åˆ«ç ”ç©¶