Semi-supervised Classification Algorithm Based on Graph and Entropy Regularization

【Author】 Liu Xiaolan

【Author information】 South China University of Technology, Computer Application Technology, 2011, PhD

【Abstract】 Semi-supervised learning (SSL) uses a large amount of unlabeled data to discover the intrinsic geometric structure of the data and, on that basis, uses a small amount of labeled data to perform dimensionality reduction, classification, and regression. Because SSL reduces the cost of manual labeling and improves learning performance, and because it is widely applicable to web page retrieval, text classification, biometric identification, and medical diagnosis, it has attracted the attention of the machine learning community since the 1990s and is now one of the most active research areas in the field. After analyzing the state of the art and the open problems of SSL, this thesis investigates several key issues in graph-based and entropy-regularized semi-supervised classification. The contributions are as follows:

1. Graph construction. Graph construction is the first step of any graph-based SSL algorithm. Most traditional graph construction methods depend on parameters and are sensitive to them. The recently proposed L1-norm reconstruction-error minimization graph construction models based on sparse representation may produce negative solutions, so those solutions cannot be used directly as graph edge weights. To address these deficiencies, two L1-norm minimization graph construction models based on nonnegative sparse representation, L1_IMP and L1_IMPv, are proposed. They add nonnegativity constraints to the existing L1-norm minimization models, so that the sparse solutions not only reflect how closely pairs of samples are related but can also be used directly as edge weights. Moreover, L1_IMP and L1_IMPv determine the adjacency structure of the graph and compute the edge weights in a single step. Experimental results on UCI and face recognition datasets show that, in most cases, label propagation using L1_IMP and L1_IMPv achieves better classification accuracy than label propagation using traditional graph construction methods.

2. Graph-based SSL with dissimilarity. Dissimilarity, or negative similarity, appears frequently in practical applications such as collaborative filtering. Because most graph-based SSL algorithms cannot handle negative similarity, a graph-based SSL model based on negative similarity, named SMLP, is proposed. The optimization objective of SMLP is the ratio of two quantities: the inconsistency between the class assignment and the positive similarity, and the consistency between the class assignment and the negative similarity. SMLP also allows labeled data to be relabeled. A global optimization algorithm is applied to solve SMLP, yielding an ε-global optimal solution in O(n³ log ε⁻¹) time. Experimental results on UCI datasets and a collaborative filtering problem verify the effectiveness of SMLP.

3. Graph-based SSL for data with noisy labels. Soft labels are used to handle mislabeled data in the SSL setting. First, the hard labels of the labeled samples are converted to soft labels by several existing label-softening methods; compared with hard labels, soft labels better accommodate an external teacher's uncertainty about pattern classes. The soft labels are then embedded into the existing graph-based SSL algorithm LGC. Experimental results on UCI and object recognition datasets with overlapping classes show that LGC with soft labels is more resistant to label errors than LGC with hard labels.

4. SSL based on minimum entropy regularization. A discriminative SSL classification model named MinEnt is established based on conditional Havrda-Charvat's Structural α-entropy regularization. The basic idea of MinEnt is that a good clustering criterion is also a good description of the unlabeled data. In MinEnt, the conditional Havrda-Charvat's Structural α-entropy clustering criterion describes the relation between the unlabeled data and their labels, the log-likelihood function describes the labeled data, and a quasi-Newton method is used to solve the model. The proposed algorithm is discriminative, which reduces its dependence on model selection. Moreover, it is inductive, so it can easily predict the labels of out-of-sample points. Experimental results on several UCI datasets demonstrate the effectiveness of the proposed algorithm.
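
As a rough illustration of the entropy-regularization idea in the last contribution, the sketch below combines a labeled-data negative log-likelihood with an α-entropy penalty on the predictions for unlabeled data. It is not the thesis's MinEnt formulation, which the abstract does not spell out. The entropy uses the commonly cited Havrda-Charvat form H_α(p) = (Σ_k p_k^α − 1) / (2^(1−α) − 1); the linear softmax classifier, the function names, and the parameters `alpha` and `lam` are assumptions made only for this example.

```python
import numpy as np

def hc_alpha_entropy(p, alpha):
    """Havrda-Charvat structural alpha-entropy along the last axis of p.

    Assumed form: H_alpha(p) = (sum_k p_k**alpha - 1) / (2**(1 - alpha) - 1),
    defined for alpha > 0 and alpha != 1.
    """
    p = np.asarray(p, dtype=float)
    if alpha <= 0 or alpha == 1:
        raise ValueError("alpha must be positive and different from 1")
    return (np.sum(p ** alpha, axis=-1) - 1.0) / (2.0 ** (1.0 - alpha) - 1.0)

def softmax(z):
    """Row-wise softmax with the usual max-shift for numerical stability."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def entropy_regularized_loss(W, X_lab, y_lab, X_unl, alpha=2.0, lam=0.5):
    """Hypothetical objective: labeled NLL + lam * mean alpha-entropy on unlabeled data.

    W is a (n_features, n_classes) weight matrix of an assumed linear
    softmax model; y_lab holds integer class indices.
    """
    p_lab = softmax(X_lab @ W)
    nll = -np.mean(np.log(p_lab[np.arange(len(y_lab)), y_lab]))
    p_unl = softmax(X_unl @ W)
    return nll + lam * np.mean(hc_alpha_entropy(p_unl, alpha))
```

A low value of the penalty means the classifier is confident on the unlabeled points, which is the sense in which a clustering criterion describes unlabeled data. Under these assumptions, the loss could be handed to a quasi-Newton routine such as `scipy.optimize.minimize` with `method="BFGS"` after flattening `W` into a vector.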

【Key words】 Semi-supervised learning; Graph-based method; Sparse representation; Fractional quadratic programming; Entropy regularization

【Online publication submitter】 South China University of Technology

【Classification number】 TP181
【Times cited】 4
【Downloads】 423