èŠ‚ç‚¹æ–‡çŒ®

åŸºäºŽå…´è¶£åº¦çš„å…³è”è§„åˆ™æŒ–æŽ˜ç®—æ³•çš„ç ”ç©¶

Mining Algorithm Research for Association Rules Base on Interest Measure

åˆ†é¡µä¸‹è½½
åˆ†ç« ä¸‹è½½
æ•´æœ¬ä¸‹è½½
åœ¨çº¿é˜…è¯»
ä¸æ”¯æŒè¿…é›·ç‰ä¸‹è½½å·¥å…·ï¼Œè¯·å–æ¶ˆåŠ é€Ÿå·¥å…·åŽä¸‹è½½ã€‚

ã€ä½œè€…ã€‘ é™ˆå®‰é¾™ï¼›

ã€ä½œè€…åŸºæœ¬ä¿¡æ¯ã€‘ è¥¿å—äº¤é€šå¤§å¦ ï¼Œ è®¡ç®—æœºåº”ç”¨ï¼Œ 2003ï¼Œ ç¡•å£«

ã€æ‘˜è¦ã€‘ æ•°æ®æŒ–æŽ˜æ˜¯é¢å‘æµ·é‡æ•°æ®çš„çŸ¥è¯†å‘çŽ°æŠ€æœ¯ï¼Œç ”ç©¶é«˜æ•ˆçš„æŒ–æŽ˜ç®—æ³•æ˜¯æ•°æ®æŒ–æŽ˜ç ”ç©¶çš„é‡è¦å†…å®¹ä¹‹ä¸€ã€‚å…³è”è§„åˆ™æ˜¯æ•°æ®æŒ–æŽ˜çš„é‡è¦æ¨¡å¼ä¹‹ä¸€ï¼Œæœ‰ç€æžå…¶é‡è¦åº”ç”¨ä»·å€¼ã€‚æœ¬æ–‡ä¸»è¦ç ”ç©¶äº†å¦‚ä½•æé«˜å¸ƒå°”å…³è”è§„åˆ™çš„æŒ–æŽ˜ç®—æ³•çš„æœ‰æ•ˆæ€§å’Œä¼¸ç¼©æ€§ã€‚ Aprioriç®—æ³•æ˜¯æŒ–æŽ˜å¸ƒå°”å…³è”è§„åˆ™çš„ç®—æ³•ï¼Œè€Œè¯¥ç®—æ³•åœ¨ç©ºé—´å’Œæ—¶é—´çš„å¤æ‚æ€§æœ‰ç€éš¾ä»¥å…‹æœçš„å±€é™æ€§ã€‚å› æ¤ï¼Œæ–‡ä¸å¼•å…¥äº†ä¸€ç§ä¸éœ€è¦äº§ç”Ÿå€™é€‰é¡¹çš„é¢‘ç¹æ¨¡å¼å¢žé•¿ç®—æ³•ï¼Œå°†æ•°æ®åº“çš„äº‹åŠ¡çš„ä¿¡æ¯åŽ‹ç¼©åˆ°FPä¸€æ ‘ï¼Œç„¶åŽé€šè¿‡åŽç¼€ä¸Žå‰ç¼€è¿žæŽ¥äº§ç”Ÿé¢‘ç¹æ¨¡å¼ï¼Œä»Žè€Œé¿å…äº†å¤šæ¬¡æ‰«ææ•°æ®åº“ï¼Œé™ä½Žäº†æ—¶é—´å¼€é”€ã€‚ å½“æ•°æ®åº“ä¸çš„é¡¹ç›®æ•°ç›®è¾ƒå¤§ä¸”äº‹åŠ¡æ•°é‡å·¨å¤§æ—¶ï¼Œé¢‘ç¹æ¨¡å¼å¢žé•¿ç®—æ³•å†…å˜å¼€é”€å¾ˆå¤§ï¼Œå¯èƒ½å¯¼è‡´å†…å˜ç©ºé—´ä¸è¶³çš„çŽ°è±¡ã€‚å› æ¤ï¼Œæœ¬æ–‡æå‡ºäº†åŸºäºŽæžå¤§å›¢åˆ’åˆ†çš„æ¨¡å¼å¢žé•¿ç®—æ³•ï¼Œå°†äº‹åŠ¡é¡¹ç›®é›†åˆ†è§£æˆè‹¥å¹²åé›†ï¼Œå¯¹æ¯ä¸ªåé›†åˆ†åˆ«ä½¿ç”¨é¢‘ç¹æ¨¡å¼å¢žé•¿ç®—æ³•æ‰¾å‡ºå®ƒä»¬çš„é¢‘ç¹æ¨¡å¼ï¼Œä»Žè€Œè§£å†³äº†å†…å˜ä¸è¶³çš„çŸ›ç›¾ã€‚åŒæ—¶ï¼Œæå‡ºäº†ä¸€ç§ç”¨é‚»æŽ¥çŸ©é˜µäº§ç”Ÿé¢‘ç¹2é¡¹é›†çš„æ–¹æ³•ï¼Œå¯ä»¥å‡å°‘æ‰«ææ•°æ®åº“çš„æ¬¡æ•°ã€‚ å¦‚ä½•ä»Žå¤§é‡çš„å…³è”æ¨¡å¼ä¸ç›é€‰å‡ºç”¨æˆ·æ„Ÿå…´è¶£ä¸”æœ‰ä»·å€¼çš„è§„åˆ™ï¼Œæ˜¯ç®—æ³•ç ”ç©¶çš„é‡è¦å†…å®¹ä¹‹ä¸€ã€‚åŸºäºŽæ”¯æŒåº¦å’Œä¿¡ä»»åº¦çš„æ¡†æž¶æ¨¡åž‹æœ‰ä¸€å®šçš„å±€é™æ€§ï¼Œæœ¬æ–‡åœ¨æ¤æ¡†æž¶ä¸å¼•å…¥äº†åŸºäºŽå½±å“çš„å…´è¶£åº¦ï¼Œç”¨æ¥ä¿®å‰ªæ— è¶£çš„è§„åˆ™ï¼Œä»Žè€Œç›é€‰å‡ºç”¨æˆ·çœŸæ£æ„Ÿå…´è¶£çš„è§„åˆ™æ¨¡å¼ã€‚æ›´å¤š è¿˜åŽŸ

ã€Abstractã€‘ Data mining is the knowledge discovery technique oriented to a great deal of data. Researching efficient algorithm is one of the important contents in study of data mining. Association rule is one of the important models of data mining, and has the most significant application value. The core of this dissertation is how to improve the validity and scalability of mining algorithm of Boolean association rules.The Apriori algorithm is the method of finding Boolean association rules, but has the disadvantage in the complexity of space and time. Therefore, this thesis introduces a new frequent-pattern (FP) growth algorithm that does not need to produce the candidate item sets. This algorithm compresses information in database to the FP-tree, then produces frequent pattern by joining suffix with prefix, consequently avoids scanning the database many times, and lowers the time expense.When there are a great many of items and transactions in the database, frequent-pattern growth algorithm needs more additional computer memory, which may cause the lack of memory. Therefore, this paper brings forward frequent-pattern growth algorithm based on maximum clique that resolves problem of memory insufficiency by dividing item set into several subsets, then computing frequent-pattern for each subset. In this paper, a new algorithm is given to find fraquent 2-itemset by adjacency matrix with less times scanning the database.How to select the interested and valuable rules from a large number of association modes is one of the important contents in study of mining algorithm. There is limitation in model based on support and confidence measure, thus interest measure model based on effect is given in this dissertation, which is used to prune the no-interest rules in order to discover the real interest rules mode.æ›´å¤š è¿˜åŽŸ

ã€å…³é”®è¯ã€‘ æ•°æ®æŒ–æŽ˜ï¼› å…³è”è§„åˆ™ï¼› å…´è¶£åº¦ï¼› æžå¤§å›¢ï¼› é‚»æŽ¥çŸ©é˜µï¼›
ã€Key wordsã€‘ Data Miningï¼› Association rulesï¼› Interest measureï¼› Maximum cliqueï¼› Adjacency matrixï¼›

ã€ç½‘ç»œå‡ºç‰ˆæŠ•ç¨¿äººã€‘ è¥¿å—äº¤é€šå¤§å¦

ã€åˆ†ç±»å·ã€‘TP311.13
ã€è¢«å¼•é¢‘æ¬¡ã€‘13
ã€ä¸‹è½½é¢‘æ¬¡ã€‘428

çŸ¥ç½‘èŠ‚ä¸‹è½½

èŠ‚ç‚¹æ–‡çŒ®ä¸ï¼š

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

æœ¬æ–‡çš„å¼•æ–‡ç½‘ç»œ

èŠ‚ç‚¹æ–‡çŒ®

èŠ‚ç‚¹æ–‡çŒ®

åŸºäºŽå…´è¶£åº¦çš„å…³è”è§„åˆ™æŒ–æŽ˜ç®—æ³•çš„ç ”ç©¶

Mining Algorithm Research for Association Rules Base on Interest Measure

æœ¬æ–‡é“¾æŽ¥çš„æ–‡çŒ®ç½‘ç»œå›¾ç¤º:

åŸºäºŽå…´è¶£åº¦çš„å…³è”è§„åˆ™æŒ–æŽ˜ç®—æ³•çš„ç ”ç©¶