动态多智能体建模与决策问题研究

Study on Dynamic Multi-Agent Model and Decision

[Author] Yao Hongliang (姚宏亮)

[Author information] Hefei University of Technology, Computer Application Technology, 2007, PhD

[Abstract] Complex dynamic decision problems are an important part of research on artificial intelligence and complex systems. Building on Bayesian techniques and decision theory, this dissertation proposes Multi-Agent Dynamic Influence Diagrams (MADIDs), a dynamic decision model with stronger knowledge-representation ability, for modeling multi-agent systems in dynamic environments. It also examines approximate computation of the MADID probability distribution, inference algorithms, and multi-agent coordination. The main contents and contributions are as follows:

(1) A structural decomposition method for Influence Diagrams (IDs) is given that splits an ID into a probability-network structure part and a utility structure part. An MDL scoring criterion that incorporates prior knowledge of the network structure is proposed to reduce the dependence of the traditional MDL score on data, and on this basis a PS-EM algorithm is proposed for model selection of the probability structure part. The joint utility function is expressed as the sum of local utility functions, and a BP (back-propagation) neural network is constructed to learn those local utility functions, which realizes learning of the utility structure part. Experimental results show that the model selection method is effective.

(2) After analyzing related probabilistic decision models, Multi-Agent Influence Diagrams (MAIDs) are extended over time to obtain a new decision model, MADIDs, which represents coordination relationships among multiple agents in a dynamic environment. To compute the probability distribution of MADIDs efficiently, a hierarchical decomposition of the distribution is given, guided by the strategic relevance among agents, and the error of the approximate distribution is analyzed using the KL divergence.

(3) To address the high computational complexity of the 1.5-slice junction tree exact inference algorithm and the large error of the BK approximate inference algorithm on MADIDs, an extended BK (EBK) algorithm is proposed. EBK improves inference efficiency by hierarchically decomposing the probability distribution of MADIDs, reduces inference error by introducing conditionally independent separators, and adds inference over utility nodes and decision nodes. To address the excessive dimensionality of particle filter inference and the large error of factored particle filter inference, a junction tree factored particle (JFP) inference algorithm is proposed that combines the advantages of particle filtering and junction tree inference. JFP converts the probability distribution of MADIDs into a local factored form to improve computational efficiency, and propagates factored particles over the junction tree to reduce inference error. The algorithms are verified and compared experimentally on a local coordination model in the RoboCup simulation environment.

(4) Building on multi-agent coordination with Coordination Graphs (CGs), roles are introduced into the CG to give an extended coordination graph that reduces communication during coordination. A MADID-based multi-agent coordination method is given in which coordination is achieved through inference about the environment and computation of local utilities, and communication in local coordination is avoided by modeling the opponent.
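
As an illustration of the KL-divergence error analysis mentioned in (2), the sketch below measures the error introduced when a joint belief over two variables is replaced by the product of its marginals, which is the kind of factored approximation used in BK-style inference. It is a minimal example under stated assumptions, not the dissertation's hierarchical decomposition: the two-variable joint distribution and the helper names (marginal, factored_approximation, kl_divergence) are hypothetical and chosen only for illustration.

```python
import math
from itertools import product


def marginal(joint, axis):
    """Marginal of one variable from a joint distribution over pairs (x0, x1)."""
    out = {}
    for state, p in joint.items():
        out[state[axis]] = out.get(state[axis], 0.0) + p
    return out


def factored_approximation(joint):
    """Product of the two single-variable marginals (a fully factored belief)."""
    m0, m1 = marginal(joint, 0), marginal(joint, 1)
    return {(a, b): m0[a] * m1[b] for a, b in product(m0, m1)}


def kl_divergence(p, q):
    """KL(p || q) in nats; states with p(x) = 0 contribute nothing."""
    return sum(px * math.log(px / q[x]) for x, px in p.items() if px > 0.0)


# Hypothetical joint belief over two correlated binary state variables.
joint = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.1, (1, 1): 0.4}
approx = factored_approximation(joint)
error = kl_divergence(joint, approx)
print(f"KL(joint || factored) = {error:.4f} nats")
```

For a product-of-marginals approximation this divergence equals the mutual information between the two variables, so it is zero exactly when they are independent and grows with their correlation.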

[Keywords] Complex System; Bayes Technology; Multi-Agent Dynamic Influence Diagrams; Decision Analysis

[Online publication submitter] Hefei University of Technology

[Classification number] TP18
[Citation count] 8
[Download count] 1295