节点文献

基于复杂网络的演化策略博弈及其应用

【作者】 寇晓亮

【导师】 刘波; 张杰;

【作者基本信息】 北方工业大学 , 应用数学, 2012, 硕士

【摘要】 本文主要研究基于复杂网络的演化博弈问题,其主要研究内容如下:1.首先研究在格子网络中博弈个体采取相应博弈策略的问题。在WSLS (Win-Stay-Lose-Shift)策略的基础上,提出了一个新的博弈策略,即:如果博弈个体的收益大于他的邻居,那么他就保持自己的博弈策略不变。否则的话,他将随机采取和他有博弈关系邻居中的一个人的策略。一旦采取这种演化博弈策略,通过计算机仿真,发现演化博弈的结果和收益矩阵中参数b有关系。并且随着参数b的增加,演化博弈中合作者的平均水平也会增加。2.考虑了囚徒困境上博弈的惩罚因素(对于博弈双方都选择背叛策略时的收益)的作用。为了更好的研究这个问题,我们首先提出了一个能反映外部环境对博弈双方影响的策略。通常来说,由于惩罚因素的存在(对于博弈双方都选择背叛策略是囚徒困境模型的纳什均衡,因而双方都选择背叛策略是稳定的从而使得合作策略无法出现)但由于采取这个特殊的博弈策略,通过仿真,我们发现D—D(背叛——背叛)结构并不能够保持稳定。实际上,这个博弈策略主要是通过影响C—D(合作——背叛)结构。并且我们还发现在C—D结构在网络上达到一定比例的时候,惩罚因素会使C—D结构保持稳定并阻止C—D结构向C—C(合作——合作)结构的转化。进一步地,从稳定性方面考虑,我们发现概率还可以影响博弈的结果。3.对小世界网络上的变拓扑结构的惩罚和反馈机制进行研究。依据策略改变的原则,提出了一个新的惩罚反馈机制:网络中的所有个体都要先进行一次n轮的演化博弈,然后对于选择背叛策略最多的个体,他的两个邻居将对他进行惩罚,断开和他的博弈关系并自己建立相应的博弈关系。这个机制会使得网络中整体的度数下降,得到这种机制能够帮助合作者在博弈过程中生存下来避免被背叛者完全消灭。通过研究,我们发现前n轮博弈的次数对博弈结果产生的影响可以忽略,而网络的规模和平均度连接数<k>则能影响博弈过程和结果。

【Abstract】 This paper studies the application of the evolution game based on the complex network. The main contents and contributions of this paper are stated as follows:1. Consider the problem of the strategy adopted by individuals in the vertices of the network for evolution on the lattice of the Prisoner’s Dilemma Game. Based on the WSLS strategy (Win-Stay-Lose-Shift), then propose a new strategy that if the payoff of the individual is larger than the neighbor’s, it will keep its strategy, otherwise, it will imitate the strategy of the directly interacted individual of its neighbors randomly. As the imitating strategy rule adopted, though the simulations, find that the results of the evolution game are related to the parameter b which is the element of the payoff matrix. Furthermore, with the value of b being increased, the percentage of the cooperators in the network will also raise. This situation is opposite to the previous concept that the higher the b is, the smaller the fraction of the cooperator is.2. Present the problem about the effect of the punishment (the payoff for the two players both choose defection) in the Prisoner’s Dilemma Game. In order to study it. we propose a new evolution strategy rule which can reflect the external factor for both players in the evolution game. For generally, if the punishment exists, the D (Defection)-D (Defection) structure (the two players both choose D strategy) which is the Nash equilibrium for the game can keep stable and never let the cooperation emerge. With the particular evolution strategy rule adopted, though the simulations, we find that the D-D structure can not keep stable and it will decrease during the game for the evolution strategy rule. Actually the punishment mainly affects the C (Cooperation)-D (Defection) structure in the network. After the fraction of the C-D structure achieved some levels, the punishment can keep the C-D structure stable and prevent it transforming to C (Cooperation)-C (Cooperation) structure. Moreover, from the aspect of the stability of structure and though the payoff the individual gains, we also find that the probability which is related to the payoff can affect the result of the evolution game.3. Present the problem about the punishment and feedback mechanism with the topology changed on small-world network. Base on the strategy updating rule, then propose a new punishment and feedback mechanism that all the individuals in the network will play a n-round Prisoner’s Dilemma Game firstly and then for the most defectors, its neighbors will punish it and break the connecting tie with it and set up the connecting for themselves. The mechanism can make the degree of the whole network decrease. And find that the mechanism can help keeping the cooperators surviving and avoiding them wiped out by the defectors. With the mechanism adopted, the number of n-round PDG almost has no effect for the evolution game. Furthermore, the probability, the average connecting<k> and the scale of the network are related to the result of the evolution game.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络