16.
S<B
i gets T
others get H j gets E
i gets P
j doesn’t punish i
メタ規範ゲーム
k gets E’
j gets P’
k doesn’t punish j
j punishes i
k punishes j
i defects
Vj
Vk
Axelrod, R.: An Evolutionary Approach to Norms,
American Political Science Review, Vol. 80, No. 4, pp. 1095–1111(1986)
T=3
H=-1
E=-9
P=-2
E’=-9
P’=-2
18.
S<B
i gets T
others get H j gets E
i gets P
j doesn’t punish i
一般化メタ規範ゲーム
k gets E’
j gets P’
k doesn’t punish j
j punishes i
k punishes j
i defects
Vj
Vk
協調
j doesn’t reward i
j rewards i
k gets C’
j gets R’
k doesn’t reward j
k rewards j
k gets E’’
j gets P’’
k doesn’t punish j
k punishes j
k gets C’’
j gets R’’
k doesn’t reward j
k rewards j
i cooperats
Lj
Vk
Lk
Lk
k gets C’’
j gets R’’
i gets F
others get M
k gets C’
j gets R’
k gets C’’
j gets R’’
i gets F
others get M
j gets C
i gets R
21.
S<B
メタ報酬ゲーム
協調
j rewards i
k gets C’’
j gets R’’
k doesn’t reward j
k rewards j
i cooperats
Lk
k gets C’’
j gets R’’
i gets F
others get M
Vj
i gets T
others get H j gets E
i gets P
j doesn’t punish i
k gets E’
j gets P’
k doesn’t punish j
j punishes i
k punishes j
i defects
j doesn’t reward i
k gets C’
j gets R’
k doesn’t reward j
k rewards j
k gets E’’
j gets P’’
k doesn’t punish j
k punishes jLj
Vk
Vk
Lk
k gets C’
j gets R’
k gets C’’
j gets R’’
i gets F
others get M
j gets C
i gets R
22.
従来の議論
• 報酬では協調は促進されづらい
• メタ報酬ゲームで協調は促進されるのか?
• 二つのゲームを比較
• メタ懲罰ゲーム
• Axelrodによるメタ規範ゲーム
• メタ報酬ゲーム
• ソーシャルメディアをモデル化したメタ規範ゲーム
・Sutter, M., Haigner, S., and Kocher, M. G.:
Choosing the Carrot or the Stick? Endogenous Institutional Choice in Social
Dilemma Situations, Review of Economic Studies, Vol. 77, No. 4, pp. 1540–1566 (2010)
・Hilbe, C. and Sigmund, K.: Incentives and opportunism: from the carrot to the stick
Proc. R. Soc. B, Vol. 277, pp. 2427–2433 (2010)
23.
S<B
i gets T
others get H j gets E
i gets P
j doesn’t punish i
一般化メタ規範ゲーム
k gets E’
j gets P’
k doesn’t punish j
j punishes i
k punishes j
i defects
協調
j doesn’t reward i
j rewards i
k gets C’
j gets R’
k doesn’t reward j
k rewards j
k gets E’’
j gets P’’
k doesn’t punish j
k punishes j
k gets C’’
j gets R’’
k doesn’t reward j
k rewards j
i cooperats
Vj
Lj
Vk
Vk
Lk
Lk
k gets C’’
j gets R’’
i gets F
others get M
k gets C’
j gets R’
k gets C’’
j gets R’’
i gets F
others get M
j gets C
i gets R
メタ報酬ゲーム
メタ懲罰ゲーム
24.
メタ懲罰ゲームエージェント
• エージェントパラメータ
• 協調率: Bi
• 懲罰率: Vi
• エージェントの行動
• 発見率St <1- Bi :裏切り
• 確率Stで他人の裏切りを発見
• 確率Viで懲罰
• 確率Stで他人が懲罰していないのを発見
• 確率Viでメタ懲罰
25.
メタ報酬ゲームエージェント
• エージェントパラメータ
• 協調率: Bi
• 報酬率: Li
• エージェントの行動
• 発見率St < Bi :協調(情報提供)
• 確率Stで他人の協調を発見
• 確率Liで報酬
• 確率Stで他人の報酬を発見
• 確率Liでメタ報酬
28.
メタ報酬ゲーム
協調
-3
+1
+1
+1
+1
+1 S<B
協調
j doesn’t reward i
j rewards i
k gets C’’
j gets R’’
k doesn’t reward j
k rewards j
i cooperats
Lj
Lk
k gets C’’
j gets R’’
i gets F
others get M
k gets C’’
j gets R’’
i gets F
others get M
j gets C
i gets R
メタ報酬ゲーム
29.
メタ報酬ゲーム
協調
協調者への報酬
協調者を発見&報酬を与える
-2
+9
S<B
協調
j doesn’t reward i
j rewards i
k gets C’’
j gets R’’
k doesn’t reward j
k rewards j
i cooperats
Lj
Lk
k gets C’’
j gets R’’
i gets F
others get M
k gets C’’
j gets R’’
i gets F
others get M
j gets C
i gets R
メタ報酬ゲーム
30.
メタ報酬ゲーム
協調
協調へのメタ報酬
協調を発見&報酬
報酬を与えたことを発見
&報酬
-2
+9
S<B
協調
j doesn’t reward i
j rewards i
k gets C’’
j gets R’’
k doesn’t reward j
k rewards j
i cooperats
Lj
Lk
k gets C’’
j gets R’’
i gets F
others get M
k gets C’’
j gets R’’
i gets F
others get M
j gets C
i gets R
メタ報酬ゲーム
Be the first to comment