博弈论 斯坦福game theory stanford week 5.1_

简介: title: 博弈论 斯坦福game theory stanford week 5-1tags: notenotebook: 6- 英文课程-15-game theory---博弈论 斯坦福game theory stanford week 5-1练习1.

title: 博弈论 斯坦福game theory stanford week 5-1
tags: note
notebook: 6- 英文课程-15-game theory
---

博弈论 斯坦福game theory stanford week 5-1

练习

1. Question 1

Two players play the following normal form game.

1  2 Left Middle Right

Left 4,2 3,3 1,2

Middle 3,3 5,5 2,6

Right 2,1 6,2 3,3

Which is the pure strategy Nash equilibrium of this stage game (if it is played only once)?

a) (Left, Left);

b) (Left, Middle);

c) (Left, Right);

d) (Middle, Left);

e) (Middle, Middle);

f) (Middle, Right);

g) (Right, Left);

h) (Right, Middle);

i) (Right, Right).

Correct 
(i) is the unique Nash equilibrium of the stage game.

(Right, Right) is a Nash equilibrium of the stage game because Right is the best response when the other player is playing Right.
It is also the unique Nash equilibrium. To see this, check that in all other cases at least one player has an incentive to deviate.

Question 2

Two players play the following normal form game.

1  2 Left Middle Right

Left 4,2 3,3 1,2

Middle 3,3 5,5 2,6

Right 2,1 6,2 3,3

Suppose that the game is repeated for two periods. What is the outcome from the subgame perfect Nash equilibrium of the whole game:

a) (Left, Left) is played in both periods.

b) (Right, Right) is played in both periods.

Correct 
(b) is true.

The stage game has a unique Nash equilibrium.
In the second period, (Right, Right) must be played regardless of the outcome obtained in the first period.
Then, it is optimal for both players to maximize the current payoff at the first period and play (Right, Right).

c) (Middle, Middle) is played in the first period, followed by (Left, Left)

d) (Middle, Middle) is played in the first period, followed by (Right, Right)

Question 3

Two players play the following normal form game.

1  2 Left Middle Right

Left 4,2 3,3 1,2

Middle 3,3 5,5 2,6

Right 2,1 6,2 3,3

Suppose that there is a probability p that the game continues next period and a probability (1−p) that it ends. What is the threshold p∗ such that when p≥p∗ (Middle, Middle) is sustainable as a subgame perfect equilibrium by grim trigger strategies, but when img_f0a48318fdb976c94810bc78faac3be4.gif playing Middle in all periods is not a best response? [Here the grim strategy is: play Middle if the play in all previous periods was (Middle, Middle); play Right otherwise.]

a) 1/2;

b) 1/3;

c) 1/4;

This should not be selected

d) 2/5.

Question 4

Consider the following game:

1  2 Left Middle Right

Left 1,1 5,0 0,0

Middle 0,5 4,4 0,0

Right 0,0 0,0 3,3

Which are the pure strategy Nash equilibria of this stage game? There can be more than one.

a) (Left, Right);

Un-selected is correct 

b) (Left, Left);

Correct 

(b) and (f) are pure strategy Nash equilibria of the stage game.

(Left, Left) and (Right, Right) are Nash equilibria of the stage game because Right is the best response when the other player is playing Right, and Left is the best response when the other player is playing Left.
There are no other pure strategy Nash equilibria. To see this, check that in all other cases at least one player has an incentive to deviate.

c) (Left, Middle);

Un-selected is correct 

d) (Middle, Right);

Un-selected is correct 

e) (Middle, Left);

Un-selected is correct 

f) (Right, Right).

Correct 
(b) and (f) are pure strategy Nash equilibria of the stage game.

(Left, Left) and (Right, Right) are Nash equilibria of the stage game because Right is the best response when the other player is playing Right, and Left is the best response when the other player is playing Left.
There are no other pure strategy Nash equilibria. To see this, check that in all other cases at least one player has an incentive to deviate.

g) (Right, Middle);

Un-selected is correct 

h) (Right, Left);

Un-selected is correct 

i) (Middle, Middle);

Un-selected is correct 

Question 5

Consider the following game:

1  2 Left Middle Right

Left 1,1 5,0 0,0

Middle 0,5 4,4 0,0

Right 0,0 0,0 3,3

Suppose that the game is repeated for two periods. Which of the following outcomes could occur in some subgame perfect equilibrium? (There might be more than one).

a) (Middle, Middle) is played in the first period, followed by (Right, Right)

Correct 
(a), (b) and (c) are all correct.

Recall that playing a Nash equilibrium of the stage game in each period forms a subgame perfect Nash equilibrium of the whole game. Then, (b) and (c) are subgame perfect Nash equilibria.
Outcome (a) can be obtained when both players play the following strategy:
Play Middle in the first period.
If outcome in first period was (Middle, Middle) play Right in the second period; otherwise play Left.
It is easy to check that this grim strategy forms a subgame perfect Nash equilibrium:
Suppose that player 1 plays this strategy.
If player 2 plays the same strategy, he/she will receive a total payoff of 4+3=7 (assume no discounting).
If player 2 deviates to (Left, Right), he/she will receive a total payoff of 5+1=6 (which is lower than the payoff of following the grim strategy).

b) (Left, Left) is played in both periods.

Correct 
(a), (b) and (c) are all correct.

Recall that playing a Nash equilibrium of the stage game in each period forms a subgame perfect Nash equilibrium of the whole game. Then, (b) and (c) are subgame perfect Nash equilibria.
Outcome (a) can be obtained when both players play the following strategy:
Play Middle in the first period.
If outcome in first period was (Middle, Middle) play Right in the second period; otherwise play Left.
It is easy to check that this grim strategy forms a subgame perfect Nash equilibrium:
Suppose that player 1 plays this strategy.
If player 2 plays the same strategy, he/she will receive a total payoff of 4+3=7 (assume no discounting).
If player 2 deviates to (Left, Right), he/she will receive a total payoff of 5+1=6 (which is lower than the payoff of following the grim strategy).

c) (Right, Right) is played in both periods.

Correct 
(a), (b) and (c) are all correct.

Recall that playing a Nash equilibrium of the stage game in each period forms a subgame perfect Nash equilibrium of the whole game. Then, (b) and (c) are subgame perfect Nash equilibria.
Outcome (a) can be obtained when both players play the following strategy:
Play Middle in the first period.
If outcome in first period was (Middle, Middle) play Right in the second period; otherwise play Left.
It is easy to check that this grim strategy forms a subgame perfect Nash equilibrium:
Suppose that player 1 plays this strategy.
If player 2 plays the same strategy, he/she will receive a total payoff of 4+3=7 (assume no discounting).
If player 2 deviates to (Left, Right), he/she will receive a total payoff of 5+1=6 (which is lower than the payoff of following the grim strategy).

Question 6

Consider the following trust game:

There is a probability p that the game continues next period and a probability (1−p) that it ends. The game is repeated indefinitely. Which statement is true? [Grim trigger in (c) and (d) is player 1 playing Not play and player 2 playing Distrust forever after a deviation from ((Play,Share), (Trust)).]

a) There exists a pure strategy Nash equilibrium in the one-shot game with player 2 playing Trust.

b) There exists a pure strategy subgame perfect equilibrium with player 2 playing Trust in any period in the finitely repeated game.

This should not be selected 

c) ((Play,Share), (Trust)) is sustainable as a subgame perfect equilibrium by grim trigger in the indefinitely repeated game with a probability of continuation of p≥5/9.

Question 7

In an infinitely repeated Prisoner's Dilemma, a version of what is known as a "tit for tat" strategy of a player i is described as follows:

There are two "statuses" that player i might be in during any period: "normal" and "revenge";
In a normal status player i cooperates;
In a revenge status player i defects;
From a normal status, player i switches to the revenge status in the next period only if the other player defects in this period;
From a revenge status player i automatically switches back to the normal status in the next period regardless of the other player's action in this period.
Consider an infinitely repeated game so that with probability p that the game continues to the next period and with probability (1−p) it ends.

Cooperate (C) Defect (D)

Cooperate (C) 4,4 0,5

Defect (D) 5,0 1,1

True or False:

When player 1 uses the above-described "tit for tat" strategy and starts the first period in a revenge status (thus plays defect for sure), in any infinite payoff maximizing strategy, player 2 plays defect in the first period

True.

Correct 
True.

If player 1 uses "tit for tat" strategy and starts in a revenge status, the payoff in the first period is higher for player 2 from defection than cooperation.
Moreover, the action played by 2 in the first period when 1 begins in revenge status doesn't affect the remaining periods since 1 switches to normal status in the second period regardless of what player 2 does in the first period.

False.

Question 8

In an infinitely repeated Prisoner's Dilemma, a version of what is known as a "tit for tat" strategy of a player i is described as follows:

There are two "statuses" that player i might be in during any period: "normal" and "revenge";

In a normal status player i cooperates;

In a revenge status player i defects;

From a normal status, player i switches to the revenge status in the next period only if the other player defects in this period;

From a revenge status player i automatically switches back to the normal status in the next period regardless of the other player's action in this period.

Consider an infinitely repeated game so that with probability p that the game continues to the next period and with probability (1−p) it ends.

Cooperate (C) Defect (D)

Cooperate (C) 4,4 0,5

Defect (D) 5,0 1,1

What is the payoff for player 2 from always cooperating when player 1 uses this tit for tat strategy and begins in a normal status? How about always defecting when 1 begins in a normal status?

a) 4+4p+4p2+4p3+… ; 5+p+p2+p3+…

b) 4+4p+4p2+4p3+… ; 5+p+5p2+p3+…

Correct 
(b) is true.

If 2 always cooperates, then 1 stays `normal' and cooperates always as well, and the payoff to each player is 4 in each period.
If 2 always defects, then 1 is normal in odd periods and switches to revenge in even periods (because 2 defects). 1 cooperates in odd periods and defects in even periods, thus 2 earns 5 in odd periods and 1 in even periods.

c) 5+4p+4p2+4p3+… ; 4+4p+4p2+4p3+…

d) 5+4p+4p2+4p3+… ; 5+p+p2+p3+…

Question 9

In an infinitely repeated Prisoner's Dilemma, a version of what is known as a "tit for tat" strategy of a player i is described as follows:

There are two "statuses" that player i might be in during any period: "normal" and "revenge";

In a normal status player i cooperates;

In a revenge status player i defects;

From a normal status, player i switches to the revenge status in the next period only if the other player defects in this period;

From a revenge status player i automatically switches back to the normal status in the next period regardless of the other player's action in this period.

Consider an infinitely repeated game so that with probability p that the game continues to the next period and with probability (1−p) it ends.

Cooperate (C) Defect (D)

Cooperate (C) 4,4 0,5

Defect (D) 5,0 1,1

What is the threshold p∗ such that when p≥p∗ always cooperating by player 2 is a best response to player 1 playing tit for tat and starting in a normal status, but when p<p∗ always cooperating is not a best response?

a) 1/2

b) 1/3

Correct 
(b) is true.

From part (2), in order to sustain cooperation, we need 4+4p+4p2+4p3+...≥5+p+5p2+p3+... , which is 4+4p≥5+p, thus p≥1/3.
p* = 1/3.
Note that this just checks always cooperating against always defecting. However, you can easily check that if player 2 wants to defect in the first period,then s/he should also do so in the second period (our answer from part (1)). Then the third period looks just like we are starting the game over, so player 2 would want to defect again...

c) 1/4

d) 1/5

相关文章
|
机器学习/深度学习 编解码 算法
论文精度笔记(二):《Deep Learning based Face Liveness Detection in Videos 》
论文提出了基于深度学习的面部欺骗检测技术,使用LRF-ELM和CNN两种模型,在NUAA和CASIA数据库上进行实验,发现LRF-ELM在检测活体面部方面更为准确。
118 1
论文精度笔记(二):《Deep Learning based Face Liveness Detection in Videos 》
|
数据处理 Python
在数据科学领域,Pandas和NumPy是每位数据科学家和分析师的必备工具
在数据科学领域,Pandas和NumPy是每位数据科学家和分析师的必备工具。本文通过问题解答形式,深入探讨Pandas与NumPy的高级操作技巧,如复杂数据筛选、分组聚合、数组优化及协同工作,结合实战演练,助你提升数据处理能力和工作效率。
162 5
|
XML 存储 JSON
【JavaScript】爆肝 2 万字!一次性搞懂 Ajax、Fetch 和 Axios 的区别~(上)
【JavaScript】爆肝 2 万字!一次性搞懂 Ajax、Fetch 和 Axios 的区别~(上)
【JavaScript】爆肝 2 万字!一次性搞懂 Ajax、Fetch 和 Axios 的区别~(上)
|
Linux Shell
Linux操作系统3:Linux常用命令
Linux操作系统3:Linux常用命令
144 0
|
消息中间件 存储 负载均衡
|
存储 API 图形学
OpenCV_11 轮廓检测:图像的轮廓+绘制轮廓+轮廓近似+边界矩形+椭圆拟合+直线拟合
轮廓检测指检测图像中的对象边界,更偏向于关注上层语义对象。如OpenCV中的findContours()函数, 它会得到每一个轮廓并以点向量方式存储,除此也得到一个图像的拓扑信息,即一个轮廓的后一个轮廓、前一个轮廓等的索引编号。
3134 0
|
Windows
Invoke-Obfuscation混淆免杀过360和火绒(下)
Invoke-Obfuscation混淆免杀过360和火绒
441 0
|
物联网 Linux 开发者
无名管道读写特点|学习笔记
快速学习无名管道读写特点
无名管道读写特点|学习笔记
|
SQL 安全 Java
Java面试之String、StringBuffer和StringBuilder的区别和原理
这个问题说实话,我是被公司老板反问过的,吃过亏的。所以专门研究了一下并且分享给不懂这块知识点的人
Java面试之String、StringBuffer和StringBuilder的区别和原理
|
Shell Docker 容器
docker自用小手册
容器运行的命令如果不是那些一直挂起的命令( 运行top,tail、循环等),就是会自动退出
223 0