Nash Equilibrium - Game Theory

Motivating Example¶

In the Coordination Game, in how many
situations do neither player have an incentive to independently change
their strategy?

Neither player having a reason to change their strategy implies that both
strategies are best responses to each other.

Recall that for the Coordination game is defined by:

M_r = \begin{pmatrix} 3 & 1 \\ 0 & 2 \end{pmatrix} \qquad M_c = \begin{pmatrix} 2 & 1 \\ 0 & 3 \end{pmatrix}

(1)

If we consider strategies that only play a single action, there are two options for each strategy:

\sigma_1 \in \{(1, 0), (0, 1)\}

(2)

and:

\sigma_2 \in \{(1, 0), (0, 1)\}

(3)

We will inspect all four combinations:

$\sigma_1 = (1, 0)$ and $\sigma_2 = (1, 0)$ corresponds to both players playing their first action, which gives: $u_r(\sigma_1, \sigma_2) = 3$ and $u_c(\sigma_1, \sigma_2) = 2$ .
If the row player were to modify their strategy (while the column player stayed unchanged) to play the second action, their utility would decrease. Likewise, if the column player were to modify their strategy, their utility would also decrease.
$\sigma_1 = (1, 0)$ and $\sigma_2 = (0, 1)$ corresponds to the row player playing their first action and the column player playing their second action, which gives: $u_r(\sigma_1, \sigma_2) = 1$ and $u_c(\sigma_1, \sigma_2) = 1$ .
In this case, if either player were to move, their utility would increase.
$\sigma_1 = (0, 1)$ and $\sigma_2 = (1, 0)$ corresponds to the row player playing their second action and the column player playing their first action, which gives: $u_r(\sigma_1, \sigma_2) = 0$ and $u_c(\sigma_1, \sigma_2) = 0$ .
In this case, if either player were to move, their utility would increase.
$\sigma_1 = (0, 1)$ and $\sigma_2 = (0, 1)$ corresponds to both players playing their second action, which gives: $u_r(\sigma_1, \sigma_2) = 2$ and $u_c(\sigma_1, \sigma_2) = 3$ .
If the row player were to modify their strategy (while the column player stayed unchanged), their utility would decrease. Likewise, if the column player were to modify their strategy, their utility would also decrease.

Is there another pair of strategies that are best responses to each other and will such a pair always exist for any game?

Theory¶

A pair of strategies that are best responses to each other is a Nash equilibrium.

Definition: Nash Equilibria¶

In an $N$ -player normal form game, a Nash equilibrium is a strategy profile
$\tilde{s} = (\tilde{s}_1, \tilde{s}_2, \dots, \tilde{s}_N)$ such that:

u_i(\tilde{s}) = \max_{\bar{s}_i \in \Delta(\mathcal{A}_i)} u_i(\bar{s}_i, \tilde{s}_{-i}) \quad \text{for all } i

(4)

The following algorithm gives an approach to use the best response condition to systematically find all Nash equilibrium.

Definition: Support Enumeration Algorithm¶

The algorithm proceeds as follows:

For a two-player game
$(M_r, M_c) \in \left(\mathbb{R}^{m \times n}\right)^2$ , the following algorithm
returns all pairs of best responses:

For all pairs of supports (subsets of the action space) $(I, J)$ :
Solve the following equations (to ensure best responses):
$\sum_{i \in I} {\sigma_{r}}_i {M_{c}}_{ij} = v \quad \text{for all } j \in J$
(5)
$\sum_{j \in J} {M_r}_{ij} {\sigma_{c}}_j = u \quad \text{for all } i \in I$
(6)
Solve the normalisation and non-negativity constraints:
- $\sum_{i=1}^m {\sigma_{r}}_i = 1$ and ${\sigma_1}_i \geq 0$ for all $i$
- $\sum_{j=1}^n {\sigma_{c}}_j = 1$ and ${\sigma_2}_j \geq 0$ for all $j$
Check the best response condition.

Repeat steps 2–4 for all potential pairs of actions.

Example: Support enumeration algorithm for the coordination game¶

Let us apply the support enumeration algorithm to the coordination game.

The following supports (subsets of the action space) need to be considered:

I \in \{\{r_1\}, \{r_2\}, \{r_1, r_2\}\}

(7)

and

J \in \{\{c_1\}, \{c_2\}, \{c_1, c_2\}\}

(8)

For the cases where $|I|=|J|=1$ steps 2, 3 and 4 of the support enumeration algorithm correspond to finding best responses in the action space. This can be done by highlighting best responses:

M_r = \begin{pmatrix} \underline{3} & 1 \\ 0 & \underline{2} \end{pmatrix} \qquad M_c = \begin{pmatrix} \underline{2} & 1 \\ 0 & \underline{3} \end{pmatrix}

(9)

The support enumeration algorithm for $|I|=|J|=1$ gives the two following Nash equilibria:

((1, 0), (1, 0)) \qquad ((0, 1), (0, 1))

(10)

The final pair of actions to consider is when $I=(r_1, r_2)$ and $J=(c_1, c_2$ . In this case let:

\sigma_1=(x, 1 - x)\qquad \sigma_2=(y,1-y)

(11)

for $0 < x < 1$ and $0 < y < 1$ .

Step 2 corresponds to setting:

\begin{align*} \sum_{i \in I} {\sigma_{r}}_i {M_{c}}_{i1} = 2 x + 0 (1-x) &= v\\ \sum_{i \in I} {\sigma_{r}}_i {M_{c}}_{i2} = 1 x + 3 (1 -x) &= v \end{align*}

(12)

and

\begin{align*} \sum_{j \in J} {M_r}_{1j} {\sigma_{c}}_j = 3y + 1(1-y) &= u \\ \sum_{j \in J} {M_r}_{2j} {\sigma_{c}}_j = 0y + 2(1-y) &= u \\ \end{align*}

(13)

The particular values of $u$ or $v$ are not required so we can equate these pairs of expressions:

\begin{align*} 2x = x + 3-3x &\implies x =\frac{3}{4}\\ 3y + 1-y = 2 - 2y &\implies y =\frac{1}{4}\\ \end{align*}

(14)

Giving: $\sigma_1 = (3/4, 1/4)$ and $\sigma_2=(1/4, 3/4)$ .

Step 3 already holds as we enforced $\sigma_1 = (x, 1 - x)$ and $\sigma_2 = (y, 1 - y)$ at the start of our calculations. This is not necessarily the case for larger games.

The final step, step 4, requires us to check the best response condition. This is to check that there exists no better choice of action outside of the chosen set of $I$ and $J$ . This is more relevant for games with larger action spaces but let us nonetheless carry out the calculations:

M_r \sigma_2^\mathsf{T}= \begin{pmatrix}3/2 \\ 3/2\end{pmatrix}

(15)

and

\sigma_1 M_c = \begin{pmatrix}3/2 & 3/2\end{pmatrix}

(16)

as required by the best response condition. The support enumeration algorithm has given 3 Nash equilibria:

\left\{ ((1, 0), (1, 0)), ((0, 1), (0, 1)), ((\frac{3}{4}, \frac{1}{4}), (\frac{1}{4}, \frac{3}{4})), \right\}

(17)

The support enumeration algorithm is one of many algorithms that can be used to compute Nash equilibrium. Like most of the algorithms it works well for “most” games. When games are “degenerate” it may need more calculations and fail to give all equilibria.

Definition: Degenerate Games¶

A two player game is called non degenerate if no strategy of support size $k$ has more than $k$ best response actions.

Example: Support Enumeration for a degenerate game¶

Let us use support enumeration for the following game.

M_r = \begin{pmatrix} 2 & 5 \\ 0 & 5\\ \end{pmatrix} \qquad M_c = \begin{pmatrix} 2 & 1 \\ 0 & 1\\ \end{pmatrix}

(18)

First, we note that this game is degenerate, there are two best responses in action space to the first column:

M_r = \begin{pmatrix} \underline{2} & \underline{5} \\ 0 & \underline{5}\\ \end{pmatrix} \qquad M_c = \begin{pmatrix} \underline{2} & 1 \\ 0 & \underline{1}\\ \end{pmatrix}

(19)

Evaluating best responses in action space gives Nash equilibria:

((1, 0), (1, 0)) \qquad ((0, 1), (0, 1))

(20)

We need to consider new pairs of supports:

$\sigma_1=(x, 1-x)$ and $\sigma_2 = (1, 0)$ : there is single best response to the first column so nothing else to consider here.
$\sigma_1=(x, 1-x)$ and $\sigma_2 = (0, 1)$ : step 2 holds for all $x$ , step 3 is already satisfied (the vectors are both probability distributions) thus we are left to check the best response condition:
$\sigma_1M_c=(2x, 1)$
(21)
the only value of $x$ that gives a pair of best response is when the column player has no incentive to move from the support thus $2x=1 \implies x=1/2$ .

Exercises¶

Exercise: Support Enumeration¶

Use support enumeration to find Nash equilibria for the following games:

A = \begin{pmatrix} 3 & 3 & 2 \\ 2 & 1 & 3 \end{pmatrix} \qquad B = \begin{pmatrix} 2 & 1 & 3 \\ 2 & 3 & 2 \end{pmatrix}

(22)

A = \begin{pmatrix} 3 & -1 \\ 2 & 7 \end{pmatrix} \qquad B = \begin{pmatrix} -3 & 1 \\ 1 & -6 \end{pmatrix}

(23)

Exercise: Penalty kick strategies and Nash equilibrium¶

A soccer player (Player 1) is taking a penalty kick and can shoot either left or
right: $S_1 = \{\text{SL}, \text{SR}\}$ . The goalie (Player 2) can dive left or
right: $S_2 = \{\text{DL}, \text{DR}\}$ . The probabilities of scoring a goal
(depending on the chosen strategies) are shown in the matrix below:

\begin{pmatrix} 0.8 & 0.15 \\ 0.2 & 0.95 \end{pmatrix}

(24)

Assume the utility to Player 1 is the probability of scoring, and the utility
to Player 2 is the probability of preventing a goal. What is the Nash
equilibrium of this game?

Now suppose Player 1 has a third strategy: shooting in the middle. The new
action set becomes $S_1 = \{\text{SL}, \text{SM}, \text{SR}\}$ . The updated
probability matrix is:

\begin{pmatrix} 0.8 & 0.15 \\ 0.5 & 0.5 \\ 0.2 & 0.95 \end{pmatrix}

(25)

Determine the new Nash equilibrium for the extended game.

Programming¶

The Nashpy library has an implementation of the support enumeration algorithm. First let us create the payoff matrices and the game:

import numpy as np
import nashpy as nash

A = np.array(
    (
        (3, 1),
        (0, 2),
    )
)
B = np.array(
    (
        (2, 1),
        (0, 3),
    )
)

coordination_game = nash.Game(A, B)
coordination_game

Bi matrix game with payoff matrices:

Row player:
[[3 1]
 [0 2]]

Column player:
[[2 1]
 [0 3]]

Now to use the support enumeration algorithm:

list(coordination_game.support_enumeration())

[(array([1., 0.]), array([1., 0.])),
 (array([0., 1.]), array([0., 1.])),
 (array([0.75, 0.25]), array([0.25, 0.75]))]

Notable Research¶

Support enumeration is as close to a “from first principles” algorithm for computing Nash equilibria. Thus there is no specific paper to point to as per its formulation. Or indeed there is no specific set of papers that use it for their findings although a lot of papers that consider small games are in essence using support enumeration.

For example in Chiappori et al., 2002 theoretical results are given regarding penalty kicks that rely on the indifference ensured by the best response condition which is in turn essentially an application of the support enumeration algorithm.

A similar paper applied to another animal conservation is Lee & Roberts, 2016 in which the authors build a theoretical model of the effectiveness of Rhino horn devaluation. Once again the calculation presented are essentially an application of the support enumeration algorithm.

In Knight et al., 2017 whilst a different algorithm is used to identify Nash equilibrium for strategic hospital interactions it is somewhat similar to the support enumeration algorithm except that it takes advantage of the specific structure of the game considered.

Conclusion¶

This chapter introduced the concept of Nash equilibrium and demonstrated a
systematic method for identifying all equilibria in a two-player game using the
support enumeration algorithm. This method is rooted directly in the best
response condition and provides a computational approach that works well for
many games, particularly when the number of actions is small.

It is important to recognise that the support enumeration algorithm is
conceptually simple but can become computationally expensive as the action
spaces grow. Nevertheless, it serves as a useful tool both in theory and in
practice, particularly for small-scale empirical models.

Table 1 summarises the core concepts introduced in this chapter.

Table 1:The main concepts of Nash equilibrium

Concept	Description
Nash equilibrium	A strategy profile where each player’s strategy is a best response to the others’
Support of a strategy	The set of actions played with positive probability
Support enumeration algorithm	Enumerates possible supports and checks conditions for equilibrium
Degenerate game	A game in which some strategy of support size $k$ has more than $k$ best responses

References¶

Chiappori, P.-A., Levitt, S., & Groseclose, T. (2002). Testing mixed-strategy equilibria when players are heterogeneous: The case of penalty kicks in soccer. American Economic Review, 92(4), 1138–1151.
Lee, T. E., & Roberts, D. L. (2016). Devaluing rhino horns as a theoretical game. Ecological Modelling, 337, 73–78.
Knight, V., Komenda, I., & Griffiths, J. (2017). Measuring the price of anarchy in critical care unit interactions. Journal of the Operational Research Society, 68(6), 630–642.