Zero-Sum Games

Motivating Example¶

Consider the standard Rock-Paper-Scissors game The payoff matrix for the row player is:

M = \begin{pmatrix} 0 & -1 & 1 \\ 1 & 0 & -1\\ -1 & 1 & 0 \end{pmatrix}

(1)

We are going to modify the game to reflect the fact that some row player in question enjoys winning and hates losing with Paper more than any other action:

M = \begin{pmatrix} 0 & -1 & 1 \\ 2 & 0 & -2\\ -1 & 1 & 0 \end{pmatrix}

(2)

Is there a way for the row player to choose a strategy that guarantees a certain minimum expected payoff, regardless of how the column player responds?

Theory¶

This chapter will consider a specific subset of Definition: Normal Form Game.

Definition: Zero Sum Game¶

A two player normal form game with payoff matrices $(M_r, M_c) \in {\mathbb{R}^{m\times n}}^2$ is called zero sum if and only if:

M_r + M_c = 0

(3)

In the case of a zero sum game we will use the convention of defining it with:

M=M_r

(4)

and implying $M_c=-M$ .

Definition: the min-max and max-min strategies¶

Given a zero-sum game defined by a payoff matrix $M \in \mathbb{R}^{m \times n}$ and a strategy $y \in \mathbb{R}^n$ for the column player, the row player seeks a best response strategy $x \in \mathbb{R}^m$ that maximises their expected payoff:

\max_{x \in \mathcal{A}_1} x M y^T

(5)

This corresponds to choosing the rows of $M$ that yields the highest expected value under the strategy $y$ , i.e.,

\max_{i \leq m} (M y^T)_i

(6)

The column player, by selecting $y$ , can influence the upper bound $v$ of this maximum. Since the game is zero-sum, the column player will aim to choose $y$ to make this upper bound $v$ as small as possible.

Hence,

\max_{x \in \mathcal{A}_1} x M y^T = \max_{i \leq m} (M y^T)_i = \min \left\{ v \in \mathbb{R} \;\middle|\; M y^T \leq \mathbb{1} v \right\}

(7)

The min-max strategy $y$ for the column player is the solution to the following optimisation problem (referred to as a linear program):

\begin{aligned} \min_{y, v} \quad & v \\ \text{subject to} \quad & M y^T \leq \mathbb{1} v \\ & y \in \mathcal{A}_2 \end{aligned}

(8)

In this formulation, $v$ is the min-max value of the game.

The corresponding max-min strategy $x$ for the row player solves the following linear program:

\begin{aligned} \max_{x, u} \quad & u \\ \text{subject to} \quad & x M \geq \mathbb{1} u \\ & x \in \mathcal{A}_1 \end{aligned}

(9)

In this case, $u$ is the max-min value of the game.

Example: Max-min strategy for modified Rock-Paper-Scissors¶

For the modified Rock-Paper-Scissors game, the max-min strategy $x$ for the row player satisfies the following linear program:

\begin{aligned} \max_{x, u} \quad & u \\ \text{subject to} \quad & 2x_2 - x_3 \geq u \\ & -x_1 + x_3 \geq u \\ & x_1 - 2x_2 \geq u \\ & x_1 + x_2 + x_3 = 1 \\ & x_i \geq 0 \quad \text{for all } i \in \{1, 2, 3\} \end{aligned}

(10)

Example: Max-min strategy for Matching Pennies¶

For Example: Matching Pennies with payoff matrix:

M = \begin{pmatrix} 1 & -1 \\ -1& 1 \end{pmatrix}

(11)

the max-min strategy $x$ for the row player satisfies the following linear program:

\begin{aligned} \max_{x, u} \quad & u \\ \text{subject to} \quad & x_1 - x_2 \geq u \\ & -x_1 + x_2 \geq u \\ & x_1 + x_2 = 1 \\ & x_i \geq 0 \quad \text{for all } i \in \{1, 2\} \end{aligned}

(12)

Given that $x_1 + x_2 = 1$ , this linear program corresponds to:

\begin{aligned} \max_{x_1, u} \quad & u \\ \text{subject to} \quad & 2x_1 - 1 \geq u \\ & -2x_1 + 1 \geq u \\ & 0 \leq x_1 \leq 1 \end{aligned}

(13)

These constraints can be rewritten as:

\begin{aligned} x_1 &\geq \frac{1 + u}{2} \\ x_1 &\leq \frac{1 - u}{2} \\ 0 &\leq x_1 \leq 1 \end{aligned}

(14)

This implies:

\frac{1 + u}{2} \leq x_1 \leq \frac{1 - u}{2}

(15)

which leads to:

\frac{1 + u}{2} \leq \frac{1 - u}{2} \quad \Rightarrow \quad u \leq -u

(16)

This inequality holds only when $u = 0$ . When $u = 0$ , the constraints reduce to:

\frac{1}{2} \leq x_1 \leq \frac{1}{2}

(17)

yielding the unique solution $x_1 = \frac{1}{2}$ .

Thus, the max-min strategy is:

x = \left( \frac{1}{2}, \frac{1}{2} \right)

(18)

Theorem: The minimax theorem¶

The minimax theorem Neumann, 1928 states that if there exists optimal values of the:

max-min value $u$ and the max-min strategy $x$ .
min-max value $v$ and the min-max strategy $y$ .

then $u=v$ .

The proof which uses the linear program duality theorem is omitted from this book but can be found in Vanderbei, 2010.

Note that this answers the question posed at the end of Motivating Example through a choice of strategy the row player can ensure they obtain the value of the game which is equal to the max-min value and the min-max value.

In the next section we will start to introduce practical tools with which to do that.

Definition: Standard Form Linear program¶

A standard form of the linear program can be written which more readily will allow us to use Integer Pivoting.

In a zero-sum game, given a row player payoff matrix $M$ with $m$ rows and $n$ columns, the following linear program yields the max-min strategy and the value of the game:

\min_{x \in \mathbb{R}^{(m + 1) \times 1}} \; c x

(19)

subject to:

\begin{aligned} M_{\text{ub}} x &\leq b_{\text{ub}} \\ M_{\text{eq}} x &= b_{\text{eq}} \\ x_i &\geq 0 \quad \text{for } i \leq m \end{aligned}

(20)

The coefficients in this linear program are defined as:

\begin{aligned} c &= (\underbrace{0, \dots, 0}_{m}, -1) && \text{where } c \in \{0, 1\}^{1 \times (m + 1)} \\[0.5em] M_{\text{ub}} &= \begin{pmatrix} (-M^T)_{11} & \dots & (-M^T)_{1m} & 1 \\ \vdots & \ddots & \vdots & 1 \\ (-M^T)_{n1} & \dots & (-M^T)_{nm} & 1 \end{pmatrix} && M_{\text{ub}} \in \mathbb{R}^{n \times (m + 1)} \\[0.5em] b_{\text{ub}} &= (0, \dots, 0)^T && b_{\text{ub}} \in \mathbb{R}^{n \times 1} \\[0.5em] M_{\text{eq}} &= (\underbrace{1, \dots, 1}_{m}, 0) && M_{\text{eq}} \in \{0, 1\}^{1 \times (m + 1)} \\[0.5em] b_{\text{eq}} &= 1 \end{aligned}

(21)

Example: Standard form for the Modified Rock Paper Scissors Game¶

For the modified Rock-Paper-Scissors game, the corresponding coefficients are:

\begin{aligned} c &= (0, 0, 0, -1) \\[0.5em] M_{\text{ub}} &= \begin{pmatrix} 0 & -2 & 1 & 1 \\ 1 & 0 & -1 & 1 \\ -1 & 2 & 0 & 1 \end{pmatrix} \\[0.5em] b_{\text{ub}} &= \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix} \\[0.5em] M_{\text{eq}} &= (1, 1, 1, 0) \\[0.5em] b_{\text{eq}} &= 1 \end{aligned}

(22)

Definition: Tableau for Zero-Sum Game¶

Given a zero-sum game with payoff matrix $M\in \mathbb{R}{m\times n}$ the standard form linear program: can be represented by the following initial tableau:

T = \left( \begin{array}{ccccccc|c} x_1 & \dots & x_m & v & s_1 & \dots & s_n & b\\ \hline (-M^T)^{11} & \dots & (-M^T)_{1m} & 1 & 1 & \dots & 1 & 0\\ \vdots & \ddots & \vdots & 1 & 0 & \ddots & 0 & 0\\ (-M^T)_{n1} & \dots & (-M^T)\_{nm} & 1 & 0 & \dots & 1 & 0\\ 1 & \dots & 1 & 0 & 0 & \dots & 1 & 1\\ \hline 0 & \dots & 0 & -1 & 0 & \dots & 0 & 0 \end{array} \right)

(23)

Here, slack variables have been introduced to convert inequalities into equalities. The tableau is arranged with columns for the decision variables $x_1, \dots, x_m$ , the game value variable $v$ , slack variables $s_1, \dots, s_n$ , and the right-hand side.

We proceed by performing integer pivoting to move from one basic feasible solution to another, reducing the objective function at each step until optimality is reached.

Example: Integer Pivoting for Modified Rock-Paper-Scissors¶

We now solve the modified Rock-Paper-Scissors game using the tableau method. Recall that the standard form coefficients are:

\begin{aligned} c &= (0, 0, 0, -1) \\ M_{\text{ub}} &= \begin{pmatrix} 0 & -2 & 1 & 1 \\ 1 & 0 & -1 & 1 \\ -1 & 2 & 0 & 1 \end{pmatrix} \\ b_{\text{ub}} &= \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix} \\ M_{\text{eq}} &= (1, 1, 1, 1, 0) \\ b_{\text{eq}} &= 1 \end{aligned}

(24)

To construct the tableau, we introduce slack variables $s_1, s_2, s_3$ for the inequality constraints, and denote the game value variable by $v = x_5$ .

Initial Tableau¶

The initial tableau is:

\begin{array}{ccccccc|c} x_1 & x_2 & x_3 & v & s_1 & s_2 & s_3 & b \\ \hline 0& -2& 1& 1& 1& 0& 0& 0&\\ 1& 0& -1&1& 0& 1& 0& 0&\\ -1& 2& 0& 1& 0& 0& 1& 0&\\ 1& 1& 1& 0& 0& 0& 0& 1&\\ \hline 0& 0& 0& -1& 0& 0& 0& 0&\\ \end{array}

(25)

The last row is the objective function: minimising $v$ . We begin by identifying the entering variable with the most negative coefficient in the objective row, which is $v$ .

Pivot 1: Entering variable $v$ ¶

To pivot on $v$ , we look at the positive entries in the $v$ column (rows 1–3). All are 1, so we apply the ratio test:

Row 1: $0 / 1 = 0$
Row 2: $0 / 1 = 0$
Row 3: $0 / 1 = 0$

Ties are broken arbitrarily. Suppose we choose row 1: we subtract appropriate multiples of this row from all others to eliminate $v$ from those rows:

Row 2 $\gets$ Row 2 - Row 1
Row 3 $\gets$ Row 3 - Row 1
Objective row = Objective + Row 1

After row operations:

\begin{array}{ccccccc|c} x_1 & x_2 & x_3 & v & s_1 & s_2 & s_3 & b \\ \hline 0& -2& 1& 1& 1& 0& 0& 0\\ 1& 2& -2& 0& -1& 1& 0& 0\\ -1& 4& -1& 0& -1& 0& 1& 0\\ 1& 1& 1& 0& 0& 0& 0& 1\\ \hline 0& -2& 1& 0& 1& 0& 0& 0 \end{array}

(26)

Pivot 2: Entering variable $x_2$ ¶

Next, inspect the objective row. The most negative coefficient is -2 for $x_2$ .

Apply ratio test on rows with positive $x_2$ entries:

Row 2: $0 / 2 = 0$
Row 3: $0 / 4 = 0$
Row 4: $1 / 1 = 1$

Choose Row 2 (arbitrary tie-break).

Pivot on entry in row 2, column $x_2$ and eliminate $x_2$ from other rows. After computations, we get:

\begin{array}{ccccccc|c} x_1 & x_2 & x_3 & v & s_1 & s_2 & s_3 & b \\ \hline 2& 0& -2& 2& 0& 2& 0& 0\\ 1& 2& -2& 0& -1& 1& 0& 0\\ -6& 0& 6& 0& 2& -4& 2& 0\\ 1& 0& 4& 0& 1& -1& 0& 2\\ \hline 2& 0& -2& 0& 0& 2& 0& 0 \end{array}

(27)

Pivot 3: Entering variable $x_3$ ¶

Continue inspecting the objective row. The most negative is now -2 for $x_3$ , so it enters. Apply ratio test among positive entries in column $x_3$ .

Row 3: $0 / 6 = 0$

There is a single candidate: pivot on row 3. Eliminate $x_3$ from other rows. After computations we get:

\begin{array}{ccccccc|c} x_1 & x_2 & x_3 & v & s_1 & s_2 & s_3 & b \\ \hline 0& 0& 0& 12& 4& 4& 4& 0\\ -6& 12& 0& 0& -2& -2& 4& 0\\ -6& 0& 6& 0& 2& -4& 2& 0\\ 30& 0& 0& 0& -2& 10& -8& 12\\ \hline 0& 0& 0& 0& 4& 4& 4& 0 \end{array}

(28)

We now set the basic variables to 0 and read the equations for the non-basic variables:

\begin{align*} s_1 &=0\\ s_2 &=0 \\ s_3 &=0 \\ 12 v &= 0\\ -6x_1 + 12x_2&=0\\ -6x_1 + 6 x_3&=0\\ 30x_1 &= 12 \end{align*}

(29)

This gives:

x = (12/30, 6/30, 12/30)\qquad v=0

(30)

Thus, the max-min strategy is:

x = \left( \frac{2}{5}, \frac{1}{5}, \frac{2}{5} \right)

(31)

Exercises¶

Exercise: Coefficients for standard form LP¶

Obtain the coefficients of the standard form linear system for the zero-sum games with the following payoff matrices:

$M = \begin{pmatrix} 3 & -1\ -1 & 2 \end{pmatrix}$
$M = \begin{pmatrix} -1 & -1\ -1 & 3 \end{pmatrix}$
$M = \begin{pmatrix} 2 & 1 & -3\ -3 & -1 & 3 \end{pmatrix}$
$M = \begin{pmatrix} 3 & -2 & 0\ -3 & 0 & 3 \ 0 & 2 & -5 \end{pmatrix}$

Exercise: Max-min strategy for Matching Pennies¶

For Example: Max-min strategy for Matching Pennies:

Use integer pivoting to confirm that the max-min strategy is $x = (1/2, 1/2)$ .
By letting $M = -M^T$ , or otherwise, obtain the min-max strategy for the column player.
Use the Best Response Condition to confirm your calculations.

Exercise: Max-min strategy for Rock Paper Scissors¶

Obtain the max-min strategy for the standard game of Rock Paper Scissors defined by:

M = \begin{pmatrix} 0 & -1 & 1 \\ 1 & 0 & -1\\ -1 & 1 & 0 \end{pmatrix}

(32)

Exercise: Modified Rock Paper Scissors¶

For Example: Integer pivoting for modified Rock Paper Scissors:

By letting $M = -M^T$ , or otherwise, obtain the min-max strategy for the column player.
Use the Best Response Condition to confirm your calculations.

Programming¶

Solve linear programs using Scipy¶

The scipy library provides functionality to solve a linear program in standard form.

We begin by creating the various matrices and vectors: $M_{ub}$ , $M_{eq}$ , $b_{ub}$ , $b_{eq}$ , and $c$ :

import numpy as np

M = np.array(
    [
        [0, -1, 1],
        [2, 0, -2],
        [-1, 1, 0],
    ]
)
M_ub = np.hstack((-M.T, [[1], [1], [1]]))
M_eq = np.array(([[1, 1, 1, 0]]))
b_ub = np.array(
    [
        [0],
        [0],
        [0],
    ]
)
b_eq = 1
c = np.array([0, 0, 0, -1])

Now we can pass these to scipy.optimize.linprog:

import scipy.optimize

res = scipy.optimize.linprog(
    c=c,
    A_ub=M_ub,
    b_ub=b_ub,
    A_eq=M_eq,
    b_eq=b_eq,
)
res

This returns the full output of the optimisation. The min-max strategy is contained in all but the last entry of res.x:

res.x[:-1]

The last entry of res.x gives the value of the game:

res.x[-1]

Obtain min-max and max-min strategies using Nashpy¶

nashpy can be used to directly obtain the min-max and max-min strategies:

import nashpy as nash

game = nash.Game(M, -M)
game.linear_program()

Obtain min-max and max-min strategies using Gambit¶

Gambit can be used to directly obtain the min-max and max-min strategies. We start by creating a pygambit game from arrays:

import pygambit as gbt

game = gbt.Game.from_arrays(M, -M)
game

Now we can solve the underlying linear program:

gbt.nash.lp_solve(game)

Notable Research¶

The foundations of zero-sum game theory and its connection to linear programming emerged from a convergence of ideas in mathematics, economics, and operations research during the mid-20th century.

The minimax theorem was first proven by John von Neumann in 1928 Neumann, 1928. This landmark result, stating that every finite, two-player zero-sum game has a value and optimal strategies, was later generalised in 1944 Neumann & Morgenstern, 1944.

The minimax theorem does not necessarily only apply to zero-sum games but in fact applies to any constant sum game where $M_r + M_c = K$ for some constant $K$ . An example of this is shown in Chiappori et al., 2002 where penalty kicks are modelled and the payoff matrices correspond to the probability of scoring (or for the column player saving) a penalty.

Until the work of Nash Jr, 1950 the minimax theorem was the main solution concept in game theory. For his foundational work on equilibrium in non-cooperative games, John Nash was awarded the Nobel Prize in Economic Sciences in 1994, shared with John Harsanyi and Reinhard Selten. His contributions form the cornerstone of non-cooperative game theory.

Conclusion¶

This chapter introduced zero-sum games, where one player’s gain is precisely balanced by the other’s loss. We explored the foundational minimax theorem, the max-min and min-max strategies, and showed how linear programming provides a practical and elegant way to compute optimal strategies.

The central insight of this chapter is the equivalence between solving a zero-sum game and solving a pair of dual linear programs. This connection allows us to apply tools from optimisation—such as tableau methods and integer pivoting—to find equilibrium strategies.

Table 1 summarises the two central linear programs seen in this chapter.

Table 1:The main linear programs for Zero Sum Game

Problem	Player	Objective	Constraints
Max-min LP	Row player	Maximise $u$	$x M \geq \mathbb{1} u$ , $x \in \mathcal{A}_1$
Min-max LP	Column player	Minimise $v$	$M y^T \leq \mathbb{1} v$ , $y \in \mathcal{A}_2$

References¶

v. Neumann, J. (1928). Zur theorie der gesellschaftsspiele. Mathematische Annalen, 100(1), 295–320.
Vanderbei, R. J. (2010). Linear Programming: Foundations and Extensions (Softcover reprint of hardcover 3rd Edition 2008). Springer Science+Business Media.
von Neumann, J. (1928). Zur Theorie der Gesellschaftsspiele. Mathematische Annalen, 100(1), 295–320.
von Neumann, J., & Morgenstern, O. (1944). Theory of Games and Economic Behavior. Princeton University Press.
Chiappori, P.-A., Levitt, S., & Groseclose, T. (2002). Testing mixed-strategy equilibria when players are heterogeneous: The case of penalty kicks in soccer. American Economic Review, 92(4), 1138–1151.
Nash Jr, J. F. (1950). Equilibrium points in n-person games. Proceedings of the National Academy of Sciences, 36(1), 48–49.

Game Theory

Rationality