Moran Process - Game Theory

Motivating Example: Everyone’s citing the preprint¶

In a graduate student reading group, everyone cites a well-established textbook in their essays. One student, though, starts citing a recent preprint they found on arXiv.

“It’s got a better proof of the key result — and it’s open access.”

Each week:

A student is admired for their choice of citation (fitness ∝ novelty, clarity, or style).
Another student, chosen at random, updates their references to match.

Over time, the group begins to shift its citation culture. The preprint might become canonical, or the students might return to citing the traditional text.

The more students who cite the preprint, the more attractive it becomes to others: shared references lead to easier discussion, common assumptions, and social reinforcement. In this way, the fitness of citing the preprint is not fixed — it depends on the current citation habits of the group. This makes the process frequency-dependent, just as in evolutionary game dynamics where payoffs arise from interaction with others.

To model this interaction explicitly, consider the following symmetric game. Each player chooses whether to cite the textbook ( $T$ ) or the preprint ( $P$ ). The row and column player payoffs are given by:

M_r = \begin{pmatrix} 3 & 0 \\ 1 & 2 \end{pmatrix} \qquad M_c = \begin{pmatrix} 3 & 1 \\ 0 & 2 \end{pmatrix}

(1)

If both students cite the textbook, they align well and receive the highest payoff of 3. If both cite the preprint, they still coordinate and get a payoff of 2 — slightly lower, but still beneficial.

If one cites the textbook while the other cites the preprint, there is a mismatch: the preprint-citer still gets some benefit (payoff 1) from its clarity and openness, but the textbook-citer gains nothing (payoff 0) from the mismatch.

Note that the group is small, so the infinite population assumption of Replicator Dynamics does not apply. The topic of this chapter is the Moran process, a model suited to exactly this setting.

Theory¶

Definition: Moran Process¶

First defined in Moran (1958), the Moran process assumes a constant population of $N$ individuals, each of which is one of $m$ different types. There exists a fitness function $f: \{1, \dots, m\} \times \{1, \dots, m\}^N \to \mathbb{R}$ that maps each individual to a numeric fitness value which depends on the types of the individuals in the population.

The process is defined as follows. At each step:

Every individual $k$ has their fitness $f_k(v)$ calculated, where $v$ denotes the current population.
An individual is randomly selected for copying. This selection is done proportional to their fitness $f_k(v)$ . Thus, the probability of selecting individual $k$ for copying is given by:
$\frac{f_k(v)}{\sum_{h=1}^N f_h(v)}$
(2)
An individual is selected for removal. This selection is done uniformly at random. Thus, the probability of selecting individual $i$ for removal is:
$\frac{1}{N}$
(3)
An individual of the same type as the individual selected for copying is introduced to the population.
The individual selected for removal is removed.

The process is repeated until there is only one type of individual left in the population.
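The steps above can be sketched in a few lines of NumPy. This is only an illustrative sketch: the function names `moran_step` and `run_moran_process`, and the convention that `fitness` is a callable returning one fitness value per individual, are assumptions made here and are not taken from any library. It also assumes the total fitness is strictly positive.

```python
import numpy as np


def moran_step(population, fitness, rng):
    """Return the population after one step of the Moran process.

    population: 1D integer array, entry k is the type of individual k.
    fitness: callable returning one fitness per individual (hypothetical
        interface chosen for this sketch).
    rng: a numpy random Generator.
    """
    fitnesses = np.asarray(fitness(population), dtype=float)
    copy_probabilities = fitnesses / fitnesses.sum()
    # Selection for copying is proportional to fitness.
    copied = rng.choice(len(population), p=copy_probabilities)
    # Selection for removal is uniform over the population.
    removed = rng.integers(len(population))
    new_population = population.copy()
    new_population[removed] = population[copied]
    return new_population


def run_moran_process(population, fitness, rng):
    """Repeat steps until one type remains; return every population seen."""
    history = [population.copy()]
    while len(np.unique(history[-1])) > 1:
        history.append(moran_step(history[-1], fitness, rng))
    return history


rng = np.random.default_rng(0)
# Illustrative choice: every individual has fitness 1.
history = run_moran_process(
    population=np.array([0, 0, 0, 1]),
    fitness=lambda population: np.ones(len(population)),
    rng=rng,
)
final_population = history[-1]
```

Overwriting the removed individual with a copy of the selected one performs the introduction and the removal in a single assignment, which keeps the population size constant.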

A common representation of the fitness function $f$ is to use a game with payoff matrix $A$. In this setting, writing $v_j$ for the number of individuals of type $j$ in the population, the fitness of an individual of type $i$ is:

f_i(v) = (v_{i} - 1)A_{ii} + \sum_{j\ne i, j=1}^{m}v_jA_{ij}

(4)
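As a quick check of this formula, the following sketch computes the fitness of every type at once. The helper name `type_fitnesses` is an assumption for illustration, and the payoff matrix is the row player matrix of the motivating example with the textbook as type 0 and the preprint as type 1.

```python
import numpy as np


def type_fitnesses(A, v):
    """Fitness of each type: f_i(v) = (v_i - 1) A_ii + sum_{j != i} v_j A_ij."""
    A = np.asarray(A)
    v = np.asarray(v)
    # A @ v counts every individual, including the focal individual,
    # so one self-interaction A_ii is subtracted for each type.
    return A @ v - np.diag(A)


A = np.array([[3, 0], [1, 2]])  # textbook (T) is type 0, preprint (P) is type 1
f_T, f_P = type_fitnesses(A, v=(3, 1))
```

With 3 textbook citers and 1 preprint citer this gives $f_T = 6$ and $f_P = 3$. The value returned for a type with no individuals in the population has no meaning and should be ignored.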

Example: Selection Probabilities for citation behaviour.¶

For the Motivating Example: Everyone’s citing the preprint, let us consider the situation with $N=4$ individuals in the reading group, starting with 3 who cite the textbook ( $T$ ) and 1 who cites the preprint ( $P$ ). There are 5 possible populations, one for each number of preprint citers from 0 to 4. Table 1 gives the selection probabilities for each population.

Table 1: Selection probabilities for citation behaviour

$(v_T, v_P)$	$f_T$	$f_P$	Prob copy $T$	Prob copy $P$	Prob remove $T$	Prob remove $P$
$(4, 0)$	$3 \cdot 3 = 9$	–	1	0	1	0
$(3, 1)$	$2 \cdot 3 + 1 \cdot 0 = 6$	$3 \cdot 1 + 0 \cdot 2 = 3$	$\frac{3 \cdot 6}{3 \cdot 6 + 1 \cdot 3} = \frac{6}{7}$	$\frac{1}{7}$	$\frac{3}{4}$	$\frac{1}{4}$
$(2, 2)$	$1 \cdot 3 + 2 \cdot 0 = 3$	$2 \cdot 1 + 1 \cdot 2 = 4$	$\frac{2 \cdot 3}{2 \cdot 3 + 2 \cdot 4} = \frac{3}{7}$	$\frac{4}{7}$	$\frac{1}{2}$	$\frac{1}{2}$
$(1, 3)$	$0 \cdot 3 + 3 \cdot 0 = 0$	$1 \cdot 1 + 2 \cdot 2 = 5$	$\frac{1 \cdot 0}{1 \cdot 0 + 3 \cdot 5} = 0$	1	$\frac{1}{4}$	$\frac{3}{4}$
$(0, 4)$	–	$0 \cdot 1 + 3 \cdot 2 = 6$	0	1	0	1
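The mixed rows of Table 1 can be reproduced with exact arithmetic. The sketch below uses Python's `fractions` module; the function name `selection_probabilities` is a hypothetical helper, and it only handles populations in which both types are present.

```python
from fractions import Fraction

A = ((3, 0), (1, 2))  # rows and columns ordered (T, P)
N = 4


def selection_probabilities(v_T, v_P):
    """Copy and removal probabilities, assuming v_T > 0 and v_P > 0."""
    f_T = (v_T - 1) * A[0][0] + v_P * A[0][1]
    f_P = v_T * A[1][0] + (v_P - 1) * A[1][1]
    total_fitness = v_T * f_T + v_P * f_P
    copy_T = Fraction(v_T * f_T, total_fitness)
    remove_T = Fraction(v_T, v_T + v_P)
    # Order: Prob copy T, Prob copy P, Prob remove T, Prob remove P.
    return copy_T, 1 - copy_T, remove_T, 1 - remove_T


rows = {(N - i, i): selection_probabilities(N - i, i) for i in range(1, N)}
```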

Definition: The Fixation Probability¶

The fixation probability of a given type in a Moran process is the probability that the population eventually becomes composed entirely of individuals of that type.

Consider a finite population of size $N$ with two types: a resident type and a mutant type. Suppose the process begins with $i$ individuals of the mutant type and $N - i$ residents.

Let $\rho_i$ denote the fixation probability of the mutant type starting from $i$ individuals. Then:

$\rho_0 = 0$ , since no mutants exist.
$\rho_N = 1$ , since all individuals are mutants.
For $0 < i < N$ , $\rho_i$ gives the probability that the mutant type eventually fixates (i.e., reaches $N$ individuals), assuming the dynamics follow the Moran process.

In practice, fixation probabilities correspond to absorption probabilities of an underlying absorbing Markov chain whose state is the number of mutants $i$.

The transition probabilities for the underlying Markov chain are:

P_{i \to i+1} = (\text{Prob copy mutant})\cdot (\text{Prob remove resident})

(5)

and

P_{i \to i-1} = (\text{Prob copy resident})\cdot (\text{Prob remove mutant})

(6)

Finally:

P_{i \to i} = 1 - P_{i \to i+1} - P_{i \to i-1}

(7)
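These transition probabilities can be assembled into a matrix for any population size. In the sketch below a step up requires copying a mutant and removing a resident, while a step down requires copying a resident and removing a mutant. The function name `transition_matrix` and the convention that the resident is type 0 and the mutant is type 1 are assumptions of this sketch, which also takes the fitness from a 2 by 2 game.

```python
from fractions import Fraction


def transition_matrix(A, N):
    """Transition matrix over states i = 0, ..., N (the number of mutants).

    A is a 2 by 2 payoff matrix with the resident as type 0 and the
    mutant as type 1. Total fitness is assumed to be positive.
    """
    P = [[Fraction(0)] * (N + 1) for _ in range(N + 1)]
    P[0][0] = P[N][N] = Fraction(1)  # absorbing states
    for i in range(1, N):
        f_R = (N - i - 1) * A[0][0] + i * A[0][1]
        f_M = (N - i) * A[1][0] + (i - 1) * A[1][1]
        total_fitness = (N - i) * f_R + i * f_M
        copy_mutant = Fraction(i * f_M, total_fitness)
        remove_mutant = Fraction(i, N)
        P[i][i + 1] = copy_mutant * (1 - remove_mutant)
        P[i][i - 1] = (1 - copy_mutant) * remove_mutant
        P[i][i] = 1 - P[i][i + 1] - P[i][i - 1]
    return P


# Citation game of the motivating example: textbook resident, preprint mutant.
P = transition_matrix(A=((3, 0), (1, 2)), N=4)
```

Using `Fraction` keeps every entry exact, which makes it easy to compare against hand calculations.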

Example: Fixation of citation behaviour as an absorbing Markov chain¶

For a given $N$, the absorbing Markov chain underlying the Motivating Example: Everyone’s citing the preprint has a state space that can be indexed by $i$, the number of individuals that cite the preprint.

For $N=4$ , we can use Table 1 to write down the probability $p_{ij}$ of going from state $i$ to state $j$ :

\begin{align*} P &= \begin{pmatrix} 1 & 0 & 0 & 0 & 0 \\ (6/7)\cdot(1/4) & P_{11} & (1/7)\cdot(3/4) & 0 & 0 \\ 0 & (3/7)\cdot(1/2) & P_{22} & (4/7)\cdot(1/2) & 0 \\ 0 & 0 & 0\cdot(3/4) & P_{33} & (1)\cdot(1/4) \\ 0 & 0 & 0 & 0 & 1 \\ \end{pmatrix}&&\text{ using the selection probabilities}\\ &= \begin{pmatrix} 1 & 0 & 0 & 0 & 0 \\ 3/14 & 19/28 & 3/28 & 0 & 0 \\ 0 & 3/14 & 1/2 & 2/7 & 0 \\ 0 & 0 & 0 & 3/4 & 1/4 \\ 0 & 0 & 0 & 0 & 1 \\ \end{pmatrix}&& \text{ using } P_{i \to i} = 1 - P_{i \to i+1} - P_{i \to i-1} \end{align*}

(8)

This is an absorbing Markov chain which, by reordering of states, we can write in the canonical form:

P = \begin{pmatrix} Q & R\\ 0 & I \end{pmatrix}

(9)

with:

Q = \begin{pmatrix} 19/28 & 3/28 & 0 \\ 3/14 & 1/2 & 2/7 \\ 0 & 0 & 3/4 \\ \end{pmatrix}

(10)

and

R = \begin{pmatrix} 3/14 & 0\\ 0 & 0\\ 0 & 1/4 \end{pmatrix}

(11)

Thus we can calculate the fundamental matrix $N = (I - Q)^{-1}$ (not to be confused with the population size, also denoted $N$ ):

\begin{align*} N &= \begin{pmatrix} 9/28 & -3/28 & 0 \\ -3/14 & 1/2 & -2/7 \\ 0 & 0 & 1/4 \\ \end{pmatrix} ^ {-1}\\ & = \begin{pmatrix} 98/27 & 7/9 & 8/9\\14/9 & 7/3 & 8/3\\0 & 0 & 4 \end{pmatrix} \end{align*}

(12)

We omit the calculation of the inverse which can be obtained using Gauss-Jordan elimination or any other approach.

We can now compute the absorption probability matrix:

B = N R = \begin{pmatrix} 7/9 & 2/9\\ 1/3 & 2/3\\0 & 1 \end{pmatrix}

(13)

Thus if a single mutant appears (in our case, a single individual who starts citing the preprint), there is a $2/9$ chance that the entire reading group ends up citing the preprint:

\rho_1 = \frac{2}{9}

(14)
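The matrix calculations of this example can be checked numerically. The sketch below uses NumPy and solves the linear system $(I - Q)B = R$ directly, which is one of several equivalent ways of obtaining $B$; the variable names are chosen here for illustration.

```python
import numpy as np

# Transient block Q (states 1, 2, 3) and absorbing block R (states 0, 4).
Q = np.array([[19 / 28, 3 / 28, 0], [3 / 14, 1 / 2, 2 / 7], [0, 0, 3 / 4]])
R = np.array([[3 / 14, 0], [0, 0], [0, 1 / 4]])

# Fundamental matrix (I - Q)^{-1}.
fundamental_matrix = np.linalg.inv(np.eye(3) - Q)

# Absorption probabilities, obtained by solving (I - Q) B = R.
B = np.linalg.solve(np.eye(3) - Q, R)

# First row is state 1; second column is absorption in state 4.
rho_1 = B[0, 1]
```

The second column of `B` holds $\rho_1, \rho_2, \rho_3$, the probabilities that the preprint takes over from 1, 2 or 3 initial citers.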

Theorem: The fixation probabilities in populations of two types¶

Given a Moran process in a population with two types as defined in Definition: The Fixation Probability, the fixation probability $\rho_i$ is given by:

\rho_i=\frac{1+\sum_{j=1}^{i-1}\prod_{k=1}^j\gamma_k}{1+\sum_{j=1}^{N-1}\prod_{k=1}^j\gamma_k}

(15)

where:

\gamma_k = \frac{f_R(k)}{f_M(k)}

(16)

Proof:

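Conditioning on the first step of the underlying absorbing Markov chain gives $\rho_i = p_{i,i-1}\rho_{i-1} + p_{ii}\rho_i + p_{i,i+1}\rho_{i+1}$ for $0 < i < N$ . Rearranging, and using $1 - p_{ii} = p_{i,i-1} + p_{i,i+1}$ , we have: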

\begin{align*} p_{i,i+1}\rho_{i+1} & = -p_{i,i-1}\rho_{i-1} + \rho_i(1 - p_{ii}) \\ p_{i,i+1}\rho_{i+1} & = p_{i,i-1}(\rho_{i} - \rho_{i-1}) + \rho_ip_{i,i+1} \\ \rho_{i+1} - \rho_i & = \frac{p_{i, i-1}}{p_{i, i+1}}(\rho_i-\rho_{i-1})=\gamma_i(\rho_i-\rho_{i-1}) \end{align*}

(17)

with:

\begin{align*} \gamma_i &= \frac{p_{i, i - 1}}{p_{i, i + 1}}\\ &= \frac{\frac{(N - i)f_R(i)}{if_M(i) + (N - i)f_R(i)}\frac{i}{N}}{\frac{if_M(i)}{if_M(i) + (N - i)f_R(i)}\frac{N-i}{N}}\\ &= \frac{(N - i)f_R(i)}{if_M(i) + (N - i)f_R(i)}\frac{i}{N}\cdot\frac{if_M(i) + (N - i)f_R(i)}{if_M(i)}\frac{N}{N-i}\\ &= \frac{(N - i)f_R(i)i}{if_M(i)(N-i)}\\ &= \frac{f_R(i)}{f_M(i)}\\ \end{align*}

(18)

We observe that:

\begin{align} \rho_2 - \rho_1 &= \gamma_1(\rho_1-\rho_{0})=\gamma_1\rho_1\\ \rho_3 - \rho_2 &= \gamma_2(\rho_2-\rho_1)=\gamma_2\gamma_1\rho_1\\ \rho_4 - \rho_3 &= \gamma_3(\rho_3-\rho_2)=\gamma_3\gamma_2\gamma_1\rho_1\\ &\; \vdots & \\ \rho_{i+1} - \rho_i &= \gamma_i(\rho_i-\rho_{i-1})=\prod_{k=1}^i\gamma_k\rho_1\\ &\; \vdots & \\ \rho_{N} - \rho_{N-1} &= \gamma_{N-1}(\rho_{N-1}-\rho_{N-2})=\prod_{k=1}^{N-1}\gamma_k\rho_1\\ \end{align}

(19)

thus we have:

\rho_i=\sum_{j=0}^{i-1}\left(\rho_{j+1}-\rho_j\right)=\left(1+\sum_{j=1}^{i-1}\prod_{k=1}^j\gamma_k\right)\rho_1

(20)

Setting $i = N$ and solving the following equation for $\rho_1$ gives the required result.

\rho_N=1=\left(1+\sum_{j=1}^{N-1}\prod_{k=1}^j\gamma_k\right)\rho_1

(21)

Example: Direct calculation of fixation of citation behaviour¶

For a given $N$, the fixation probabilities of the Motivating Example: Everyone’s citing the preprint can be found directly using (15).

For $N=4$ , recalling that $R=T$ and $M=P$ , we can write down the values of $\gamma_i$ using Table 1:

\begin{align*} \gamma_1 & = \frac{f_{T}(1)}{f_{P}(1)} = \frac{6}{3}=2\\ \gamma_2 & = \frac{f_{T}(2)}{f_{P}(2)} = \frac{3}{4}\\ \gamma_3 & = \frac{f_{T}(3)}{f_{P}(3)} = \frac{0}{5} \end{align*}

(22)

This gives:

\begin{align*} \rho_1 &= \frac{1}{1 + \sum_{j=1}^3\prod_{k=1}^{j}\gamma_k} \\ &= \frac{1}{1 + \prod_{k=1}^{1}\gamma_k + \prod_{k=1}^{2}\gamma_k + \prod_{k=1}^{3}\gamma_k} \\ &= \frac{1}{1 + \gamma_1 + \gamma_1\gamma_2 + \gamma_1\gamma_2\gamma_3} \\ &= \frac{1}{1 + 2 + \frac{2\cdot 3}{4} + \frac{2\cdot 3 \cdot 0}{4\cdot5}} \\ &= \frac{1}{1 + 2 + \frac{6}{4}}=\frac{2}{9}\\ \end{align*}

(23)

as calculated in Example: Fixation of citation behaviour as an absorbing Markov chain.
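The same calculation can be written as a short function that works for any list of ratios $\gamma_1, \dots, \gamma_{N-1}$. This is a minimal sketch using exact fractions; the name `fixation_probability` is an assumption for illustration.

```python
from fractions import Fraction


def fixation_probability(gammas, i=1):
    """rho_i from gamma_1, ..., gamma_{N-1} using the formula of the theorem."""
    products = []  # products[j - 1] = gamma_1 * ... * gamma_j
    running_product = Fraction(1)
    for gamma in gammas:
        running_product *= gamma
        products.append(running_product)
    numerator = 1 + sum(products[: i - 1])
    denominator = 1 + sum(products)
    return numerator / denominator


# Ratios f_T / f_P for i = 1, 2, 3 preprint citers, read from Table 1.
gammas = [Fraction(6, 3), Fraction(3, 4), Fraction(0, 5)]
rho = [fixation_probability(gammas, i) for i in range(1, 4)]
```

Here `rho` contains $\rho_1 = 2/9$, $\rho_2 = 2/3$ and $\rho_3 = 1$, in agreement with the absorption probabilities found from the Markov chain.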

Exercises¶

Exercise: Moran Process with neutral drift¶

A Moran process with neutral drift is one where $f_k(v)=C$ for all $k$ and all $v$ , for some constant $C$ . In other words: a Moran process with neutral drift is a Moran process where the fitness of all types for all populations is the same.

For a population with 2 types:

Describe the transition probabilities for the Moran process with neutral drift.
Obtain the transition probability matrix for the Moran process with neutral drift with $N=4$ individuals.
Obtain the general formula for $\rho_1$ for a Moran process with neutral drift for general $N$ .

Exercise: Specific fixation probabilities¶

For the following games, assuming the mutant is of the second type, obtain the fixation probability $\rho_1$ for $N=4$ :

$M=\begin{pmatrix}1 & 1 \\ 1 & 1\end{pmatrix}$
$M=\begin{pmatrix}1 & 2 \\ 3 & 1\end{pmatrix}$

Exercise: The effect of fitness¶

Consider the game $M=\begin{pmatrix}r & 1 \\ 1 & 1\end{pmatrix}$ for $r>1$ and general $N$ . Assuming the mutant is of the second type, obtain $\rho_1$ as a function of $r$ . How does $r$ affect the chance of fixation?

Programming¶

Using Nashpy to simulate a Moran process¶

Nashpy has functionality to simulate a single Moran process. Let us create a 3 by 3 game (for a population with 3 types) and an initial population.

import nashpy as nash
import numpy as np

M = np.array(
    (
        (2, 3, 1),
        (4, 1, 2),
        (1, 2, 5),
    )
)
game = nash.Game(M)
initial_population = np.array((0, 0, 0, 1, 1, 1, 2))

Now we run a Moran process. Note that we seed the NumPy pseudo-random number generator, which is used by Nashpy:

np.random.seed(0)
populations = game.moran_process(initial_population=initial_population)
list(populations)

Using Nashpy to approximate fixation probabilities¶

Nashpy can be used directly to approximate the fixation probabilities by repeating a large number of Moran processes:

M = np.array(
    (
        (3, 0),
        (1, 2),
    )
)
game = nash.Game(M)
initial_population = np.array((0, 0, 0, 1))
game.fixation_probabilities(initial_population=initial_population, repetitions=10_000)

This shows that the final population with only 1s in it occurs approximately $2/9\approx 0.22$ of the time.
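As an independent cross-check that does not rely on Nashpy, the fixation probability can also be estimated with a small NumPy simulation that tracks only the number of preprint citers. This is a sketch under the assumptions of the motivating example ( $N=4$ and the payoff matrix of the citation game); the function name `preprint_fixates` and the number of repetitions are illustrative choices.

```python
import numpy as np

A = np.array([[3, 0], [1, 2]])  # type 0 is the textbook, type 1 the preprint
N = 4


def preprint_fixates(rng, i=1):
    """Simulate one Moran process from i preprint citers; True if they fixate."""
    while 0 < i < N:
        f_T = (N - i - 1) * A[0, 0] + i * A[0, 1]
        f_P = (N - i) * A[1, 0] + (i - 1) * A[1, 1]
        copy_P = i * f_P / ((N - i) * f_T + i * f_P)
        # Copying is proportional to fitness, removal is uniform.
        copied_is_P = rng.random() < copy_P
        removed_is_P = rng.random() < i / N
        i += int(copied_is_P) - int(removed_is_P)
    return i == N


rng = np.random.default_rng(0)
repetitions = 20_000
estimate = np.mean([preprint_fixates(rng) for _ in range(repetitions)])
```

With this many repetitions the estimate should lie close to the exact value $2/9$, although the precise figure depends on the seed.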

Notable Research¶

Notable Research in the Moran Process and Population Genetics¶

The Moran process was first introduced in Moran (1958), but it was not the first major model in population genetics. Foundational theoretical work by Ronald Fisher (Fisher, 1930), J.B.S. Haldane (Haldane, 1927; Haldane, 1932), and Sewall Wright (Wright, 1931) laid the mathematical groundwork for understanding evolution, focusing on selection and genetic drift in both infinite and finite populations. These early models often used diffusion approximations or the discrete-generation Wright-Fisher model. In contrast, the Moran process provided an overlapping-generations alternative, in which a single individual is replaced at each step, that allows exact calculation of fixation probabilities and times. For example, Antal and Scheuring (2006) derived precise analytical results within this framework.

The Moran process has become indispensable in evolutionary game theory, where individual fitness depends on strategic interactions. This is central to the theory of evolutionarily stable strategies (Taylor & Jonker, 1978). It has been especially influential in studying social dilemmas, such as the evolution of cooperation. Traulsen and Nowak (2006) showed how cooperation can be favored in finite populations, while Knight et al. (2018) explored how self-recognition algorithms can emerge through such dilemmas under the Moran process.

The process is also crucial for analyzing the role of population structure. A notable extension is the Moran process on graphs, where individuals interact only with their neighbors. This framework was first proposed by Lieberman, Hauert, and Nowak (Lieberman et al., 2005) and further refined by Ohtsuki, Pacheco, and Nowak (Ohtsuki et al., 2007). The Nashpy library (Knight & Campbell, 2018) can be used to simulate Moran processes on such networks.

A final, and remarkable, result is the one proved by Traulsen, Claussen, and Hauert (Traulsen et al., 2005): in the limit of large population size, the Moran process converges to the replicator dynamics equation.

Conclusion¶

The Moran process offers a foundational framework for understanding how strategies evolve in finite populations. Like the replicator dynamics equation, it links fitness to the growth or decline of types over time — but with a critical distinction: it captures the inherent stochasticity of small populations.

In this chapter, we:

Defined the Moran process as a stochastic model of selection and reproduction;
Introduced fixation probabilities, the likelihood that a given type takes over the population;
Showed how the process corresponds to an absorbing Markov chain, enabling exact analysis;
Proved a general formula for fixation probabilities in the case of two types.

These results highlight how even simple stochastic rules can give rise to rich evolutionary behavior. The Moran process provides a tractable yet powerful model that extends beyond biology — from the spread of opinions to the diffusion of technologies.

The key concepts covered in this chapter are summarized in Table 2.

Table 2: Summary of key concepts in the Moran process

Concept	Description
Moran process	A stochastic model of evolution in finite populations
Copy selection probability	Probability of choosing an individual for reproduction, proportional to fitness
Removal selection probability	Probability of choosing an individual for removal, uniform across the population
Absorbing state	A population state in which all individuals are of a single type
Fixation probability	Probability that a given type eventually takes over the population

References¶

Moran, P. A. P. (1958). Random processes in genetics. Mathematical Proceedings of the Cambridge Philosophical Society, 54(1), 60–71.
Fisher, R. A. (1930). The Genetical Theory of Natural Selection. Clarendon Press.
Haldane, J. B. S. (1927). A Mathematical Theory of Natural and Artificial Selection, Part IV. Mathematical Proceedings of the Cambridge Philosophical Society, 23(5), 838–844.
Haldane, J. B. S. (1932). The Causes of Evolution. Longmans, Green.
Wright, S. (1931). Evolution in Mendelian populations. Genetics, 16(2), 97–159.
Antal, T., & Scheuring, I. (2006). Fixation probability and fixation time in the Moran process with two types of individuals. Bulletin of Mathematical Biology, 68(8), 1923–1944.
Taylor, P. D., & Jonker, L. B. (1978). Evolutionary stable strategies and game dynamics. Mathematical Biosciences, 40(1–2), 145–156.
Traulsen, A., & Nowak, M. A. (2006). Evolution of cooperation by kin selection and group selection in finite populations. Proceedings of the National Academy of Sciences, 103(29), 10952–10955.
Knight, V., Harper, M., Glynatsi, N. E., & Campbell, O. (2018). Evolution reinforces cooperation with the emergence of self-recognition mechanisms: An empirical study of strategies in the Moran process for the iterated prisoner’s dilemma. PloS One, 13(10), e0204981.
Lieberman, E., Hauert, C., & Nowak, M. A. (2005). Evolutionary dynamics on graphs. Nature, 433(7023), 312–316.
Ohtsuki, H., Pacheco, J. M., & Nowak, M. A. (2007). Evolutionary Graph Theory: Breaking the Symmetry between Interaction and Replacement. Journal of Theoretical Biology, 246(4), 681–694. 10.1016/j.jtbi.2007.01.024
Knight, V., & Campbell, J. (2018). Nashpy: A Python library for the computation of Nash equilibria. Journal of Open Source Software, 3(30), 904.
Traulsen, A., Claussen, J. C., & Hauert, C. (2005). Coevolutionary dynamics: from finite to infinite populations. Physical Review Letters, 95(23), 238701.
