IB Optimisation - Non-cooperative games

4Non-cooperative games

IB Optimisation

4.2 The minimax theorem

There is a special type of game known as a zero sum game.

Definition (Zero-sum game). A bimatrix game is a zero-sum game, or matrix

game, if q

= −p

for all i, j, i.e. the total payoff is always 0.

To specify a matrix game, we only need one matrix, not two, since the matrix

of the other player is simply the negative of the matrix of the first.

Example.

The rock-paper-scissors games as specified in the beginning example

is a zero-sum game.

Theorem (von Neumann, 1928). If P ∈ R

m×n

. Then

max

x∈X

min

y∈Y

p(x, y) = min

y∈Y

max

x∈X

p(x, y).

Note that this is equivalent to

max

x∈X

min

y∈Y

p(x, y) = − max

y∈Y

min

x∈X

−p(x, y).

The left hand side is the worst payoff the row player can get if he employs the

minimax strategy. The right hand side is the worst payoff the column player

can get if he uses his minimax strategy.

The theorem then says that if both players employ the minimax strategy,

then this is an equilibrium.

Proof.

Recall that the optimal value of

max min p

(

x, y

) is a solution to the linear

program

maximize v such that

i=1

≥ v for all j = 1, · · · , n

i=1

= 1

x ≥ 0

Adding slack variable z ∈ R

with z ≥ 0, we obtain the Lagrangian

L(v, x, z, w, y) = v +

j=1

i=1

− z

− v

− w

i=1

− 1

where w ∈ R and y ∈ R

are Lagrange multipliers. This is equal to





1 −

j=1





v +

i=1





j=1

− w





−

j=1

+ w.

This has finite minimum for all

v ∈ R, x ≥

0 iff

= 1,

≤ w

for all

and y ≥ 0. The dual is therefore

minimize w subject to

j=1

≤ w for all i

j=1

= 1

y ≥ 0

This corresponds to the column player choosing a strategy (

) such that the

expected payoff is bounded above by w.

The optimum value of the dual is

min

y∈Y

max

x∈X

(

x, y

). So the result follows from

strong duality.

Definition (Value). The value of the matrix game with payoff matrix P is

v = max

x∈X

min

y∈Y

p(x, y) = min

y∈Y

max

x∈X

p(x, y).

In general, the equilibrium are given by

Theorem.

(

x, y

)

∈ X × Y

is an equilibrium of the matrix game with payoff

matrix P if and only if

min

∈Y

p(x, y

) = max

∈X

min

∈Y

p(x

, y

)

max

∈X

p(x

, y) = min

∈Y

max

∈X

p(x

, u

)

i.e. the x, y are optimizers for the max min and min max functions.

Proof is in the second example sheet.