One-shot deviation principle
One-shot deviation principle
In game-theory, the one-shot deviation principle (also known as the single-deviation property) is a principle used to determine whether a strategy in a sequential-game constitutes a subgame-perfect-equilibrium. An SPE is a nash-equilibrium where no player has an incentive to deviate in any subgame. It is closely related to the principle of optimality in dynamic programming. In simpler terms, if no player can profit (increase their expected payoff) by deviating from their original strategy via a single action (in just one stage of the game), then the strategy profile is an SPE.
The one-shot deviation principle is very important for infinite horizon games, in which the backward-induction method typically doesn't work to find SPE. In an infinite horizon game where the discount factor is less than 1, a strategy profile is a subgame perfect equilibrium if and only if it satisfies the one-shot deviation principle.
Definitions The following is the paraphrased definition from Watson (2013).
To check whether strategy s is a subgame perfect Nash equilibrium, we have to ask every player i and every subgame, if considering s, there is a strategy s’ that yields a strictly higher payoff for player i than does s in the subgame. In a finite multi-stage-game with observed actions, this analysis is equivalent to looking at single deviations from s, meaning s’ differs from s at only one information set (in a single stage). Note that the choices associated with s and s’ are the same at all nodes that are successors of nodes in the information set where s and s’ prescribe different actions.