The price collusion trap · game-theorist

Players

Meridian Air incumbent carrier, transatlantic route, high-margin position
Vantage Air challenger carrier, same route, cost advantage

Game type: Simultaneous, 2 players, complete information, one-shot (with repeated-game extension)
Lead framework: Dominant strategy elimination, Nash equilibrium in pure strategies

Terms in plain English

Read this first if the theory words are new. The analysis uses a few technical terms, but the ideas underneath are simple.

Player: Someone whose choices can change the outcome: a company, person, regulator, buyer, or rival.
Strategy: A move a player can make. Raise price, hold price, cooperate, defect, wait, signal, commit.
Payoff: What a player gets from an outcome. The number means preference, not always money.
Dominant strategy: A move that beats the alternatives no matter what the other player does.
Nash equilibrium: An outcome where no player can improve their position by changing their move alone.
Repeated game: The same players meet again. Reputation, punishment, and trust start to matter.

Part I · Phases 1 to 2

The structure of the trap

Before a single ticket is priced, the payoff geometry is fixed. Understanding why defection dominates cooperation is the whole analysis.

Phase 1

Deconstruction

Two carriers share a single transatlantic route with no ability to observe or communicate each other's pricing decision before committing. Meridian Air is the incumbent: it built its load factors and yield management around a high-margin fare structure accumulated over years. Vantage Air entered the route with newer aircraft and lower unit costs, carrying a structural incentive to grow market share rapidly. Neither carrier can collude legally. Both must set fares simultaneously, without knowledge of the other's choice.

Each carrier has exactly two strategies. Cooperate means holding price at the jointly profitable level, roughly $600 per seat on this corridor. Defect means undercutting, dropping fares to around $480 to attract price-sensitive passengers from the other carrier. Cooperate for Meridian means publishing and holding high fares even when Vantage could be poaching its passengers. Defect for Vantage means publishing a sale price the morning that Meridian's schedule goes live.

The payoff structure satisfies the four conditions of a prisoner's dilemma. Temptation T = 80 (defect while the other cooperates). Reward R = 60 (mutual cooperation). Punishment P = 40 (mutual defection). Sucker S = 20 (cooperate while the other defects). The ordering T > R > P > S holds strictly. The joint efficiency condition 2R > T + S also holds: 120 > 100. Mutual cooperation is Pareto superior to mutual defection, yet neither carrier can reach it unilaterally.

This is not a coordination game, where both parties want the same outcome and merely need a signal. Both players prefer defection regardless of what the other does. The geometry of the payoffs, not a failure of communication, is what makes this a trap.

Phase 2

Incentive Mapping

Walk through Meridian's reasoning first, then Vantage's, and observe that the conclusion is identical. Meridian considers two scenarios. If Vantage cooperates, Meridian earns 60 by cooperating or 80 by defecting: defect. If Vantage defects, Meridian earns 20 by cooperating or 40 by defecting: defect again. Whatever Vantage does, Meridian is better off defecting. The word for this is a dominant strategy: defection strictly dominates cooperation for Meridian across every possible state of the world.

Vantage's calculation is a mirror image. If Meridian cooperates, Vantage earns 60 by cooperating or 80 by defecting: defect. If Meridian defects, Vantage earns 20 by cooperating or 40 by defecting: defect. Defection is also a dominant strategy for Vantage. The logic is not a matter of distrust or pessimism. Even if each carrier were certain the other would cooperate, the correct individual response is still to defect.

This is the core of the dilemma: rational individual reasoning, applied symmetrically, produces an outcome both parties would prefer to avoid. The combined payoff under mutual cooperation is 120. Under mutual defection it is 80. Forty units of joint value are destroyed not by irrationality but by the incentive structure itself.

Figure 1 · Payoff matrix

Nash equilibrium · (Defect, Defect) Other strategy combinations

Figure 1. The payoff matrix. Mutual cooperation (60, 60) is Pareto optimal but not the equilibrium. Each carrier earns more by defecting regardless of what the other does. The Nash equilibrium (ringed in green) yields only 40 each.

Part II · Phases 3 to 4

The strategy space and the equilibrium

With incentives mapped, the strategy space reduces almost immediately. The equilibrium is not where either player wants to be.

Phase 3

Strategy Space

Four strategy combinations exist; dominance eliminates half the space before any equilibrium reasoning is needed. List all outcomes, label the payoffs, and apply strict dominance. A strategy is strictly dominated if there exists another strategy that yields a higher payoff in every scenario the opponent can play. Here, Cooperate is strictly dominated for both players. The table below captures the full space and marks what survives.

Strategy combination	Meridian payoff	Vantage payoff	Notes
Cooperate, Cooperate	60	60	Pareto optimal, jointly best, but individually unstable: each carrier has a unilateral incentive to deviate.
Cooperate, Defect	20	80	Meridian is the sucker: it holds price while Vantage captures market share. The worst outcome for Meridian.
Defect, Cooperate	80	20	Vantage is the sucker. Individually rational for Meridian if Vantage cooperates, but Vantage has no reason to cooperate.
Defect, Defect	40	40	The only combination where neither player can improve by switching unilaterally. This is the Nash equilibrium.

The dominance argument runs as follows. Cooperate yields 60 if the opponent cooperates and 20 if the opponent defects. Defect yields 80 if the opponent cooperates and 40 if the opponent defects. Since 80 > 60 and 40 > 20, Defect strictly dominates Cooperate for every possible action of the opponent. This holds symmetrically for both carriers. Iterated elimination of strictly dominated strategies leaves a single cell: (Defect, Defect).

No mixed-strategy equilibrium needs to be considered. Strict dominance is a stronger condition than Nash equilibrium: it does not require a belief about what the other player will do. Each player defects because defection is better regardless of the opponent's choice, not because they predict defection.

Defection is individually rational and collectively catastrophic.
The prisoner's dilemma, canonical form

Phase 4

Equilibrium

The unique Nash equilibrium is (Defect, Defect), yielding payoffs of 40 to each carrier. Neither player can improve by deviating unilaterally: if Meridian switches from Defect to Cooperate while Vantage holds at Defect, Meridian drops from 40 to 20. The same applies to Vantage. The equilibrium is self-enforcing in the sense that no player has an incentive to move away from it given the other's strategy.

The tragedy is visible in the numbers. Mutual cooperation would deliver 60 to each carrier, a total joint payoff of 120. Mutual defection delivers 40 to each, a total of 80. Forty units of value are destroyed at equilibrium relative to the cooperative outcome. This gap, 120 minus 80, is the cost of the dilemma's structure. The equilibrium is stable but inefficient: it holds because no one can defect from it profitably, but everyone would be better off if both had agreed to cooperate.

This distinguishes the prisoner's dilemma from other strategic failures. In a pure coordination problem, players want the same outcome but lack a common signal. Here, both players fully understand the payoffs, both prefer (60, 60), and both still end up at (40, 40). The problem is not information or coordination: it is the geometry of the incentives. Individually rational behaviour produces a collectively irrational result.

Part III · Phases 5 to 6

Breaking the dilemma

The one-shot dilemma is a trap; the repeated game is a negotiation. The path out runs through repetition, credible commitment, and observable punishment.

Phase 5

Recommendation

In a one-shot game, the recommendation is unambiguous: Defect. This is not a moral statement. It is the only choice consistent with rational self-interest when the game is played once with no history, no future interaction, and no binding agreement. Any carrier that cooperates in a one-shot prisoner's dilemma against a rational opponent is leaving money at the gate.

The situation changes entirely if the two carriers will keep competing on this route. Both Meridian and Vantage will be pricing this corridor for years. The one-shot model is a simplification: the real game is repeated, indefinitely or with uncertain endpoint. In a repeated game, cooperation can be sustained if each player values future payoffs highly enough relative to the short-term gain from defection.

The condition for sustainable cooperation under a tit-for-tat protocol is: the discount factor delta must exceed (T minus R) divided by (T minus P). Using the payoffs above: (80 minus 60) divided by (80 minus 40) = 20 divided by 40 = 0.5. If each carrier values future periods at more than 50 cents per present dollar, cooperation is self-sustaining without any communication or agreement. Given that both carriers operate on long planning horizons with route licences and fleet commitments, the discount factor on this corridor is almost certainly above 0.5.

The practical recommendation for the repeated game: signal willingness to hold price through observable actions rather than explicit communication. Published fare schedules with transparent pricing logic are observable. Announce high fares early and hold them. If the other carrier defects, match the defection immediately and visibly. Return to high pricing only after the opponent returns. This is tit-for-tat in practice: cooperative, retaliatory, forgiving, and transparent. It requires no communication between the carriers, which is the critical constraint given antitrust law.

One hard constraint governs all of this: any explicit agreement between Meridian and Vantage on pricing is illegal price-fixing in every jurisdiction where this route operates. The recommendation concerns unilateral signalling and behavioural strategies only. The difference between sustainable implicit cooperation and cartel behaviour is the difference between two carriers independently arriving at the same rational equilibrium and two carriers coordinating to reach it. The former is legal; the latter is not.

Phase 6

Dynamic Adaptation

Four triggers can alter the equilibrium, each operating through a different mechanism. The prisoner's dilemma is a structure, not a fixed outcome. When the parameters shift, the equilibrium shifts with them. Tracking these triggers is the work of dynamic adaptation.

Entry of a third carrier changes the payoff matrix and makes cooperation harder to sustain. With two players, a tit-for-tat protocol is clear: punish the defector, reward the cooperator. With three players, the punishment calculus becomes ambiguous. If a third carrier enters this route at a discount price, Meridian and Vantage face pressure to match it regardless of their bilateral tacit agreement. N-player prisoner's dilemmas are harder to escape than two-player versions: the required discount factor rises with the number of players, and the temptation to defect against a partial cooperator is ever-present.

A demand shock, such as a fuel cost spike or a sharp drop in travel demand from a health crisis, shifts the absolute payoff values but does not necessarily alter the dominance structure. If T > R > P > S still holds after the shock, both carriers still have dominant strategies to defect. The dilemma persists at lower absolute values. The practical risk is that a demand shock compresses margins to the point where the cooperative payoff R drops below P for one carrier, at which point that carrier has no incentive to return to cooperation even in a repeated game.

The finite horizon destroys cooperation through backward induction. If both carriers know this route will be discontinued in twelve months, consider what happens in the final period. In the last period, there is no future to value: defect is dominant, exactly as in the one-shot case. Both carriers know this. In the penultimate period, both know the other will defect in the last period, so the future is worthless from the penultimate period's perspective: defect again. This logic unravels all the way to the first period. A known endpoint converts any repeated game back into a one-shot game, eliminating the discount-factor mechanism that sustains cooperation.

Regulatory scrutiny changes the effective payoffs of signalling and punishment. If antitrust authorities are actively monitoring the route, the legal risk of observable tit-for-tat behaviour increases. Carriers may be unable to punish defection visibly without risking an investigation. This raises the effective cost of the cooperation-sustaining strategy, which lowers the effective discount factor and may push the equilibrium back towards persistent mutual defection. The irony is that stronger antitrust enforcement, intended to benefit consumers, can destabilise tacit cooperation and drive carriers back into a price war that is legal but collectively destructive.

Lesson

The game teaches that rational individual actors can produce outcomes that are collectively worse than what coordination would have achieved. The exit from the one-shot trap is not goodwill but structure: binding agreements where they are legal, repeated interaction that makes the future valuable, and the credible shadow of punishment that makes defection too costly to choose.