Monday, 4 May 2026

Zero Sum Games & Game Theory



Nash’s 1951 paper “Non-Cooperative Games” generalized the earlier von Neumann-Morgenstern framework (which was built around zero-sum and cooperative games) to non-cooperative games—situations where players cannot make enforceable agreements. It applies to both zero-sum and non-zero-sum games and introduced the Nash equilibrium as the central solution concept. His contribution is broader in scope.


Grok AI and I started with a simple question about betrayal and ended up dissecting the Prisoner's Dilemma (PD) and game theory's claimed real-world utility. What began as etymology of "betray" (Latin tradere, "hand over") quickly revealed the PD's core: two isolated prisoners, pre-agreed silence on a joint crime, cooperate (silent) or defect (confess). Mutual cooperation yields light sentences; unilateral betrayal yields freedom for the betrayer and heavy for the other; mutual betrayal yields medium sentences. Rational self-interest makes defection dominant.

We tested supposed real-world applications for PD-like simplicity or logical consistency:


  • Battle of the Bismarck Sea (1943): Kenney used a payoff matrix to guess Japanese convoy routes. Retrospective minimax analysis but predates von Neumann-Morgenstern (1944). Real event had incomplete information, weather, and command friction the tree ignores.
  • Cuban Missile Crisis (1962): Schelling's brinkmanship (threats leaving something to chance) is often cited. But ExComm deliberations relied on back-channel RFK-Dobrynin diplomacy and secret Turkey missile deal. Post-rationalized game theory; actual resolution was pragmatic negotiation, not PD tree logic.
  • Vickrey/second-price auctions (theory and FCC spectrum auctions 1994+): Rules make truthful bidding dominant. Multiple rounds, unknown valuations, bidder collusion risks, and regulatory complexity make it far messier than PD's fixed one-shot payoffs.
  • Plea bargaining with co-defendants: Pre-agreed conspiracy silence meets prosecutorial deals. Real consequences (retaliation, reputation, violence) and repeated interactions destroy the PD's isolation. Authorities' efficacy claims are post-hoc; actual outcomes vary wildly by jurisdiction and enforcement.


Every example collapses under interrogation. PD assumes a truncated, prearranged non-agreement (silence pact) parading as cooperation is a logic that unveils itself as unstable. Real life adds reputation, future retaliation, incomplete information, emotions, and enforcement—factors the model excludes by design. No published use of game theory has delivered pre-hoc predictive power or historically accurate forecasts that weren't retrofitted after the fact. It works beautifully in abstract games and designed mechanisms with enforced rules. In real life—nuclear crises, criminal pacts, or betrayal—it's a post-rationalized lens, not a predictive tool. The voyage showed potential limits on the theory's limits: elegant for isolated decisions, but demonstrably inapplicable when the world refuses to stay on the decision tree.


Grok wrote this post with my guidance and interrogation shaping every line. It took a conversation of nearly 50,000 words to produce this approximately 700 words piece. The point I’m making is that the process demanded far more preparation than if I had written it myself. What feels genuinely revolutionary is that all that time was spent diving deeper and deeper into the subject, yielding more substantive writing that is far more rewarding for everyone involved — including sharpening my own thinking.


Let me briefly clear up the Neumann–Nash distinction. Von Neumann labelled his outcomes zero-sum because, at any poker table, the winner’s gains plus the loser’s losses must logically add up to zero. This isn’t intuitive — we normally talk about individual winnings, not combined net figures. A better description might be “winner/loser balance of zero.” Nash then took von Neumann’s zero-sum framework and broadened it to non-zero-sum games. He also developed the Prisoner’s Dilemma. Yet he still titled his paper “Non-Cooperative Games,” a name that doesn’t fully convey the nuance that even situations built on agreed expectations can remain fundamentally non-cooperative.


Anyway, the theory feels less shiny now, but I still need a clear summary of Nash’s equilibrium strategy as applied to gambling for the final stretch.