Metamagical Themas — The Prisoner's Dilemma Computer Tournaments and the Evolution of Cooperation (May, 1983)

May 1, 2023

5 minutes read

I Recently revisited Metamagical Themas & picked up from where I stopped [It is not a material you can -or frankly should- consume at once]. As with the previous metamagical themas post I should add that This blog post is a [barely] refined collection of my [AI-assisted] study notes, similar in style to my Voidpaper newsletter where I publish notes on other people’s research - the ideas are not my original ideas. I should also add that while this passage can be interepreted in other contexts, these notes are within the context of Artificial Intelligence - as that is the general them of the book.

As Always, deeply fascinated by the ideas presented within the book especially considering how early these ideas were.

The Prisoner’s Dilemma is a game-theoretical model that illustrates the tension between individual self-interest and collective cooperation.
- Imagine that two people are arrested and held in separate cells. They are both suspected of a crime, but the police don’t have enough evidence to convict them. The police offer each person a deal: if one of them confesses and cooperates with the police by testifying against the other person, they will receive a reduced sentence. If neither person confesses, they will both be released due to lack of evidence. If both people confess, they will both receive a moderate sentence. In this situation, each person must decide whether to cooperate with the other person by remaining silent or to defect by confessing and testifying against the other person. The dilemma arises because each person’s optimal strategy depends on what the other person does. If both people cooperate and remain silent, they will both be released. However, if one person defects and confesses, they will receive a reduced sentence, while the other person will receive a harsher sentence. If both people defect and confess, they will both receive a moderate sentence, but this outcome is worse for both people than if they had both remained silent. Thus, the dilemma is that each person’s optimal strategy is to defect, even though both people would be better off if they both cooperated. This tension between individual self-interest and collective cooperation is at the heart of the Prisoner’s Dilemma and has important implications for decision-making in a wide range of contexts.
The study of the Prisoner’s Dilemma and related games could provide useful insights into the ethical and moral implications of AI systems that interact with humans, especially in contexts such as healthcare, finance, and criminal justice.
In computer tournaments that simulate repeated iterations of the Prisoner’s Dilemma, the Tit for Tat strategy has been shown to be successful in promoting cooperation.
- The Tit for Tat strategy is a simple but effective approach to the Prisoner’s Dilemma and other game-theoretical models. In the Tit for Tat strategy, a player initially cooperates and then copies the opponent’s previous move in subsequent rounds. For example, in the context of the Prisoner’s Dilemma, if the opponent cooperates in the first round, the Tit for Tat player will also cooperate in the first round. If the opponent defects in the second round, the Tit for Tat player will also defect in the second round. However, if the opponent cooperates again in the third round, the Tit for Tat player will also cooperate in the third round, and so on. The idea behind the Tit for Tat strategy is that it is a reciprocal strategy that rewards cooperation and punishes defection. By initially cooperating, the Tit for Tat player sends a signal of goodwill and encourages the opponent to cooperate as well. However, if the opponent defects, the Tit for Tat player responds in kind, sending a signal that defection will not be tolerated.
Studies have shown that the Tit for Tat strategy is highly effective in promoting cooperation in the Prisoner’s Dilemma and other game-theoretical models, especially when the games are repeated over multiple rounds. The strategy has also been used as a basis for developing more sophisticated approaches to decision-making and cooperation in the context of artificial intelligence and machine learning.
The success of the Tit for Tat strategy could be seen as a validation of the idea that AI systems should be designed to mimic human behavior and decision-making, rather than relying on abstract mathematical models.
The success of the Tit for Tat strategy suggests that trust and reciprocity are key elements of successful cooperation, which could inform the design of AI systems that are more trustworthy and transparent, and ultimately help to create more intelligent and cooperative machines.
The author however, notes that the Prisoner’s Dilemma and similar games have limitations as models for real-life situations, particularly when it comes to the number of players, rounds of play, and the possibility of communication and negotiation.
AI systems that are designed to interact with humans should take into account the limitations of game-theoretical models and develop more sophisticated and nuanced approaches to decision-making.
The author argues that the success of the Tit for Tat strategy in the computer tournaments is not necessarily a blueprint for successful human-AI interaction, and that more research is needed to determine what approaches are most effective in different contexts.
A broader societal conversation about the implications of AI is necessary, and policymakers, industry leaders, and the public should be actively engaged in shaping the development of AI in a way that aligns with societal values and priorities.

Back to posts