In economics and game theory, Bayesian persuasion occurs when one participant (the sender) wants to persuade the other (the receiver) of a certain course of action. There is an unknown state of the world, and the sender must commit to a decision of what information to disclose to the receiver. Upon seeing said information, the receiver will revise their belief about the state of the world using Bayes' rule and select an action. Bayesian persuasion was introduced by Emir Kamenica and Matthew Gentzkow.
Bayesian persuasion is a special case of a principalâÂÂagent problem: the principal is the sender and the agent is the receiver. It can also be seen as a communication protocol, comparable to signaling games; the sender must decide what signal to reveal to the receiver to maximize their expected utility. It can also be seen as a form of cheap talk, where the sender has the commitment power.
A medical company (the sender) produces a new medicine, and needs the approval of the regulator (the receiver). There are two possible states of the world: the medicine can be either "good" or "bad". The company and the regulator do not know the true state. However, the company can run an experiment and report the results to the regulator. The question is what experiment the company should run in order to get the best outcome for themselves. The assumptions are:
For example, suppose the prior probability that the medicine is good is 1/3 and that the company has a choice of three actions:
In this case, the third policy is optimal for the sender since this has the highest expected utility of the available options. Using the Bayes rule, the sender has persuaded the receiver to act in a favorable way to the sender.
The basic model has been generalized in a number of ways, including:
The applicability of the model has been assessed in a number of real-world contexts:
Algorithmic techniques have been developed to compute the optimal signalling scheme in practice. This can be found in polynomial time with respect to the number of actions and pseudo-polynomial time with respect to the number of states of the world. Algorithms with lower computational complexity are also possible under stronger assumptions.
The online case, where multiple signals are sent over time, can be solved efficiently as a regret minimization problem.