Log-linear Learning

A noisy best-response dynamic in which each agent selects actions with probability proportional to exp(utility/T). As the temperature T anneals, play concentrates on potential-maximising equilibria, giving convergence guarantees under bounded communication failure.

In this vault

Backlinks