Expand ↗
Page list (942)

Log-linear Learning

A noisy best-response dynamic in which each agent selects actions with probability proportional to exp(utility/T). As the temperature T anneals, play concentrates on potential-maximising equilibria, giving convergence guarantees under bounded communication failure.

In this vault

Last changed by zetl · stable 5d · history

Backlinks