On the Pitfalls of Measuring Emergent Communication

Reference: Lowe, Foerster, Boureau, Pineau, Dauphin (2019). AAMAS 2019 (Proc. 18th Intl. Conf. on Autonomous Agents and Multiagent Systems). Source file: p693.pdf. URL

Summary

This paper critically examines metrics used to detect and measure emergent communication in multi-agent reinforcement learning. The authors show that commonly used indicators — such as speaker consistency (SC), context independence (CI), mutual information between messages and actions, and message-entropy — can be misleading: agents trained with a communication channel that does not influence their behavior may still exhibit high values on these metrics, producing the illusion of communication.

To disentangle the phenomenon, they propose decomposing communication into positive signaling (messages carry information about a speaker’s observations) and positive listening (messages influence a listener’s subsequent actions). They introduce causal influence of communication (CIC), a causal-intervention-based metric measuring how an agent’s message changes another agent’s action distribution, and demonstrate its properties on matrix communication games (MCGs). They offer concrete recommendations for when each metric should be trusted.

Key Ideas

  • Speaker consistency can be positive even when no communication happens.
  • Separate positive signaling from positive listening.
  • Causal influence of communication (CIC) via interventions on messages.
  • Matrix Communication Games as a minimal testbed.
  • Entropy-based metrics are shape-dependent and deceptive.

Connections

Conceptual Contribution

Tags

#emergent-communication #multi-agent-rl #metrics #deep-learning

Backlinks