Emergence of Grounded Compositional Language in Multi-Agent Populations

Reference: Igor Mordatch and Pieter Abbeel (2018). AAAI-18. Source file: 11492-13-15020-1-2-20201228.pdf. URL

Summary

Proposes a multi-agent environment and a differentiable training procedure in which agents develop an abstract, compositional communication protocol purely from the need to coordinate on non-linguistic goals (move-to-location, look-at, etc.) in a 2D world with landmarks. Alongside physical actions, agents emit streams of discrete symbols, relaxed with Gumbel-Softmax so that gradients can pass through the communication channel. The symbols acquire stable, interpretable meanings corresponding to goal types, landmarks, and agent identities.
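The Gumbel-Softmax relaxation mentioned above can be illustrated with a minimal forward-pass sketch. This is not the authors' code: the function names, the temperature value, and the example logits are all hypothetical, and plain Python is used only to show the sampling arithmetic. An actual training setup would run the same computation inside an autodiff framework so that gradients reach the logits.

```python
import math
import random


def gumbel_softmax_sample(logits, temperature=1.0, rng=random):
    """Return a relaxed (soft) one-hot sample over symbols.

    logits: unnormalized scores, one per vocabulary symbol.
    temperature: assumed hyperparameter; lower values give samples
    closer to one-hot, higher values give smoother samples.
    """
    noisy = []
    for logit in logits:
        # Clamp u away from 0 and 1 so both logarithms are finite.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        gumbel = -math.log(-math.log(u))  # Gumbel(0, 1) noise
        noisy.append((logit + gumbel) / temperature)
    # Numerically stable softmax over the perturbed logits.
    peak = max(noisy)
    exps = [math.exp(x - peak) for x in noisy]
    total = sum(exps)
    return [e / total for e in exps]


def to_one_hot(probs):
    """Discretize a soft sample to the symbol that is actually emitted."""
    k = max(range(len(probs)), key=probs.__getitem__)
    return [1.0 if i == k else 0.0 for i in range(len(probs))]


if __name__ == "__main__":
    random.seed(0)
    soft = gumbel_softmax_sample([2.0, 0.5, -1.0], temperature=0.5)
    print("soft sample:", soft)
    print("emitted symbol (one-hot):", to_one_hot(soft))
```

The soft sample is what a gradient-based learner would differentiate through; the one-hot version is the discrete symbol other agents would observe.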

The emergent language exhibits syntactic ordering. Vocabulary size is regularized during training by a Dirichlet-process-inspired penalty, and when symbols are disabled, agents fall back on non-verbal strategies (pointing, guiding). The paper is a foundational piece in the modern “emergent communication” literature.
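To make the vocabulary-size penalty concrete, here is a rough sketch under a stated assumption: that the penalty is a log-likelihood of the uttered symbols under "rich get richer" probabilities of the form p(c_k) = n_k / (alpha + n - 1), where n_k is how often symbol c_k was used and n is the total number of utterances. The exact form and the way counts are accumulated should be checked against the paper; the function name, `alpha`, and the use of counts from a single batch are illustrative choices, not the authors' implementation.

```python
import math
from collections import Counter


def vocabulary_reward(utterances, alpha=1.0):
    """DP-style reward: higher when usage concentrates on few symbols.

    utterances: flat list of emitted symbols (any hashable values).
    alpha: assumed concentration hyperparameter.
    """
    counts = Counter(utterances)
    n = len(utterances)
    reward = 0.0
    for n_k in counts.values():
        # Assumed rich-get-richer score for a symbol used n_k times.
        p_k = n_k / (alpha + n - 1)
        # Each of the n_k uses of this symbol contributes log p_k.
        reward += n_k * math.log(p_k)
    return reward


if __name__ == "__main__":
    concentrated = ["GOTO", "GOTO", "GOTO", "GOTO"]
    spread = ["s1", "s2", "s3", "s4"]
    print("concentrated:", vocabulary_reward(concentrated))
    print("spread:", vocabulary_reward(spread))
```

Under this form, reusing an already popular symbol is penalized less than introducing a new one, which is the pressure toward a small vocabulary that the summary describes.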

Key Ideas

  • Communication emerges from cooperative multi-agent RL with shared reward
  • Gumbel-Softmax makes discrete symbol channels differentiable
  • Vocabulary-size penalty (DP-style) prevents symbol proliferation and encourages compositionality
  • Non-verbal channels (gaze, position) substitute when the verbal channel is unavailable
  • Symbols ground to concepts: GOTO, color, landmark/agent identity

Connections

Conceptual Contribution

Tags

#emergent-communication #multi-agent-rl #grounded-language #compositionality

Backlinks