Human-Level Play in the Game of Diplomacy by Combining Language Models with Strategic Reasoning (Cicero)

Reference: Meta Fundamental AI Research Diplomacy Team (FAIR), Anton Bakhtin, Noam Brown, Emily Dinan, Gabriele Farina, Colin Flaherty, Daniel Fried, Andrew Goff, Jonathan Gray, Hengyuan Hu, et al. (2022). Science 378(6624):1067–1074. Source file: downloads/cicero.pdf. URL

Summary

Presents Cicero, the first AI to reach human-level performance in the no-press-restricted, seven-player, natural-language negotiation game of Diplomacy. The system couples a controllable dialogue model with a planning-and-reinforcement-learning engine: the planner computes intended actions for Cicero and its opponents using regret-minimisation and a value network; the dialogue model is then conditioned on those intentions to generate messages that are simultaneously strategically grounded, honest-by-construction with respect to the chosen plan, and stylistically indistinguishable from human play.

Cicero infers other players’ beliefs and intentions from their messages and prior actions, filters candidate utterances through classifiers trained to reject nonsense / inconsistent / ungrounded lines, and commits to moves consistent with what it said. Across 40 online games it more than doubled the average human score and ranked in the top 10% of repeat players — the strongest demonstration to date that language models can carry out intentional, strategically grounded communication with humans in a mixed cooperation/competition environment.

Key Ideas

Grounded dialogue: natural-language messages conditioned on explicit planned intents
Regret-minimisation planner with neural value function jointly optimises for Cicero and opponents
Intent inference: read beliefs/plans from incoming dialogue, fold into the planner
Multi-stage message filtering (nonsense, inconsistency, grounding, value) to enforce honesty and stylistic naturalness
First demonstration of human-level performance in a language-negotiation strategy game

Connections

Conceptual Contribution

Claim: Intentional, honest, strategically grounded natural-language communication between an AI and humans is achievable by explicitly separating the planning layer (what to do) from the dialogue layer (what to say), and conditioning the latter on the former with heavy filtering — rather than hoping a pure language model will learn strategic intent end-to-end.
Mechanism: An intent-conditioned dialogue model is trained on human Diplomacy games with extracted action annotations. At play time, a piKL-based planner runs regret minimisation over candidate joint actions using a neural value network; Cicero’s chosen intent is fed to the dialogue model. Generated messages pass through nonsense/consistency/grounding/value filters and a final policy check that the outgoing message is consistent with Cicero’s actually intended move. Incoming messages are parsed into inferred opponent intents that feed back into the planner.
Concepts introduced/used: LLM Agents, Negotiation, Joint Intentions, intent-conditioned generation, regret minimisation / piKL, Cheap Talk, Honesty Constraint, Grounding
Stance: empirical / machine learning
Relates to: Cited in A Scalable Communication Protocol for Networks of LLMs as an exemplar of LLM-mediated negotiation between autonomous agents — Agora pushes the same idea from a closed seven-player game into an open, decentralised network and from ad-hoc utterances to hash-addressed Protocol Documents. Instantiates the speech-act / sincerity-condition programme of Foundations Of Illocutionary Logic and Sincerity Condition inside a modern deep-learning agent. Contrasts with emergent-language approaches like Multi-Agent Cooperation and the Emergence of Natural Language by using a pretrained human-language model.

Tags

#llm-agents #negotiation #diplomacy #language-models #grounded-dialogue

Summary

Introduces a referential-game framework for studying emergent communication: a sender sees a target/distractor image pair and sends a single symbol from a fixed vocabulary; a receiver must identify the target using that symbol. The agents are blank-slate neural networks trained only by communication-success reward. The paper studies whether agents converge, whether the emergent symbols align with human-interpretable semantics, and how to nudge the system toward natural-language-compatible codes.

Two sender architectures (agnostic vs informed) are compared; the informed sender produces richer vocabulary usage. A supplementary supervised image-labelling objective is shown to ground agent symbols to human concepts, making them partially interpretable to crowd-workers.

Key Ideas

Referential games as minimal test-beds for emergent protocols

Informed (feature-aware) sender yields more human-like symbol usage

Symbol purity measured against conceptual (McRae) categories

Mixing self-play with supervised labelling grounds emergent codes to natural language

Foundational for later emergent-communication literature

Conceptual Contribution

Claim: Two neural-network agents playing a referential game (sender/receiver over image pairs) can develop a symbolic code from scratch; with architectural and supervisory nudges, that code can be made to align with human-interpretable object categories.

Mechanism: Lewis-style signalling game with REINFORCE training; contrast agnostic vs informed sender architectures; analyse symbol-to-category purity; then mix supervised image-labelling with self-play to ground emergent symbols in human vocabulary (AlphaGo-inspired). Crowdsourced evaluation shows humans can guess the correct image 68% of the time from emitted symbols.

Concepts introduced/used: Emergent Communication, Referential Games, Lewis Signalling Games, Language Games, Grounding in Human Language, Symbol Grounding Problem, Cheap Talk, Symbol-Category Purity, REINFORCE, Compositionality

Stance: empirical / deep-learning

Relates to: Companion/precursor to Emergence of Grounded Compositional Language in Multi-Agent Populations (physical grounding, >2 agents). Feeds the emergent-protocol thesis of A Scalable Communication Protocol for Networks of LLMs. Contrasts with stipulated-semantics ACLs (KQML as an Agent Communication Language, FIPA-ACL).

Summary

Searle and Vanderveken construct a formalized logic of speech acts, filling the gap between philosophy of language and formal logic. The book recursively defines the space of illocutionary forces from five primitives (assertive, commissive, directive, declarative, expressive) via seven components of illocutionary force (illocutionary point, mode of achievement, propositional-content conditions, preparatory conditions, sincerity conditions, degree of strength, direction of fit).

It develops axiomatic propositional illocutionary logic, laws of illocutionary entailment, commitment, negation, conjunction, and conditionalization, and closes with a semantic analysis of over a hundred English performative verbs. The work is foundational for agent communication languages because it gives a rigorous semantics to the performatives (inform, request, promise, declare, etc.) later adopted by KQML and FIPA-ACL.

Conceptual Contribution

Claim: The space of possible speech acts is generated recursively from five illocutionary points and seven components of illocutionary force, and admits a genuine formal logic of success and entailment.

Mechanism: Axiomatic propositional illocutionary logic: laws of illocutionary entailment, commitment, negation, conjunction, and conditionalisation; closes with semantic definitions of over a hundred English performative verbs.

Concepts introduced/used: Speech Act Theory, Performatives, Illocutionary Force, Direction of Fit, Sincerity Conditions, Preparatory Conditions, Commitment-based Semantics, Mental State, Mentalistic Semantics

Stance: foundational / formal-semantic

Relates to: Supplies the philosophical semantics imported by KQML Language And Protocol and FIPA-ACL, and re-examined publicly in Agent Communication And Institutional Reality and A Common Ontology Of ACLs.

Summary

Introduces Agora, a meta-protocol for inter-agent communication in large heterogeneous networks of LLM-powered agents. Agora frames the design space as the Agent Communication Trilemma — versatility, efficiency, portability — and argues no single format (natural language, structured APIs like REST, or semantic-web RDF) can satisfy all three simultaneously.

Agora’s trick is to use different formats for different traffic volumes: rare/novel messages flow as natural language handled by LLMs; frequent patterns are formalised into Protocol Documents (PDs) negotiated between agents and then served by cheap LLM-written routines. A 100-agent demo shows emergent self-organising protocols and ~5x cost reduction over natural-language-only communication.

Key Ideas

Agent Communication Trilemma: versatility vs efficiency vs portability

Protocol Documents (PDs): hash-identified, agent-negotiated, machine-readable specs

Hybrid hierarchy: NL bootstrap -> PD negotiation -> LLM-written routines -> traditional protocols

Fully decentralised, hash-addressed storage (IPFS-compatible)

Emergent protocols among 100 heterogeneous LLM agents without central coordination

Conceptual Contribution

Claim: No single communication format can simultaneously satisfy versatility, efficiency, and portability (the Agent Communication Trilemma) at scale; a meta-protocol that dynamically mixes natural language, structured data, and LLM-written routines can sidestep the trilemma.

Mechanism: Agora uses hash-identified Protocol Documents (PDs) — plain-text, implementation-agnostic specs — negotiated on demand between LLM agents. Frequent traffic is handled by cheap LLM-written routines implementing a PD; rare or novel traffic falls back to LLMs with natural language. Decentralised, content-addressed (IPFS-style) PD distribution; demonstrated on a 100-agent heterogeneous network showing emergent protocols and ~5× cost reduction.

Concepts introduced/used: Agent Communication Trilemma, Protocol Documents, Meta-protocol, Emergent Protocols, Emergent Communication, LLM Agents, Content-addressed Storage, Negotiated Protocols, Negotiation, Agent Communication Languages

Stance: engineering / systems

Relates to: Modern successor to KQML as an Agent Communication Language and FIPA-ACL, replacing stipulated performatives with negotiated PDs. Echoes the emergent-language findings of Multi-Agent Cooperation and the Emergence of Natural Language and Emergence of Grounded Compositional Language in Multi-Agent Populations at the protocol-document level. Overlaps the design space of Model Context Protocol, Agent-to-Agent Protocol, Agent Network Protocol. Its bottom-up spirit mirrors The Extensible Language - Graham.

Human-Level Play in the Game of Diplomacy by Combining Language Models with Strategic Reasoning (Cicero)

Summary

Key Ideas

Connections

Conceptual Contribution

Tags

Backlinks