Inter-Agent Trust Models: A Comparative Study of Brief, Claim, Proof, Stake, Reputation and Constraint in Agentic Web Protocol Design

Reference: Botao ‘Amber’ Hu & Helena Rong (2025). arXiv:2511.03434v1 (University of Oxford; NYU Shanghai). Source file: 2511.03434v1.pdf. URL

Summary

As an “agentic web” takes shape — billions of LLM-powered agents autonomously transacting and collaborating — trust shifts from human oversight to protocol design. This paper identifies and compares six primitive trust models used (implicitly or explicitly) across recent agentic-web protocols: Brief (third-party or self-issued verifiable credentials), Claim (self-proclaimed identity/capability, e.g. AgentCards), Proof (cryptographic verification — signatures, ZK proofs, TEE attestations), Stake (economic collateral with slashing), Reputation (graph-based social feedback), and Constraint (sandboxing, capability bounding).

The authors evaluate Google’s A2A, the Agent Payments Protocol (AP2), Ethereum’s ERC-8004 “Trustless Agents”, and related designs against a shared matrix of tradeoffs — cost, latency, Sybil resistance, information overhead, social robustness. Special emphasis on LLM-specific fragilities that change the calculus: prompt injection, sycophancy/nudge-susceptibility, hallucination, deception, emergent power-seeking, and goal misalignment. These mean that unverified trust mechanisms are structurally brittle at machine scale. The conclusion: no single primitive suffices; trustless-by-default architectures should layer Brief for identity/discovery, Proof for high-impact actions, Reputation for flexibility and social signals, and Constraint as a safety net.

Key Ideas

  • Six trust-model primitives form a basis for inter-agent protocol design
  • LLM fragilities (prompt injection, sycophancy, hallucination, deception, power-seeking) force Proof + Constraint over pure Claim/Reputation
  • Protocol mapping: A2A uses Claim heavily (AgentCards); AP2 adds Constraint (mandate/capability caps); ERC-8004 unifies Brief + Proof + Stake + Reputation on-chain
  • Classic trust dimensions — Sybil resistance, collusion robustness, whitewashing, cold-start — recur at agent scale
  • Advocates layered, trustless-by-default protocol architectures
  • Defence-in-depth: multi-modal trust signals, calibration of blended scores, governance standardisation

Connections

Conceptual Contribution

Tags

#trust #reputation #llm-agents #agent-protocols #web3 #a2a #erc-8004 #agentic-web

Backlinks