AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

Reference: Wu, Bansal, Zhang, Wu, Li, Zhu, Jiang, Zhang, Zhang, Liu, Awadallah, White, Burger, Wang (2023). Microsoft Research et al. arXiv:2308.08155v2. Source file: 2308.08155v2.pdf. URL

Summary

AutoGen is an open-source Microsoft framework for building LLM applications as conversations among customisable conversable agents. Each agent has a configurable back-end (LLMs, humans, tools, or a combination) and can send, receive, and react to messages. Developers compose applications by (1) defining specialised conversable agents and (2) programming their interaction patterns via natural language prompts and/or code — a paradigm the authors call conversation programming.

The framework supports diverse topologies (two-agent chat, group chat, hierarchical chat, dynamic routing), human-in-the-loop participation, and tool execution via code or function calls. Empirical studies demonstrate AutoGen on math, coding, QA, operations research, online decision-making, and entertainment tasks, showing that multi-agent conversations can exceed single-agent performance while reducing development effort.

Key Ideas

  • Conversable agents as the unifying abstraction — uniform message interface over LLMs, humans, and tools.
  • Conversation programming: defining agent capabilities + scripting their interaction patterns as the application-building paradigm.
  • Flexible conversation topologies: joint chat, hierarchical chat, group chat, dynamic routing.
  • Human-in-the-loop and tool execution as first-class participants, not special cases.
  • Empirical validation across six domains showing modular composition yields strong task performance.

Connections

Conceptual Contribution

Tags

#llm-agents #multi-agent-systems #framework #tool-use #conversation-programming

Backlinks