chapter one

1 What are LLM Agents and Multi-Agent Systems?

 

This chapter covers

  • Current real-world applications of LLM agents and multi-agent systems
  • What LLM agents are and why LLMs alone are insufficient
  • Important design patterns, enhancements, and protocols for LLM agents
  • When applications may benefit from multi-agent systems
  • A roadmap for developing LLM agents and multi-agent systems

If a user asks a Large Language Model (LLM) where to find the best value in croissants in New York City, the LLM might respond, “I will search the web for highly-rated croissants and their prices.” LLMs are very good at expressing intent to act toward a specific goal—to generate text that tells us what they are going to do to resolve a query. At this point, however, we run into a critical limitation: since LLMs are only text-generators, they cannot act on their intentions. They can articulate a plan for processing a task, but cannot carry it out—unless they are surrounded by a system to orchestrate the plan and execute the actions.

These orchestration systems are called LLM agents. We’ll add some nuance to this definition soon, but for our purposes, LLM agents are systems that automatically turn the LLM’s intentions into actions.

1.1 Where LLM Agents and Multi-Agent Systems are useful

1.1.1 Report generation

1.1.2 Web search and deep search

1.1.3 Agentic RAG

1.1.4 Coding LLM agents

1.1.5 Computer use

1.1.6 Enhancing applications with MAS

1.2 What is an LLM agent?

1.2.1 Prerequisite LLM capabilities

1.3 The processing loop

1.4 Important enhancements and patterns

1.5 Protocols for LLM agents

1.6 Multi-Agent Systems

1.7 Our LLM agent framework

1.8 How to use this book

1.8.1 Code

1.8.2 The basics of UML diagrams

1.9 Our roadmap

1.10 Summary