6 Brokers

This chapter covers:

Exploring the roles of brokers in our context
Important metrics to monitor in our cluster
Evaluating tradeoffs for certain broker configuration values

So far in our discussions, we have been dealing with Kafka from the view of an application developer interacting from external applications and processes. However, Kafka is a distributed system that needs attention to run in its own right. Let’s start to look at the parts that make the message brokers work!

6.1 Introducing the Broker

Some queues have the concept of a 'smart' broker and a 'dumb' client or the reverse: a 'dumb' broker and a 'smart' client. These terms might fit other queue and/or topic systems well but I would not get too involved in trying to make Kafka brokers and clients fit in either category. In my opinion, I think that both parts have a lot of power. Being a broker in Kafka means being a part of a cluster of machines.

If you are familiar with Big Data concepts or worked with Hadoop before, you might near similar terminology such as rack awareness and partitions. For example, Kafka has a rack awareness feature that will make replicas of the same partition exist physically across different racks. This is important since if all of the servers that make up your cluster are on the same rack, and the entire rack is offline, it would be the same as if you had lost your entire cluster.

6.2 Why Kafka needs Zookeeper

6.3 What does it mean to be a message broker

6.4 Configuration at the Broker Level

6.4.1 Kafka’s Core: The Log

6.4.2 Application Logs

6.5 What Controllers are for

6.6 Leaders and their role

6.6.1 Inter-Broker Communications

6.6.2 The Role of Replicas

6.7 In-Sync Replicas (ISR) Defined

6.8 Unclean Leader Election

6.9 Seeing Metrics from Kafka

6.9.1 Cluster Maintenance

6.9.2 Adding a Broker

6.9.3 Upgrading your Cluster

6.9.4 Upgrading your clients

6.9.5 Backups

6.10 A Note on Stateful Systems

6.11 Exercise