This chapter covers
- Comparing asynchronous and synchronous processing
- Understanding the event loop
- Hiding latency with async I/O and deferring work
- Handling errors in async systems
- Observing async systems
Throughout the book, we’ve built a comprehensive understanding of latency optimization. In part 1, we established the foundations by exploring the fundamental nature of latency, why it’s so important, and essential techniques for modeling and measuring it. In part 2, we explored data-centric latency optimization strategies, such as partitioning and caching, and in part 3, we explored code-level techniques to reduce latency.
In this part of the book, we'll turn our attention to hiding latency. This approach becomes critical when you've exhausted latency optimization methods or have run into constraints in your system architecture. For example, you may have hit the physical limits of your hardware, or maybe you're working with third-party systems that you cannot change. In such scenarios, latency-hiding techniques, such as asynchronous processing and predictive methods, become essential for improving the latency of your application.
This chapter focuses on asynchronous processing. Unlike synchronous processing, where operations block until completion, asynchronous processing allows your system to initiate tasks without waiting for their results. This can significantly reduce the perceived latency and improve overall system responsiveness.
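To make the contrast concrete, here's a minimal sketch using Python's asyncio. The `fetch` coroutine and its delays are hypothetical stand-ins for real I/O such as a network call; the point is that running two waits sequentially costs the sum of their latencies, while running them concurrently costs roughly only the slowest one.

```python
import asyncio
import time

async def fetch(source: str, delay: float) -> str:
    # Stand-in for a network call: awaiting yields control to the
    # event loop, which can run other tasks during the wait.
    await asyncio.sleep(delay)
    return f"result from {source}"

async def sequential() -> float:
    # Synchronous-style: each call waits for the previous one,
    # so latencies add up (~0.2s total here).
    start = time.perf_counter()
    await fetch("users", 0.1)
    await fetch("orders", 0.1)
    return time.perf_counter() - start

async def concurrent() -> float:
    # Asynchronous: both calls are in flight at once, so total
    # latency is roughly that of the slowest call (~0.1s here).
    start = time.perf_counter()
    await asyncio.gather(fetch("users", 0.1), fetch("orders", 0.1))
    return time.perf_counter() - start

seq = asyncio.run(sequential())
con = asyncio.run(concurrent())
print(f"sequential: {seq:.2f}s, concurrent: {con:.2f}s")
```

Note that the individual operations take just as long either way; what changes is how much of that time the rest of the system spends blocked, which is exactly what "hiding" latency means.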