part four
Part 4 Text-to-image generation
with transformers
While diffusion models dominate today’s landscape, transformer-based approaches to text-to-image generation remain highly influential and offer a different perspective. In chapter 11, we dive deep into the vector quantized generative adversarial network, which transforms images into discrete sequences of integers, enabling transformers to handle images in the same way they process text.