16 Generative Large Language Models
This chapter covers
- A brief history of generative modeling
- Training a miniature GPT model from scratch
- Using a pretrained transformer model to build a chatbot
- Building a multi-modal model that can describe images in natural language
Now that we have covered the key building blocks for text modeling problems, we will turn our attention to the open-ended world of text generation. By scaling up the ideas from the last two chapters, we will build and use conversational models that have been trained on a significant portion of the English-language text available on the internet. Along the way, we will discuss both the potential and the shortcomings of such models.