2 Foundation Models: Language & Embedding
This chapter covers
- What foundation models are and how they became the new standard in AI
- The architecture and training pipeline behind models like GPT and Claude
- How inference and reasoning strategies affect model behavior
- Why embeddings matter—and how they power search, retrieval, and recommendations
- Key trade-offs: hallucinations, bias, compute cost, and deployment decisions
This chapter explains the engineering foundations behind modern AI systems—what they are, how they’re built, and why they behave the way they do. If you want to move beyond simply calling an API and start making informed decisions about how models are used in your applications, this is where it begins.
We’ll walk through the full development pipeline of foundation models like GPT-4 or Claude, from large-scale pretraining on raw web data to post-training alignment methods like instruction tuning and RLHF. You’ll learn how inference parameters affect outputs, and why small changes in temperature or stop sequences can drastically shift a model’s behavior.
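To make the temperature claim concrete before we get to the details, here is a minimal sketch (not any provider's actual implementation) of how temperature rescales a model's next-token probabilities. The logit values are made up for illustration; real models produce one logit per vocabulary entry.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Convert raw logits to probabilities, scaled by temperature.

    Dividing logits by the temperature before the softmax sharpens the
    distribution when temperature < 1 (more deterministic sampling) and
    flattens it when temperature > 1 (more diverse sampling).
    """
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for three candidate tokens
logits = [2.0, 1.0, 0.1]
sharp = softmax_with_temperature(logits, 0.5)  # top token dominates
flat = softmax_with_temperature(logits, 2.0)   # probability spreads out
```

At a temperature of 0.5, the highest-scoring token captures most of the probability mass; at 2.0, the same logits yield a much flatter distribution. This is why nudging a single parameter can swing a model from repetitive to erratic.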
Alongside language models, we’ll introduce embedding models—less visible, but essential to systems like semantic search and Retrieval-Augmented Generation (RAG). These models don’t generate text; instead, they map meaning into vectors, enabling machines to organize and retrieve information based on semantic similarity.
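The idea of "retrieving by meaning" comes down to comparing vectors. Here is a sketch using cosine similarity, the most common comparison metric; the three-dimensional vectors are toy values invented for illustration (real embedding models output hundreds or thousands of dimensions).

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1.0 means identical
    direction, 0.0 means unrelated (orthogonal)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Hypothetical toy embeddings: semantically close texts should map
# to nearby vectors, unrelated texts to distant ones.
dog = [0.9, 0.1, 0.0]
puppy = [0.8, 0.2, 0.05]
spreadsheet = [0.0, 0.1, 0.95]

cosine_similarity(dog, puppy)        # high: related meanings
cosine_similarity(dog, spreadsheet)  # low: unrelated meanings
```

A semantic search system applies exactly this comparison at scale: embed the query, embed the documents, and return the documents whose vectors score highest against the query's.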