chapter ten

10 Application architecture for generative AI apps

 

This chapter covers

  • Overview of GenAI Application Architecture and the emerging GenAI app stack
  • Learning about the different layers that make up the GenAI app stack
  • Defining GenAI architecture principles
  • Learning about the benefits of orchestration frameworks and some of the popular ones
  • Outlining model ensemble architectures – allowing us to combine different foundational models
  • How to create a strategic framework for a cross-functional AI Center of Excellence

As the enterprise architecture landscape continues to evolve, it is moving inexorably toward more self-directed systems—intelligent, self-managing applications capable of learning from interactions and adapting in real time. As more digitization occurs, this fuels the AI digital transformation. This ongoing progression underscores a transformative era in enterprise technology, poised to redefine the very nature of software development and deployment.

10.1 Generative AI – Application Architecture

10.2 Generative AI – Application Stack

10.2.1 GenAI Architecture Principles

10.2.2 GenAI Application Architecture – Detailed View

10.3 Orchestration Layer

10.3.1 Benefits of an Orchestration Framework

10.3.2 Orchestration Frameworks

10.3.3 Managing Operations

10.3.4 Prompt Management

10.4 Grounding Layer

10.4.1 Data Integration and Preprocessing

10.4.2 Embeddings and Vector Management

10.5 Model Layer

10.6 Response Filtering

10.7 Summary

10.8 References