2 Understanding and measuring hallucinations in LLMs
This chapter covers
- Types of hallucinations in LLMs
- Identifying and measuring hallucinations
- Mitigating hallucinations in LLMs
Imagine you're a lawyer working on an important case. You're short on time, so you decide to use an AI legal assistant to help with research. You give the AI some prompts, and it quickly generates a draft full of convincing arguments and legal precedents. Impressed, you include parts of the AI's output in your own brief and submit it to the court.
But there's a big problem: some of the legal cases the AI cited don't actually exist. They're completely fabricated. By using this hallucinated content without double-checking it, you've accidentally misled the court and put your own reputation at risk.
This isn't just a hypothetical scenario. In 2023, a law firm in New York was fined $5,000 for doing exactly this: submitting a brief that cited fake cases generated by ChatGPT. The lawyer who used ChatGPT never checked whether the cases it cited were real. The episode shows how serious the consequences of AI hallucinations can be.
As large language models (LLMs) like GPT have become more capable and more widely deployed, hallucinations have emerged as a major challenge. Hallucinations occur when a model generates content that is factually incorrect, internally inconsistent, or simply made up. Left unchecked, they can spread misinformation, lead to incorrect medical guidance or unsound financial advice, and cause many other harms.