11 Reaching understanding through parsing and intent resolution
This chapter covers
- Determining intents with NLP and NLU
- Incorporating parsing and lexicons as appropriate for voice
- Designing and developing with tagging guides
- Implementing intent resolution with middle layers and multiple passes
Chapter 10 covered the first component of a conversational system: speech-to-text (STT). This chapter is step 2: natural language processing (NLP) and natural language understanding (NLU). If chapter 10 focused on understanding what a user said, chapter 11 focuses on what those words mean, as NLU takes the STT representations and interprets the intended meaning behind those words to determine how to respond. Here, you learn what “meaning” means to a voice system, why it’s difficult, and how to reach the best result.
If your recognition goes wrong, your meaning interpretation probably will too: you can’t “make sense” out of the wrong words. Accurate understanding lets you respond sensibly to users and give them what they ask for. It lets you create more complex and sophisticated VUIs that handle ambiguity and pronoun references, and it lets you deal with mistakes intelligently when they occur (chapter 13). In this chapter, you learn about some of the methods involved in NLP and NLU. NLP assigns structure to a text string, while NLU uses that additional information to more accurately determine the user’s meaning and assign an intent that the VUI can act on and respond to.