2 Working with natural language
This chapter covers
- The hidden structures in unstructured data
- A search-centric philosophy of language
- Exploring distributional semantics and vector-based embeddings
- Modeling domain-specific knowledge
- Challenges with natural language and queries
- Applying natural language understanding techniques to both content and signals
In the first chapter, we provided a high-level overview of what it means to build an AI-powered search system. Throughout the rest of the book, we’ll explore and demonstrate the numerous ways your search application can continuously learn from your content and your users’ behavioral signals to better understand your content, your users, and your domain, and to ultimately deliver users the answers they need. We will get much more hands-on in chapter 3, firing up a search server of your choice and a data processing layer (Apache Spark) and starting with the first of our Jupyter notebooks, which we’ll use throughout the book to walk through many step-by-step examples.