This chapter covers:
- Uncovering the hidden structure in unstructured data
- A search-centric philosophy of language and natural language understanding
- Exploring distributional semantics and word embeddings
- Modeling domain-specific knowledge
- Tackling challenges in natural language understanding and query interpretation
- Applying natural language learning techniques to both content and signals
In the first chapter, we provided a high-level overview of what it means to build an AI-powered search system. Throughout the rest of the book, we’ll explore and demonstrate the numerous ways your search application can continuously learn from your content and your user behavioral signals in order to better understand your content, your users, and your domain, and to ultimately deliver users the answers they need. We will get much more hands on in chapter three, firing up a search server (Apache Solr), a data processing layer (Apache Spark), and starting with the first of our Jupyter notebooks, which we’ll use throughout the book to walk through many step-by-step examples.