Chapter 14. Multilingual search

 

This chapter covers

  • Using Solr’s stemming and language identification libraries
  • Searching multiple languages using separate fields
  • Searching multiple languages in the same field through separate Solr cores
  • Searching multiple languages in the same field and Solr core

Solr ships with a robust suite of linguistic libraries that enable searching across a wide spectrum of languages from around the world. This chapter provides an overview of these libraries and demonstrates how to use them most effectively in your search applications. It assumes an understanding of Solr’s schema.xml (covered in chapter 5) and the process of text analysis (covered in chapter 6), so refer back to those chapters as needed.

14.1. Why linguistic analysis matters

 
 
 

14.2. Stemming vs. lemmatization

 
 

14.3. Stemming in action

 
 
 

14.4. Handling edge cases

 
 
 
 

14.5. Available language libraries in Solr

 
 
 

14.6. Searching content in multiple languages

 
 

14.7. Language identification

 
 
 

14.8. Summary

 
 
 
 