Chapter 8. Text mining and text analytics

8.1. Text mining in the real world

8.2. Text mining techniques

8.2.1. Bag of words

8.2.2. Stemming and lemmatization

8.2.3. Decision tree classifier

8.3. Case study: Classifying Reddit posts

8.3.1. Meet the Natural Language Toolkit

8.3.2. Data science process overview and step 1: The research goal

8.3.3. Step 2: Data retrieval

8.3.4. Step 3: Data preparation

8.3.5. Step 4: Data exploration

8.3.6. Step 3 revisited: Data preparation adapted

8.3.7. Step 5: Data analysis

8.3.8. Step 6: Presentation and automation

8.4. Summary

What’s inside