Chapter 8. Text mining and text analytics
8.1. Text mining in the real world
8.2. Text mining techniques
8.2.2. Stemming and lemmatization
8.2.3. Decision tree classifier
8.3. Case study: Classifying Reddit posts
8.3.1. Meet the Natural Language Toolkit
8.3.2. Data science process overview and step 1: The research goal
8.3.3. Step 2: Data retrieval
8.3.4. Step 3: Data preparation
8.3.5. Step 4: Data exploration
8.3.6. Step 3 revisited: Data preparation adapted
8.3.7. Step 5: Data analysis
8.3.8. Step 6: Presentation and automation
What’s inside