10 Topic modelling

 

This chapter covers:

  • Introduction to topic modelling with Latent Dirichlet Allocation (LDA)
  • Overview of gensim, an NLP toolkit for topic modelling
  • Implementation of an unsupervised topic modelling approach using gensim
  • Introduction to several visualization techniques for topic exploration in data

The previous chapter introduced various NLP and machine learning techniques for topic classification and topic analysis. Here is a reminder of the scenario that you’ve worked on:

10.1  Topic Modelling with Latent Dirichlet Allocation

10.1.1    Estimating Parameters for the LDA

10.1.2    LDA as a Generative Model

10.2  Implementation of the Topic Modelling Algorithm

10.2.1    Loading the data

10.2.2    Preprocessing the data

10.2.3    Applying the LDA model

10.2.4    Exploring the results

10.3  Summary