Text analysis: classification and topic modeling


Date
Nov 21, 2022 12:25 PM
Location
Rockefeller Hall 203

Overview

  • Introduce supervised text classification
  • Implement a tidymodels workflow using text features
  • Define topic modeling
  • Explain Latent Dirichlet allocation and how this process works
  • Demonstrate how to use LDA to recover topic structure from an unknown set of topics
  • Identify methods for selecting the appropriate parameter for $k$

Before class

Class materials

What you need to do after class

Benjamin Soltoff
Benjamin Soltoff
Assistant Senior Instructional Professor in Computational Social Science & the College