In a previous post, I explained how we can use regular expressions or "regex" in R to parse our text data. Turns out there is a very useful R library for crafting regular expressions, especially in the early stages of learning the notation.
Computational text analysis can be a powerful tool for exploring qualitative data. In this blog post, I'll walk you through the steps involved in reading a document into R in order to find and plot the most relevant words on each page.
In my last blog post, we discussed how to read .pdf files into RStudio.
Using pdftools, we were able to read in .pdfs that were both machine-ready and not.