In this video, Professor Chris Bail of Duke University introduces the basics of quantitative text analysis such as character encoding, GREP, and importing/cleaning text data. Link to slides: https://compsocialscience.github.io/summer-institute/2020/materials/day3-text-analysis/basic-text-analysis/Rpres/Basic_Text_Analysis.html#/ Links to further content discussed in this video are below.
Link to annotated code: https://compsocialscience.github.io/summer-institute/2020/materials/day3-text-analysis/basic-text-analysis/rmarkdown/Basic_Text_Analysis_in_R.html
Regex cheat sheet: https://rstudio.com/wp-content/uploads/2016/09/RegExCheatsheet.pdf
Other resources:
1. Julia Silge's TidyText https://www.tidytextmining.com/
2. Ken Benoit's Quanteda: https://quanteda.io/
3. Jurafsky and Martin's textbook: https://web.stanford.edu/~jurafsky/slp3/