Natural Language Processing to identify substances related to overdose deaths: An R-tidymodels workshop
This will be an interactive workshop introducing a natural language processing / machine learning workflow using R. We will specifically use the tidymodels framework for the analyses. We will be analyzing text data from medical examiners across the United States. We will develop a model to classify overdose deaths by type of substance involved.
If you want to follow along with the code, please do the following before the workshop (will speed things up the day of):
- Download and install R: https://cran.rstudio.com/
- Downlaod and install RStudio (free version): https://www.rstudio.com/products/rstudio/download/
- Clone this repository / or download the .rmd file and the /Data/ folder
- Open RStudio
- Open the .rmd file
- Run the first chunk to install the required packages
If this feels to onerous, you can also follow me along as (I struggle) we go through the code the day of the workshop.
Thanks for taking the time to check out the repository and being present at the workshop.
David