INTRO TO TEXT MINING AND NLP FOR HEALTH DATA

Friday, Feb 12, 2021 - 10:00am to 12:00pm

Click here to register.

Speakers: Wesley Brooks (UC Davis) - Arthur Koehl (UC Davis)

This workshop covers an introduction to natural language processing (NLP) and caveats for its application to health data. Using the R programming language we will introduce the basics of text processing and demo how to calculate common metrics including word frequencies, term frequency-inverse document frequency (TFIDF), and principal component analysis (PCA) to explore important words and group similar documents. We will also introduce more advanced NLP topics (sentiment analysis, topic modeling, etc.) and discuss classical versus deep learning approaches, as time permits. Learners with proficient R skills are encouraged to code along.

This event is part of UC Love Data Week