Movie Review NLP
Analysis with NLTK
After gathering the data, it was time to prepare it for use in predictive modeling. The NLTK library (Natural Language Toolkit) was a critical component of this project and allowed me to convert the review text into a numerical format suitable for machine learning algorithms.
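A minimal sketch of this kind of text-to-numbers conversion is a bag-of-words count vector built with NLTK's tokenizer and stemmer. The stop-word list, the helper names (`review_to_tokens`, `build_vocabulary`, `vectorize`), and the sample reviews are all assumptions made for illustration, not the exact preprocessing used in the project.

```python
from collections import Counter

from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize

# Tiny hand-picked stop list (an assumption for this sketch); NLTK's full
# stopwords corpus would require a separate nltk.download("stopwords") call.
STOP_WORDS = {"the", "a", "an", "and", "was", "it", "is", "of", "this"}

stemmer = PorterStemmer()


def review_to_tokens(text):
    """Lowercase, tokenize, drop punctuation and stop words, then stem."""
    tokens = wordpunct_tokenize(text.lower())
    return [stemmer.stem(t) for t in tokens if t.isalpha() and t not in STOP_WORDS]


def build_vocabulary(reviews):
    """Map every distinct stemmed token to a column index (alphabetical order)."""
    vocab = sorted({tok for review in reviews for tok in review_to_tokens(review)})
    return {tok: idx for idx, tok in enumerate(vocab)}


def vectorize(text, vocabulary):
    """Return a list of token counts, one entry per vocabulary column."""
    counts = Counter(review_to_tokens(text))
    return [counts.get(tok, 0) for tok in vocabulary]


# Hypothetical reviews, made up for this example.
sample_reviews = [
    "Loved it, the acting was great and the story was great.",
    "A boring plot and terrible acting.",
]
vocabulary = build_vocabulary(sample_reviews)
vectors = [vectorize(review, vocabulary) for review in sample_reviews]
```

Each review becomes a fixed-length list of counts, so reviews of different lengths can be fed to the same model. Words that never appeared when the vocabulary was built are simply ignored.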
Machine Learning
With the data prepped, I was able to explore various machine learning models. At the end of the project I compared the models side by side, and the overall prediction accuracy was around 70%, which was a great result considering that the only predictor was the text of the movie review.
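A side-by-side comparison like this can be sketched as a loop that trains each model on the same training set and scores it on the same held-out set. The models in the project are not named here, so this example uses two of NLTK's built-in classifiers as stand-ins. The feature dictionaries, star labels, and the `compare_models` helper are hypothetical.

```python
from nltk.classify import DecisionTreeClassifier, NaiveBayesClassifier, accuracy

# Hypothetical labeled data: ({word: True, ...}, star_rating) pairs.
train_set = [
    ({"great": True, "loved": True}, 5),
    ({"great": True, "fun": True}, 5),
    ({"boring": True, "terrible": True}, 1),
    ({"boring": True, "slow": True}, 1),
]
test_set = [
    ({"great": True}, 5),
    ({"boring": True}, 1),
]


def compare_models(trainers, train_data, test_data):
    """Train each model on train_data and return {name: accuracy on test_data}."""
    results = {}
    for name, train_fn in trainers.items():
        model = train_fn(train_data)
        results[name] = accuracy(model, test_data)
    return results


# Stand-in models (an assumption); any callable that returns an NLTK
# classifier could be added to this dictionary.
trainers = {
    "naive_bayes": NaiveBayesClassifier.train,
    "decision_tree": DecisionTreeClassifier.train,
}

scores = compare_models(trainers, train_set, test_set)
for name, score in sorted(scores.items()):
    print(f"{name}: {score:.2f}")
```

Scoring every model on the same held-out set keeps the comparison fair. On a toy dataset this small the numbers mean little, so the sketch only shows the structure of the comparison.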
Scraping Text Data
For this project I wanted to expand my machine learning skills and get some practice working with text data. I gathered and compiled a small dataset of written movie reviews with the intention of predicting how many stars each reviewer gave the movie. I wrote a Python script that uses the Beautiful Soup library to scrape the text data.
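As a rough sketch of the scraping step, the snippet below parses review text and star ratings out of an HTML string with Beautiful Soup. The markup, class names, `data-stars` attribute, and `extract_reviews` helper are invented for illustration, since the site that was scraped and its page structure are not described here. A real script would first fetch each page over HTTP.

```python
from bs4 import BeautifulSoup

# Hypothetical page fragment; a real review site will use different markup.
SAMPLE_HTML = """
<div class="review" data-stars="5">
  <p class="text">  Loved it, the acting was great.  </p>
</div>
<div class="review" data-stars="2">
  <p class="text">A boring plot and terrible acting.</p>
</div>
"""


def extract_reviews(html):
    """Return a list of (review_text, stars) tuples found in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    reviews = []
    for block in soup.find_all("div", class_="review"):
        text_tag = block.find("p", class_="text")
        if text_tag is None:
            continue  # skip review blocks that have no text body
        text = text_tag.get_text(strip=True)
        stars = int(block["data-stars"])
        reviews.append((text, stars))
    return reviews


scraped = extract_reviews(SAMPLE_HTML)
```

Keeping the parsing in a function that takes an HTML string lets it be tested on saved pages without making network requests.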