Movie Review NLP

Analysis with NLTK

After gathering the data, it was time to prepare it for predictive modeling. The NLTK library (Natural Language Toolkit) was a critical component of this project: it allowed me to convert the text data into a numerical format suitable for machine learning algorithms.
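As a rough illustration of turning review text into numbers, here is a minimal bag-of-words sketch. The project's exact preprocessing is not described, so the choice of NLTK's `RegexpTokenizer` and `PorterStemmer`, the toy reviews, and the helper names `to_tokens` and `to_vector` are all assumptions made for this example.

```python
from collections import Counter

from nltk.stem import PorterStemmer
from nltk.tokenize import RegexpTokenizer

# Hypothetical toy reviews; the project's real dataset is not shown here.
reviews = [
    "A great movie with great acting.",
    "Boring plot and terrible acting.",
]

# Assumed preprocessing: keep runs of letters/apostrophes, then stem.
# Neither component needs an extra NLTK data download.
tokenizer = RegexpTokenizer(r"[a-z']+")
stemmer = PorterStemmer()


def to_tokens(text):
    """Lowercase, tokenize, and stem one review."""
    return [stemmer.stem(tok) for tok in tokenizer.tokenize(text.lower())]


tokenized = [to_tokens(review) for review in reviews]

# Vocabulary: every distinct stem, sorted so column positions are stable.
vocabulary = sorted({tok for doc in tokenized for tok in doc})


def to_vector(tokens):
    """Count how often each vocabulary term occurs in one review."""
    counts = Counter(tokens)
    return [counts[term] for term in vocabulary]


# One row per review, one column per vocabulary term.
matrix = [to_vector(doc) for doc in tokenized]

for review, row in zip(reviews, matrix):
    print(row, "<-", review)
```

Each row of `matrix` is a count vector that a classifier can consume. A real pipeline would likely also remove stop words or weight terms, which this sketch leaves out.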

Machine Learning

With the data prepped, I explored several machine learning models. At the end of the project I compared the models side by side. Overall prediction accuracy was around 70%, which was great considering that the only predictor was the text of the movie review.
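A side-by-side comparison like this comes down to scoring each model's predictions against the same held-out labels. The sketch below uses only the standard library. The labels, the model names `model_a` and `model_b`, and the majority-class baseline are made up for illustration and are not the project's models or results.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that exactly match the true star rating."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def majority_baseline(y_true):
    """Predict the most frequent rating for every review."""
    most_common_label, _ = Counter(y_true).most_common(1)[0]
    return [most_common_label] * len(y_true)


# Toy star ratings and predictions, invented for this example only.
y_true = [5, 4, 4, 2, 4, 1, 3, 4, 5, 4]
predictions = {
    "model_a": [5, 4, 4, 2, 4, 1, 4, 4, 4, 4],
    "model_b": [4, 4, 4, 3, 4, 2, 3, 4, 5, 5],
}

scores = {name: accuracy(y_true, pred) for name, pred in predictions.items()}
scores["majority_baseline"] = accuracy(y_true, majority_baseline(y_true))

# Rank from best to worst so the comparison reads top-down.
ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
for name, score in ranked:
    print(f"{name:>18}: {score:.0%}")
```

Including a baseline is a design choice rather than something the project states it did. It shows how much of a model's accuracy comes from the review text instead of from one rating simply being common.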

Scraping Text Data

For this project I wanted to expand my machine learning skills and get some practice working with text data. I gathered and compiled a small dataset of written movie reviews, with the goal of predicting how many stars each reviewer would give the movie. I used a Python script built on the Beautiful Soup library to scrape the text data.
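The scraping step might look roughly like the sketch below. The original script and the site it targeted are not shown, so the HTML snippet, the class names `review`, `stars`, and `review-text`, and the helper `parse_reviews` are hypothetical. The markup is inlined so the example runs without a network request.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for a downloaded review page.
html = """
<div class="review">
  <span class="stars">4</span>
  <p class="review-text">Sharp writing and a strong cast.</p>
</div>
<div class="review">
  <span class="stars">2</span>
  <p class="review-text">Too long, and the ending falls flat.</p>
</div>
"""


def parse_reviews(page_html):
    """Extract (text, stars) pairs from one page of review markup."""
    soup = BeautifulSoup(page_html, "html.parser")
    rows = []
    for block in soup.find_all("div", class_="review"):
        text = block.find("p", class_="review-text").get_text(strip=True)
        stars = int(block.find("span", class_="stars").get_text(strip=True))
        rows.append({"text": text, "stars": stars})
    return rows


reviews = parse_reviews(html)
for row in reviews:
    print(row["stars"], "-", row["text"])
```

In a real scraper, `page_html` would come from an HTTP response, and the selectors would need to match the target site's actual markup.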

Python
Analytics
Machine Learning
Data Scraping
