Online Car Marketplace
Developing a Data Pipeline
The script I developed parses the HTML of the Cars.com listings pages and collects fields such as year, make, model, price, consumer rating, state, and zip code. It stores the records in a Pandas DataFrame and exports them to a CSV file for analysis.
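As a rough illustration of the storage and export step, here is a minimal sketch. The sample records, the helper name records_to_csv, and the price-cleaning step are assumptions made for this example and are not taken from the original script.

```python
import io

import pandas as pd

# Hypothetical records shaped like the fields named above; the values are made up.
records = [
    {"year": 2018, "make": "Toyota", "model": "Camry", "price": "$18,500",
     "consumer_rating": 4.7, "state": "CA", "zip_code": "90001"},
    {"year": 2020, "make": "Ford", "model": "F-150", "price": "$32,000",
     "consumer_rating": 4.5, "state": "TX", "zip_code": "73301"},
]


def records_to_csv(records, destination):
    """Build a DataFrame from scraped records and write it out as CSV.

    `destination` can be a file path or any writable text buffer.
    """
    df = pd.DataFrame(records)
    # Assumed cleaning step: strip "$" and "," so price can be treated as a number.
    df["price"] = df["price"].str.replace(r"[$,]", "", regex=True).astype(int)
    df.to_csv(destination, index=False)
    return df


# Write to an in-memory buffer here; a real run would pass a path such as "cars.csv".
buffer = io.StringIO()
df = records_to_csv(records, buffer)
csv_text = buffer.getvalue()
```

Passing index=False keeps the DataFrame's row index out of the file, so the CSV holds only the scraped columns.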
Data Science Community Engagement
After gathering the data, I uploaded it to Kaggle along with a supplementary Jupyter Notebook containing challenges for students to practice their data-handling skills. The dataset was well received by the community, with over 2,400 downloads, 40 upvotes, and 17 code submissions.
Real-World Data Collection
In this project, I built a web scraping script in Python using the BeautifulSoup library and used it to scrape data from over 9,000 car listings on Cars.com.
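The sketch below shows the general shape of this kind of extraction with BeautifulSoup. It runs against an inline HTML sample rather than the live site. The tag names, the class names (vehicle-card, title, primary-price), and the function parse_listings are hypothetical. The real Cars.com markup and the original script may differ.

```python
from bs4 import BeautifulSoup

# Made-up markup standing in for a listings page; the class names are assumptions.
SAMPLE_HTML = """
<div class="vehicle-card">
  <h2 class="title">2018 Toyota Camry SE</h2>
  <span class="primary-price">$18,500</span>
</div>
<div class="vehicle-card">
  <h2 class="title">2020 Ford F-150 XLT</h2>
  <span class="primary-price">$32,000</span>
</div>
"""


def parse_listings(html):
    """Extract one dict per listing card from an HTML string."""
    soup = BeautifulSoup(html, "html.parser")
    listings = []
    for card in soup.find_all("div", class_="vehicle-card"):
        title = card.find("h2", class_="title").get_text(strip=True)
        price = card.find("span", class_="primary-price").get_text(strip=True)
        # Assumes titles look like "<year> <make> <model and trim>".
        year, make, model = title.split(maxsplit=2)
        listings.append(
            {"year": int(year), "make": make, "model": model, "price": price}
        )
    return listings


listings = parse_listings(SAMPLE_HTML)
```

Each card becomes a plain dictionary, so the resulting list can be passed directly to pandas.DataFrame for further analysis.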