Online Car Marketplace

Developing a Data Pipeline

The script I developed parses the HTML of the Cars.com listings pages and collects fields such as year, make, model, price, consumer rating, state, and zip code. It stores the results in a Pandas DataFrame and exports them to a CSV file for analysis.
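The store-and-export step can be sketched roughly as follows. This is a minimal illustration, not the original script: the sample records, the column names, and the `build_dataframe` helper are assumptions made up for the example, and the cleaning rules (stripping "$" and "," from prices) are one plausible choice.

```python
import io

import pandas as pd

# Hypothetical records standing in for scraped listings; all values are made up.
records = [
    {"year": 2018, "make": "Honda", "model": "Civic", "price": "$18,500",
     "consumer_rating": "4.7", "state": "CA", "zip_code": "94107"},
    {"year": 2015, "make": "Ford", "model": "F-150", "price": "$24,990",
     "consumer_rating": None, "state": "TX", "zip_code": "73301"},
]


def build_dataframe(rows):
    """Turn a list of listing dicts into a DataFrame with numeric price and rating."""
    df = pd.DataFrame(rows)
    # Remove "$" and "," so the price text can be converted to a number.
    df["price"] = pd.to_numeric(
        df["price"].str.replace(r"[$,]", "", regex=True), errors="coerce"
    )
    # Missing or unparseable ratings become NaN instead of raising an error.
    df["consumer_rating"] = pd.to_numeric(df["consumer_rating"], errors="coerce")
    return df


df = build_dataframe(records)

# Export to CSV; an in-memory buffer is used here to avoid touching the disk.
buffer = io.StringIO()
df.to_csv(buffer, index=False)
csv_text = buffer.getvalue()
```

Passing a file path such as `"cars.csv"` to `to_csv` instead of the buffer writes the file to disk. Zip codes are kept as strings in this sketch so that leading zeros survive the export.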

Data Science Community Engagement

After gathering the data, I uploaded it to Kaggle along with a supplementary Jupyter Notebook that contains challenges for students to practice their data handling skills. The dataset was well received by the community, with more than 2,400 downloads, 40 upvotes, and 17 code submissions.

Real-World Data Collection

For this project, I built a web scraping script in Python using the BeautifulSoup library and used it to scrape data from more than 9,000 car listings on Cars.com.
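A rough sketch of the parsing step with BeautifulSoup is shown below. The HTML snippet, the tag and class names (`listing`, `title`, `price`, `rating`), and the helper functions are hypothetical and do not reflect the actual Cars.com markup or the original script; they only illustrate the general pattern of looping over listing cards and pulling out fields.

```python
from bs4 import BeautifulSoup

# Hypothetical markup: tag and class names are illustrative, not Cars.com's real ones.
SAMPLE_HTML = """
<div class="listing">
  <h2 class="title">2018 Honda Civic</h2>
  <span class="price">$18,500</span>
  <span class="rating">4.7</span>
</div>
<div class="listing">
  <h2 class="title">2015 Ford F-150</h2>
  <span class="price">$24,990</span>
</div>
"""


def text_or_none(card, tag, class_name):
    """Return the stripped text of the first matching element, or None if absent."""
    node = card.find(tag, class_=class_name)
    return node.get_text(strip=True) if node else None


def parse_listings(html):
    """Extract one dict per listing card from the given HTML string."""
    soup = BeautifulSoup(html, "html.parser")
    records = []
    for card in soup.find_all("div", class_="listing"):
        title = text_or_none(card, "h2", "title") or ""
        # Assumes titles look like "<year> <make> <model>"; pad if parts are missing.
        year, make, model = (title.split(maxsplit=2) + [None] * 3)[:3]
        records.append({
            "year": int(year) if year and year.isdigit() else None,
            "make": make,
            "model": model,
            "price": text_or_none(card, "span", "price"),
            "rating": text_or_none(card, "span", "rating"),
        })
    return records


records = parse_listings(SAMPLE_HTML)
```

Returning `None` for missing fields, as with the second listing's rating, keeps one malformed card from stopping a long scraping run.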

Python
Analytics
Statistics
Data Scraping
