Are Americans choosing used cars over new?

Mi (Mimi) Chung
Posted on Feb 18, 2019

According to the above Auto Remarketing article, there is a rise in used-vehicle sales. One of the factors discussed was the high percent in discount on a used car that isn't too different from a newer model. 

In this study, the process of buying a car is made simpler through comparison graphs and the notion that American's are buying used cars instead of new cars is researched. 

Carfax is a website that allows users to list and sell their used vehicles. The filtering and sorting options are extensive, which are very helpful for narrowing down options for people interested in buying. Using Selenium, the year, make, model, price, mileage, color, and body type were scraped. The quantifiable variables were plotted against each other from a subset of car postings as an example for narrowing options for an interested buyer.

Data cleaning was required for the categorical and numerical columns. 


In the above plot, a subset of cars were chosen and it's year and mileage were chosen as variables. For the best value, a newer car with lower mileage is preferred, so the lower right quadrant would be the best options. However, there are many more variables considered during a typical car decision making process.

The above plot adds the Price variable to the previous plot. Here, the size of the bubbles correlate with the relative pricing of each vehicle. For the best value, the smaller bubbles located in the higher end of the Relative Year and the lower end of the Relative Mileage would be the best options. 

To investigate the idea of Americans buying used cars instead of new cars, the change in sales for the year of 2018 was scraped and plotted. This serves as a preliminary approach to compare new vehicle sales to used vehicle sales.

It appears that there is a generally positive trend in new vehicle sales, suggesting that these opposite values of new and used vehicle sales are independent of each other. This normalized plot serves as the first step to exploring the type of buyer and why they chose one over the other.

With both quantifiable and categorical data, more variables can be visualized for more sophisticated decision making, and the sales trends of vehicles in the US can be explained.

About Author

Mi (Mimi) Chung

Mi (Mimi) Chung

Mimi Chung is a data scientist with experience in the chemical and innovative material science industry as an associate engineer. Previously, she has worked across multiple functions to research, develop and sell electronic solutions. She has experience in...
View all posts by Mi (Mimi) Chung >

Related Articles

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

2019 airbnb alumni Alumni Interview Alumni Spotlight alumni story Alumnus API artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Big Data bootcamp Bootcamp Prep Bundles California Cancer Research capstone Career citibike clustering Coding Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Industry Experts Job JP Morgan Chase Kaggle lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Open Data painter pandas Portfolio Development prediction Programming PwC python python machine learning python scrapy python web scraping python webscraping Python Workshop R R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest recommendation recommendation system regression Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Tableau Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping What to expect word cloud word2vec XGBoost yelp