Moving Truck Rental Reviews

Xu Huang
Posted on Aug 13, 2018

This is a Web Scraping project for the review of different moving truck rental companies, using Selenium, python and R. The Moving101 website provides useful knowledge and tips for different types of moving, and moving cost estimations.

The main method used for scraping web information is Selenium because the customer reviews must be updated by "load more" button. There are five companies on the website with customer reviews, however U-Haul, Penske and Budget are the 3 main companies to provide enough reviews.

The moving bussiness has strong strong seasonal feature. Here the calendar heat map clearly shows that April to October are the moving season, reaching the peak in July.

Among all the reviews with complete entries, Penske has the highest costomer ratings. If pay a closer look at the rating vs moving distance, we can see that Penske has nearly ~75% long-distance moving, meanwhile receives highest rating, indicating that their trucks have higher quality and customer service is reliable.

To specify the pro's and con's among customer reviews, the word clouds below are differed by high-score comments and low-score comments. Here the low-score comments reveals that the main problem in U-Haul service is that they have problem to deliver the truck to desired location.

To know more about my project, welcome to my my GitHub account

About Author

Xu Huang

Xu Huang

Xu Huang got PhD in Computational Chemistry from University of Iowa and B.S. in Chemistry from Peking University. Her study includes developing & testing the computational code to improve the accuracy for the modeling of battery material &...
View all posts by Xu Huang >

Leave a Comment

No comments found.

View Posts by Categories


Our Recent Popular Posts


View Posts by Tags

2019 airbnb alumni Alumni Interview Alumni Spotlight alumni story Alumnus API artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Big Data bootcamp Bootcamp Prep Bundles California Cancer Research capstone Career citibike clustering Coding Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Industry Experts Job JP Morgan Chase Kaggle lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Open Data painter pandas Portfolio Development prediction Programming PwC python python machine learning python scrapy python web scraping python webscraping Python Workshop R R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest recommendation recommendation system regression Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Tableau Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping What to expect word cloud word2vec XGBoost yelp