Moving Truck Rental Reviews

Xu Huang
Posted on Aug 13, 2018

This is a Web Scraping project for the review of different moving truck rental companies, using Selenium, python and R. The Moving101 website provides useful knowledge and tips for different types of moving, and moving cost estimations.

The main method used for scraping web information is Selenium because the customer reviews must be updated by "load more" button. There are five companies on the website with customer reviews, however U-Haul, Penske and Budget are the 3 main companies to provide enough reviews.

The moving bussiness has strong strong seasonal feature. Here the calendar heat map clearly shows that April to October are the moving season, reaching the peak in July.

Among all the reviews with complete entries, Penske has the highest costomer ratings. If pay a closer look at the rating vs moving distance, we can see that Penske has nearly ~75% long-distance moving, meanwhile receives highest rating, indicating that their trucks have higher quality and customer service is reliable.

To specify the pro's and con's among customer reviews, the word clouds below are differed by high-score comments and low-score comments. Here the low-score comments reveals that the main problem in U-Haul service is that they have problem to deliver the truck to desired location.

To know more about my project, welcome to my my GitHub account

About Author

Xu Huang

Xu Huang

Xu Huang got PhD in Computational Chemistry from University of Iowa and B.S. in Chemistry from Peking University. Her study includes developing & testing the computational code to improve the accuracy for the modeling of battery material &...
View all posts by Xu Huang >

Leave a Comment

No comments found.

View Posts by Categories


Our Recent Popular Posts


View Posts by Tags

#python #trainwithnycdsa 2019 airbnb Alex Baransky alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus API Application artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep Bundles California Cancer Research capstone Career Career Day citibike clustering Coding Course Demo Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day Discount dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Hiring Partners Industry Experts Instructor Blog Instructor Interview Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research Meet the team meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time Portfolio Development prediction Prework Programming PwC python python machine learning python scrapy python web scraping python webscraping Python Workshop R R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau team TensorFlow Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping Weekend Course What to expect word cloud word2vec XGBoost yelp