Maximizing Profits for Airbnb Hosts in Hawaii

Posted on Jul 28, 2019

As businesses grow and become more popular, it is imperative that they learn to adapt their model in a way that does not negatively effect local environments. As a Data Scientist, I am committed to both business and environment so that they may find ways to live together symbiotically. 

Introduction & Purpose

Airbnb is a popular rental service that allows individuals to make extra cash off of unused spaces in their homes. The service is popular to tourists because it offers luxurious, private accommidations for a discounted rate compared to local hotels. Airbnb makes anywhere from 3% per booking from hosts and 6 to 12% from guests. While tourists have enjoyed utilizing Airbnb for travel, many local communities have suffered due to the influx of people. Hawaii is one of these states where the negative effects of un-restricted vaction rentals is quite apparent. The increase in tourism around small, previously rural communities has increased cost of living, which is highly undesirable for a state where the residential population makes less than $80,000 (1) a year and the land value is above $700,000 (2).  The increased property value has left properties on the market for extended amounts of time. These properties are eventually bought by wealthy non-residents and transformend into vacation rentals, which displaces locals. This year, Hawaii attempted to pass legislation that would restict Airbnb rentals. Though the legislation failed, it is up for review again and Hawaii will soon join the handful of states that have passed such legislation. 

With this increasing trend in legislations, it is crucial for Airbnb to adapt to these new changes to keep profits steady. The purpose of this analysis is to explore the factors: rental age, total rental reviews, and total featured amenities, and their effects on nightly rates for rentals across the state of Hawaii. Using data from the Airbnb Hawaii website that includes 275 rentals in Hawaii, simple linear regressions (SLRs) and a multivariate linear regression (MLR) are used to shed light on ways hosts in Hawaii can increase their profits in the new environment that restictive legislation will eventually create in the state. 

Data Collection

Data were collected from the Airbnb Hawaii website using the Selenium web scraping package in Python. When you enter the Airbnb Hawaii website, the properties appear in list form that looks like the following.

Each property contains a url on the main page that the Selenim code had to recognize and click on to access the property-specific pages. Factors of interest obtained from each property's page were: 

  • Rental location (Island location of rental)
  • Rate (in USD)
  • Total reviews 
  • Total featured amenities 
  • Rental age (in years)

Below is an example of a property's page.

Data Analyses

Analyses were conducted in R Studio and Numpy in Python. Plots were generated using Seaborn. A correlation matrix was produced to examine possible relationships between nightly rates and the other factors of interest.

Nightly rates across the state of Hawaii showed a positive correlation with featured amenities (r = 0.36) and a negative correlation with total rental reviews (r = -0.46), and rental age (r = -0.09). These relationships are demonstrated visually in the plots below.

SLRs  were performed to analyze the relationship between rate and each individual factor. Significant relationships were observed between rate and featured amenities (p < 0.0001) and total rental reviews (p < 0.0001). The relationship between rate and rental age was not significant. 

MLR was then performed to observe relationships controlling for all other factors. The model includes total rental reviews, rental age, and number of featured amenities. The MLR showed a significant negative linear relationship between total rental reviews and rate ( p < 0.0001). Significant positive linear relationships were observed between rate and featured amenities ( p < 0.001). All other relationships were not significant. 


Plots for rates and all factors in Hawaii
Plots for rates and factors by host status

Conclusions and Recommendations

Nightly rental rates shared significant linear relationships between number of featured amenities and total rental reviews. Based on the analysis, for a regular host or Superhost to increase their profit, they should increase the number of featured amenities they offer and decrease their rate. If hosts decreases their rate, they will receive more rental reviews, based on the relationship between the two. The assumption is that a rental with low rates and a high number of amenities will be perceived by guests as a great value. This perception could increase total number of reviews for a rental and increase its star-rating, thus increasing a rental's popularity. 

In the future,  I will perform more in-depth analyses on specific amenities that are associated with the highest reviews and highest rates. I will also explore the relationships observed in this analysis by island. 





Sources and Links

(1) Department of Numbers (2017).

(2) KITV4 News (2017).


To examine the code for this project, click the link to my project Github:

About Author


Laura Elliott

Laura Elliott is an experienced data analyst and epidemiologist who specialized in the distribution of mental health disorders, specifically adolescent anxiety. Now pursuing data science, Ms. Elliott intends to improve business practices through her extensive knowledge of mental...
View all posts by Laura Elliott >

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 airbnb Alex Baransky alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus API Application artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep Bundles California Cancer Research capstone Career Career Day citibike clustering Coding Course Demo Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day Discount dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Hiring Partners Industry Experts Instructor Blog Instructor Interview Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research Meet the team meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time Portfolio Development prediction Prework Programming PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau team TensorFlow Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping Weekend Course What to expect word cloud word2vec XGBoost yelp