Analysis Of NYC Airbnb Listings
Image by TravelScape on Freepik
Research Questions for NYC Listings
In this Airbnb analysis we will try to answer these questions:
- Where is the best area for Airbnb listings in NYC?
- Where are the opportunities in the market that would allow for a new lister on Airbnb to optimize their revenue?
Analysis Background
Recently, Airbnb’s revenue from listings nearly doubled globally from $3.88 billion in 2020 to $6.85 billion in 2021 with North America growing the most among regions. With many families struggling to make ends meet during difficult economic times, listing a property with Airbnb may be an effective way to earn alternative revenue.
According to Mashvisor, as of March 2023, there are over 6.6 active listings on the platform that are managed by over 4.4 million hosts. Over 150 million people use Airbnb as their primary means of finding rental units. Because of the influx of new listings, listers often find it difficult to get started with Airbnb. They see a saturation of listings and are deterred from even initially listing their property. Now they can use this analysis for guidance. It helps identify potential gaps in the NYC listings market to help listers best optimize their revenue.
Airbnb Data
The data for this analysis was gathered from Inside Airbnb, specifically in New York City for the month of December in 2022. Although the dataset may not incorporate seasonality for the rest of the year, the dataset has columns with predicted availability over the next 365 days for each listing. To get a better understanding of the columns in this dataset, you can refer to the data dictionary at this link.
Initial Findings
For the initial analysis of the dataset I wanted to determine how each of the features influence each other. Accordingly, I created a Pearson's Correlation Heat-map to try and measure how each feature influences each other. This is seen in this figure.
From this we see that the price has a positive weak correlation with the number of listings a host has. It would seem that as the number of listings increases, the price of each listing also increases. This may be because many of the listings are in an area where the prices are just naturally higher, as is the case for Manhattan. It also seems that there is a weak correlation between the availability of a listing and its price. This is interesting because the availability_365 column is defined as the number of days a listing is available in the next 365 days, and it would seem that as the number of days a listing is available increases, the price also increases.
We can’t draw conclusions from unavailable listings because there are two different causes for that state. In the dataset dictionary, it states that a listing may be unavailable due to the owner of the property blocking the listing from being booked or because the listing has already been booked. This either shows that many of the listings in the used dataset are blocked by the lister or that there truly is a negative relationship between these two variables. To explore more, we would need to look at another dataset of all of the listers in NYC and delve more deeply into the types of listings there are.
Above is an initial heat map showing where the heaviest concentration of listings in NYC are. It shows that there is a large concentration of Listings in Manhattan and Brooklyn. As a new lister, this is logical because there are more tourist attractions in those areas, such as Central Park, the Empire State Building, etc.
To compare the listings and how successful they were, we created a column in the data frame named "annual_revenue" that was calculated from the availability_365 column and the price column. This was then added to each listing to show their estimated annual revenue. The neighborhoods with the highest average annual revenue are as follows.
- Chelsea - $119,443.27
- Brooklyn Heights - $105,301.77
- Theater District - $102,203.87
- SoHo - $96,167.34
- Tribeca - $89,216.63
- Financial District - $82,792.28
- West Village - $79,576.62
- Greenwich Village - $77,590.37
- Midtown - $77,515.15
- Boerum Hill - $77,406.62
- NoHo - $72,173.75
- Downtown Brooklyn - $69,474.00
- Battery Park City - $67,842.68
- Hell's Kitchen - $65,484.57
- Kips Bay - $63,182.99
- Columbia St - $62,889.90
- Vinegar Hill - $62,258.82
- East Village - $61,701.07
- Nolita - $59,398.92
- Cobble Hill - $57,816.72
From this figure, we can see that the annual revenue for the listings in those neighborhoods are in the range of $60,000-$110,000 a year. To get a better understanding, we would need to look into the categorical descriptors.
From the right figure above, it seemed as if the borough had a correlation with the annual revenue. Further exploration reveals that Manhattan has the greatest median price, while the Bronx has the lowest.
The next step in the analysis is to break the listings up into price bins. We did this because this then allowed for further analysis of all of the boroughs. We can now look at a price range where a lister should optimally list their property. Once done for all of the boroughs, some gaps in the market may appear.
Below are line charts for each borough and it shows the number of listings in the borough, the average days a listing is not booked, and the estimated annual revenue calculated from average nights books multiplied by the nightly rate compared to its price bin. A spike in the chart may show where a price bin is popular in the borough and may unveil an optimized range for a listing in a borough.
Brooklyn Analysis
From these charts, we can see that the majority of the listings from this dataset are from Brooklyn in the lower price bins. We can see that the number of listings decreases as you approach the $500 price bin, the availability of the listings increases, but the annual revenue is increasing. This means that there may be a market for listings up until the $500 price bin. When looking closer to the second chart, having a listing more than from $100-$500 may be optimal.
Manhattan Analysis
This analysis reveals that the $980-$990 price bin again creates more annual revenue than its neighboring price bins. In Manhattan there are also more listings around that price range, meaning that there is already a market at that price range. This may be because people finding listings have a budget around $1000 a night and are specifically looking for the most in that price range. It also shows that if a lister wanted, lowering or raising the price to that specific price bin may bear higher annual revenue.
Queens Analysis
From this, it can be seen that there is a large increase in estimated annual revenue at around the $820-$830 price bin. When explored further, it can also be seen that there is also more demand for listings in this price bin. It is showcased in the middle figure, where there is a steep drop in the number of days a listing is available in the price range. The neighboring price bins also show a comparable number of listings as the $820-$830 price bin, but with far less estimated annual revenue. This shows that there may be a market for listings in that price range, as well as in the $980-$990 price bin.
The Bronx Analysis
It can be seen that there are only about 250-300 listings in the Bronx. From the third chart it can be seen there is an increase in annual revenue at around the $250 price bin followed by a steep drop around $300. When explored further it can be seen that the price bin is less available meaning higher popularity, as well as an increase in annual revenue until the $300 price bin. This means that the pocket price bin may be between around $200-$300. If a lister has a property close to that price point, he/she may want to try and increase or decrease their listing price to make their listing more visible to listees.
Staten Island Analysis
From these charts we can see that there are not many listings in Staten Island from our dataset. The estimated annual revenue spike near the right end of the graph can be attributed to the fact that there are fewer than 10 listings in that price range in Staten Island. This is considered noise caused by a low sample size. This either means that there truly are not many listings in Staten Island or the data frame we used does not include all listings from Staten Island. To get a more in-depth look, we would need a Staten Island specific dataset which may provide the insights this dataset is missing.
Analyzing the Type of Room
From the earlier analysis we can see that a majority of the listings are in Manhattan and Brooklyn. When looking closer into which types of room generate the most revenue, we find each borough has a different optimal room type. Below are some analyses which find potential markets for new listers with a specific listing type.
Brooklyn Home
From these graphs we can find potential markets as well as parts of the market which listers should avoid. There is a price bin at $990-$1000 that generates more revenue than its neighboring price bins while also having a lower availability. There is also an increase in listings in that price point. It can also be seen that the price bin at $860-$870 should be avoided. The graphs show that there is less estimated revenue, increased availability while maintaining roughly the same number of listings. This means that the listings at that price point are not as popular as other price bins.
Manhattan Private
When looking at private room type listings in Manhattan, it can be seen again that there is a price point at $990-$1000 which is very popular. Again, there is a drastic increase in estimated annual revenue, a decrease in availability, and increase in number of listings. This means that the price bin is very popular among listing seekers. If a lister owns a private room in Manhattan, he/she should look to price their room at around $990-$1000. Listers should avoid the $400-$420 price range because, although they have more listings, the estimated annual revenue is lower when compared to its neighboring price bins, and the availability is greater than its neighboring price bins.
Limitations to the Analysis
Even though the analysis shows the price points in which listings are popular, it does not explain why they are popular. Listing seekers may also have preferences for particular premium features like elevators, upscale furnishings, rooms with nicer views, proximity to public transportation, or a quieter location. All of those features can influence how willing a person is to pay more. To provide a more comprehensive picture for listing seekers, more details about the listing are needed. For example, it is much more expensive to live in certain parts of Tribeca than it is to live in certain parts in Chinatown. The analysis done is more about looking at listings as an aggregate and does not detail the specifications for a listing to be in a specific price bin. It does not detail the specific features, such as maintenance, type of flooring, elevator access, and nearby services.