Data Analysis on Real Estate in Ames, Iowa
The skills I demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.
Ames has an interesting housing market because data shows there has been some expansion going on in recent years. Newer neighborhoods tend to be on the outskirts of the city, but there have also been a good number of renovations as well.
As you can see above, all the neighborhoods surround Iowa State University (ISU). Oldtown is one of the oldest neighborhoods in Ames, but it has some very high renovation activity. It seems that Ames, Iowa is in the process of becoming a more modern housing market. This is one of the recent developments that make Ames an up-and-coming housing market.
The graphic above shows that the more recent houses were built on the outskirts of Ames which suggests is expanding outward. That said, there are a few houses being built around the airport which might mean that the city wants to make it easier for people to travel into Ames. This could lead to increased business activity in the downtown area.
Year of House Built
The above graphic shows the year the house was built (if it was not remodeled) or the year of the most recent renovation (until 2011). This shows that there is high renovation activity in the neighborhoods with the oldest homes. There seems to be a higher concentration of renovation activity near the downtown area; this might increase economic activity in the business districts.
This facet graph shows the number of renovations since 1995 by neighborhood. There are only four neighborhoods on an upward trend. This suggests that the housing market is expanding by building new houses. Renovations may not be the primary way for expansion in Ames.
However, this means that there is room for investment in Ames in the area of housing renovation. We can see that Ames is expanding by building more modern homes near the airport, downtown, and the outskirts of town. The property value of these newer homes will be improved if the older homes in these areas were renovated to match the increasing quality of homes being built. So, if you wanted to invest in the housing market of Ames by renovating older homes, where would you start? Are there any neighborhoods that have a high probability of a substantial return on investment?
When doing data analysis, the number of observations is important. When the sample size is small, it is difficult to make reliable conclusions from the statistics of the data because it is usually difficult for small sample sizes to approximate the population. Therefore, we will focus our analysis on the neighborhoods in the top six for the number of observations, namely: Old Town, Northwest Ames (NAmes), College Creek (CollgCr), Northridge Heights (NridgHt), Somerset, and Gilbert.
Sale Price Distribution
From the boxplot, we can see Old Town has a lot of outliers. Why? It is most likely because Old Town is one of the oldest neighborhoods in Ames and there are a lot of new houses that were built. These new houses were probably sold for higher prices than the older homes. Also, the high number of outliers shows that Old Town's housing landscape is being updated. It might be in the city's business interest to update Old Town seeing that it is near downtown and other business districts.
Northridge Heights has the biggest distribution of sale prices. Northridge Heights is in the northern part of Ames. As stated before, Ames is expanding; Northridge Heights is a byproduct of this expansion process. Thus, the houses in this neighborhood are more modern, hence more expensive. The 25th percentile of the sale price in Northridge Heights is greater than the median sale price of each of the other five neighborhoods. This highlights that newer houses tend to be more expensive than older houses. Inflation is not the cause of the price increase because all the homes in this dataset were appraised in the same time period.
In the dataset, there were well over 100 features and so, to avoid the curse of dimensionality, some feature selection needed to take place. Lasso regression worked very well in this case, along with some judgment calls, to come up with the following set of features to analyze:
These were the features that were used in the descriptive modeling process. Our team wanted to understand the importance of each feature on the neighborhood level, so we ran Multiple Linear Regression for each neighborhood individually. Here were the R-squared values for each model:
The top graph shows the R-squared value for each model and the bottom graph shows the number of observations for each neighborhood. Notice that the neighborhoods with the highest R-squared tend to have the lowest number of observations, although there are some notable exceptions. Old Town has a high number of observations, but a relatively low R-squared value. How can this be? It is most likely because Old Town has a high number of outliers, as can be seen from the previous boxplot, therefore the model did not make a great fit.
If you were an investor looking to invest in the housing market in Ames by renovating houses, =what would be your strategy? Based on the multiple linear regression models, we decided to focus on the neighborhoods in the top 6 for the number of observations and give business insights from their regression coefficients. Here is what we found:
We start with Old Town. Old Town has a number of old homes, but is in the process of being renovated. As with many old places, it is important to modernize the necessities without losing the historic aspects of the property. Older homes usually have subpar insulation, by today's standards, so an improvement in insulation and heating quality would increase the values of these homes.
As you can see, the coefficients for Gilbert and Northwest Ames were very similar, so we decided to give business insights for both simultaneously. Also, buy properties within these neighborhoods that are closer to the airport and renovate them because the airport distance seems to have a high coefficient.
ISU is a college town, but families with no college students might prefer to live away from ISU. Also, college students usually need cheaper homes, thus the most expensive houses tend to be further away from ISU. Therefore buying homes under the median price that are further away from ISU should bring a good return on an investment after renovation.
Northridge Heights is one of the newer neighborhoods in Ames, which might explain why it has a more expensive housing market when compared to the other neighborhoods. Also, the more expensive house tend to be located further away from ISU. Therefore, one should try to find houses cheaper than the median price of 300k in the northern part of the neighborhood.
In Somerset, the median house price is 223k, so this advice is similar to the others: buy cheap and renovate to increase the sale price beyond the median price.
This concludes the Ames Housing project! This work was done by me, Brian Kuo, and Evan Kiolbassa.