Predicting Housing Prices in Ames: A Machine Learning Project

Gregory Brucchieri, Adrian Phillips-Samuels and William Fallon

Posted on Mar 20, 2018

Introduction

In this project we use machine learning techniques to attempt to predict housing prices in Ames, Iowa. The data comes from the Kaggle competition "House Prices: Advanced Regression Techniques". The team consists of Gregory Brucchieri, Billy Fallon and Adrian Phillips-Samuels. We are The Fighting Mongooses.

The Data

The data contained a training set with 1460 observations of 79 features and the target variable Sale Price. The features were a mix of 28 continuous variables and 51 categorical. 34 features contained missing values. We used a variety of techniques to impute these values, usually drawing from the variables and their description, which can be seen in the final code. We treated categorical variables as ordinal whenever possible. We observed 2 outliers with an unusual price/square footage ratio and chose robust scaling to account for these values. A number of additional features were engineered and categoricals were replaced through one hot encoding.

The Models

A number of models were used to explain and predict the sales price, including Random Forest, Gradient Boosting, XGBoost and linear modeling. In the end a weighted ensemble of Random Forest, Gradient Boosting and XGBoost provided our best model. We recieved a Kagle Score of .1232.

All code and results can be seen here, in our github repo. The final prediction code is in the Project_Consolidated.py file.

About Authors

Gregory Brucchieri

Gregory has a Master of Arts in Economics from NYU. He is a former business analyst with Humana, Inc, where he maintained provider relations and contract databases for smaller, local networks Humana had paired with. He is driven...

View all posts by Gregory Brucchieri >

Adrian Phillips-Samuels

View all posts by Adrian Phillips-Samuels >

William Fallon

View all posts by William Fallon >

Python

EDA and machine learning Ames housing price prediction project

Meetup

Machine learning Uber vs. Lyft price prediction modeling

Machine Learning

Predicting Customer Churn at Telco

Data Visualization

The Data Behind EV Driving

Capstone

Blind Dating Ensemble Classifier

Cancel reply

You must be logged in to post a comment.

No comments found.

Predicting Housing Prices in Ames: A Machine Learning Project

Introduction

The Data

The Models

About Authors

Gregory Brucchieri

Adrian Phillips-Samuels

William Fallon

Related Articles

Leave a Comment

Cancel reply

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

NYC Data Science Academy

Get detailed curriculum information about our
amazing bootcamp!

Offerings

About

SOCIAL MEDIA

Predicting Housing Prices in Ames: A Machine Learning Project

Introduction

The Data

The Models

About Authors

Gregory Brucchieri

Adrian Phillips-Samuels

William Fallon

Related Articles

Leave a Comment

Cancel reply

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

NYC Data Science Academy

Get detailed curriculum information about our amazing bootcamp!

Offerings

About

SOCIAL MEDIA

Get detailed curriculum information about our
amazing bootcamp!