Skip to content

In this project, I use the Random forest algorithm to build the house price prediction model on a dataset with 16 features and 4600 samples from Kaggle. Random Forest Regressor will be an optimal algorithm in this problem because it works well on both categorical and numerical features. Moreover, it is robust to missing values, new entries, and …

Notifications You must be signed in to change notification settings

JennyYu2017/ML-Regression-House-Price-Prediction-with-Random-Forest-Regressor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 

Repository files navigation

Machine-Learning---House Price Prediction with Random Forest Regressor

In this project, I use the Random forest algorithm to build the house price prediction model on a dataset with 16 features and 4600 samples from Kaggle(https://www.kaggle.com/shree1992/housedata).

Random Forest Regressor will be an optimal algorithm in this problem because it works well on both categorical and numerical features. Moreover, it is robust to missing values, new entries, and outliers and will save us the effort to normalize the data considering each feature’s scale varies a lot.

The experiment shows that feature engineering using Recursive Feature Elimination does help improve model performance. Comparing the model trained on all features, the model trained on the 7 optimal features improves the R2 score from 0.4904 to 0.5435 while reducing the Mean absolute error from 134490.3 to 131453.1.

Plots and Visualizations

About

In this project, I use the Random forest algorithm to build the house price prediction model on a dataset with 16 features and 4600 samples from Kaggle. Random Forest Regressor will be an optimal algorithm in this problem because it works well on both categorical and numerical features. Moreover, it is robust to missing values, new entries, and …

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published