      Sen LEI

Real Estate Analysis

18 Sep 2018

Data Sets (privately owned)

Goal

Given basic information about a specific house, city-level features, summary statistics of the surroundings based on the house's location, macroeconomic indicators, and lagged price information, predict the rental and sale price of the house.
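As an illustration of the lagged price input, here is a minimal pandas sketch. The data is made up and every column name (`city`, `month`, `avg_price`, `area_sqm`) is an assumption, since the real data sets are private. It lags the monthly average price by one month within each city and attaches it to house-level rows.

```python
import pandas as pd

# Hypothetical monthly average prices per city (made-up values).
city_prices = pd.DataFrame({
    "city": ["A", "A", "A", "B", "B", "B"],
    "month": pd.to_datetime(["2018-01-01", "2018-02-01", "2018-03-01"] * 2),
    "avg_price": [100.0, 102.0, 105.0, 200.0, 198.0, 203.0],
})

# Lag the average price by one month within each city.
city_prices = city_prices.sort_values(["city", "month"])
city_prices["avg_price_lag1"] = city_prices.groupby("city")["avg_price"].shift(1)

# Hypothetical house-level observations.
houses = pd.DataFrame({
    "house_id": [1, 2, 3],
    "city": ["A", "B", "A"],
    "month": pd.to_datetime(["2018-02-01", "2018-03-01", "2018-03-01"]),
    "area_sqm": [80.0, 120.0, 65.0],
})

# Attach the lagged city price to each house by city and month.
features = houses.merge(
    city_prices[["city", "month", "avg_price_lag1"]],
    on=["city", "month"],
    how="left",
)
```

The one-month lag is only one possible choice; the lag length used in the actual model is not stated.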

Main Model

Math part

The subscripts denote, in order: observation, city, district, and grid.

The other terms are, in order:

  • features of the house itself
  • city features
  • features of the surrounding "grid"
  • macroeconomic features
  • the inverse distance weighting (IDW) matrix of each city, calculated from the center coordinates of each district in the city
  • average price (by district)
  • lagged average price (by city)
  • error term
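To make the IDW matrix concrete, here is a minimal sketch in plain Python. It assumes planar (x, y) district centers, Euclidean distance, a distance exponent of 1, and row normalisation; none of these choices is confirmed by the post, and the function names `idw_matrix` and `spatial_lag` are hypothetical. Multiplying the matrix by the district average prices gives, for each district, a distance-weighted average of its neighbours' prices.

```python
import math

def idw_matrix(centers, power=1.0):
    """Row-normalised inverse-distance weights between district centers.

    Assumes all centers are distinct; the diagonal is left at zero so a
    district does not weight itself.
    """
    n = len(centers)
    weights = [[0.0] * n for _ in range(n)]
    for i, (xi, yi) in enumerate(centers):
        for j, (xj, yj) in enumerate(centers):
            if i != j:
                weights[i][j] = 1.0 / math.hypot(xi - xj, yi - yj) ** power
        row_sum = sum(weights[i])
        if row_sum > 0:
            weights[i] = [w / row_sum for w in weights[i]]
    return weights

def spatial_lag(weights, values):
    """Weighted average of the other districts' values, one per district."""
    return [sum(w * v for w, v in zip(row, values)) for row in weights]

# Made-up district centers and average prices for one city.
centers = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0)]
avg_price = [100.0, 120.0, 90.0]

W = idw_matrix(centers)
neighbour_price = spatial_lag(W, avg_price)
```

With these three collinear centers the middle district weights both neighbours equally. For latitude and longitude coordinates, a great-circle distance would be a more appropriate choice than `math.hypot`.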

Results

Compared with linear regression, LASSO, and ridge regression, XGBoost gives the best predictions: it has the lowest MSE on the test set.

The feature importances produced by XGBoost are, to some extent, confirmed by the LASSO feature selection results.
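A comparison of this kind can be sketched as follows. This is not the original experiment: the data is synthetic, the hyperparameters are arbitrary, and scikit-learn's `GradientBoostingRegressor` stands in for XGBoost so that the sketch depends only on scikit-learn (`xgboost.XGBRegressor` could be swapped in). It records the test MSE of each model and checks how far the features kept by LASSO overlap with the most important features of the boosted model.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.linear_model import Lasso, LinearRegression, Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(600, 8))
# Synthetic target: only features 0, 1 and 2 matter, feature 1 non-linearly.
y = (3.0 * X[:, 0] + 2.0 * np.abs(X[:, 1]) - 1.5 * X[:, 2]
     + rng.normal(scale=0.5, size=600))

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

models = {
    "linear": LinearRegression(),
    "lasso": Lasso(alpha=0.1),
    "ridge": Ridge(alpha=1.0),
    "boosting": GradientBoostingRegressor(random_state=0),  # stand-in for XGBoost
}

# Fit every model on the training split and score it on the test split.
test_mse = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    test_mse[name] = mean_squared_error(y_test, model.predict(X_test))

# Features LASSO keeps (non-zero coefficients).
lasso_selected = {int(i) for i in np.flatnonzero(models["lasso"].coef_)}

# The three most important features according to the boosted model.
importances = models["boosting"].feature_importances_
boosting_top = {int(i) for i in np.argsort(importances)[::-1][:3]}

# Features on which the two methods agree.
overlap = lasso_selected & boosting_top
```

Comparing `test_mse` across models mirrors the MSE comparison above, and `overlap` gives a rough measure of agreement between the two feature rankings.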

img: XGB Feature Importance

ARIMA Model on Average Price

img: Monthly Rental Forecast



Tags: XGBoosting, real estate, ARIMA, feature selection, forecast