Task : build a multilinear regression model to predict price
Summary : I build a multilinear regression model using : ols method various steps for multilinear regression models are :
- columns names
- columns data types
- duplicacy in records
- outlier detection
- etc...
- pairplot
- correlation (relation between x and y) (point to notice - 1 : is there any relation between x and x)
- building model (trial mode) (point to notice - 2 : rquared, AIC, p-value, adj_rsquared)
- a-1) checking if x and x have good correlation
- a-2) is p-value significant or not
- multicollinearity (VIF) (checking how individual feature affecting the model prediction)
- error / residue handing
- improving model (trial mode)
- checking influential points (cook's distance)
- improving model (trial mode)
After dealing with all the above case we build our final model. Final model has good R-squared value, less AIC value, significal p-value. Having all these requirements, we can now approve our model.
Conclusion :
[model1 - 0.868116 (R-squared)] [model2 - 0.883968 (R-squared)] [final_model - 0.888240 (R-squared)]