Skip to content

kunl222/End-To-End-Bulldozer-Prize-Regression

Repository files navigation

End-To-End-Bulldozer-Prize-Prediction

machine learning project with the goal of predicting the sale price of bulldozers. this kind of problem is known as a regression problem The data and evaluation metric we'll be using (root mean square log error or RMSLE) is from the Kaggle Bluebook for Bulldozers competition.

1. Problem Definition

How well can we predict the future sale price of a bulldozer, given its characteristics previous examples of how much similar bulldozers have been sold for?

2. Data

There are 3 datasets:

Train.csv

Historical bulldozer sales examples up to 2011 (close to 400,000 examples with 50+ different attributes, including SalePrice which is the target variable).

Valid.csv

Historical bulldozer sales examples from January 1 2012 to April 30 2012 (close to 12,000 examples with the same attributes as Train.csv).

Test.csv

Historical bulldozer sales examples from May 1 2012 to November 2012 (close to 12,000 examples but missing the SalePrice attribute, as this is what we'll be trying to predict).

3. Evaluation

For this problem, Kaggle has set the evaluation metric to being root mean squared log error (RMSLE). As with many regression evaluations, the goal will be to get this value as low as possible.

About

End-To-End-Bull-Dozer-Prize-Regression

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published