
[Bug Related]Training loss unstable and sudden jump high #757

Closed
iFe1er opened this issue Jul 30, 2017 · 1 comment

Comments


iFe1er commented Jul 30, 2017

Environment info

Operating System: Windows 10
Python Version: Python 2.7 (Anaconda) + LightGBM

Error Message:

I was doing a binary classification task on a relatively large dataset (2.5 GB, with 16 GB of memory). At around the 500th round, one of the iterations is abnormal, causing the error on both the training set and the validation set to jump up. These anomalies happen from time to time; sometimes simply changing a parameter such as bagging_fraction from 0.8 to 0.9 solves the problem, but it happens again if I add a new feature.

At first I thought it was a CPU core communication problem, but changing num_threads didn't seem to help.

As you can see in the picture, the highlighted output line is obviously abnormal. As I understand the GBDT algorithm, the training error should only decrease, never increase. However, this output line is very different from the normal ones: its logloss is higher and very different from the lines immediately before and after it.

[screenshot: training log with the abnormal iteration highlighted]

I have also seen this problem once with LightGBM on my Linux server. It is very disturbing and seriously affects the accuracy of my model. I wonder why this happens and how to solve it.

BTW: I wonder whether this is a memory-related problem, because my data file is very large and sometimes causes a "Memory Error" when the code executes the lgb.train() line.
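(Not from the original report: a minimal sketch of one way to lower peak memory, assuming the raw data is first loaded with pandas. The file and column names train.csv, label and train.bin are placeholders. The idea is to build the lgb.Dataset once, save it in LightGBM's binary format, and drop the raw matrix before calling lgb.train().)

```python
import pandas as pd
import lightgbm as lgb

# Hypothetical file/column names -- the issue does not show the loading code.
df = pd.read_csv('train.csv')
X = df.drop('label', axis=1)
y = df['label']

# Build the Dataset once and save it in LightGBM's binary format,
# then free the raw pandas objects before training to lower peak memory.
train_set = lgb.Dataset(X, label=y)
train_set.save_binary('train.bin')
del df, X, y

# Reloading from the binary file skips re-binning the raw feature matrix.
train_set = lgb.Dataset('train.bin')
```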

Another occurrence, with different features and different parameters:
[screenshot: second training log showing the same kind of abnormal iteration]

Also, the abnormal behavior is NOT random; I can reproduce the same result every time I rerun my code.

My parameters:

params = {
    'boosting_type': 'gbdt',
    'objective': 'binary',
    'metric': 'binary_logloss',
    'num_leaves': 96 - 1,
    'learning_rate': 0.03,
    'feature_fraction': 0.6,
    'bagging_fraction': 0.8,
    'bagging_freq': 1,
    'verbose': 1
}
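Here is a minimal, self-contained sketch of how the training is invoked (random placeholder data only, not my actual pipeline); recording the per-iteration metric makes it easy to pinpoint the exact iteration where the logloss jumps:

```python
import numpy as np
import lightgbm as lgb

# Placeholder data only -- the real dataset is ~2.5 GB and cannot be shared.
rng = np.random.RandomState(42)
X = rng.rand(10000, 50)
y = rng.randint(0, 2, size=10000)

train_set = lgb.Dataset(X[:8000], label=y[:8000])
valid_set = lgb.Dataset(X[8000:], label=y[8000:], reference=train_set)

params = {
    'boosting_type': 'gbdt',
    'objective': 'binary',
    'metric': 'binary_logloss',
    'num_leaves': 96 - 1,
    'learning_rate': 0.03,
    'feature_fraction': 0.6,
    'bagging_fraction': 0.8,
    'bagging_freq': 1,
    'verbose': 1,
}

evals = {}
booster = lgb.train(
    params,
    train_set,
    num_boost_round=1000,
    valid_sets=[train_set, valid_set],
    valid_names=['train', 'valid'],
    callbacks=[lgb.record_evaluation(evals)],
)

# Locate iterations where the training logloss goes UP instead of down.
train_loss = evals['train']['binary_logloss']
jumps = [i + 1 for i in range(1, len(train_loss)) if train_loss[i] > train_loss[i - 1]]
print('iterations where training logloss increased:', jumps)
```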

@iFe1er iFe1er changed the title Training loss unstable and sudden jump high [Bug Related]Training loss unstable and sudden jump high Jul 30, 2017
@guolinke
Collaborator

@iFe1er
It is possible when you are using bagging, since some of the training data are out-of-bag in each iteration.
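One way to sanity-check this explanation (a sketch only, not verified in this thread): disable bagging and see whether the training logloss becomes monotonic, reusing the params and train_set objects from the sketch above.

```python
# Same call as above, but with bagging effectively turned off, so every tree
# is fit on the full training set (at the cost of bagging's regularization).
params_no_bagging = dict(params)
params_no_bagging.update({
    'bagging_fraction': 1.0,  # every row is used for every tree
    'bagging_freq': 0,        # never re-sample
})

evals_full = {}
lgb.train(
    params_no_bagging,
    train_set,
    num_boost_round=1000,
    valid_sets=[train_set],
    valid_names=['train'],
    callbacks=[lgb.record_evaluation(evals_full)],
)

# If bagging was the cause, the training logloss should now decrease steadily.
loss = evals_full['train']['binary_logloss']
print('any remaining increases?', any(b > a for a, b in zip(loss, loss[1:])))
```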
