[jvm-packages]Fix early stopping condition #3928

CodingCat · 2018-11-21T23:21:05Z

No description provided.

CodingCat · 2018-11-22T01:38:06Z

yanboliang

Looks good overall; it would be better to have more docs. Thanks.

mingyang · 2019-01-08T02:40:15Z

jvm-packages/xgboost4j/src/main/java/ml/dmlc/xgboost4j/java/XGBoost.java

+    for (int shift = 0; shift < earlyStoppingRounds - 1; shift++) {
+      // the initial value of onTrack is false and if the metrics in any of `earlyStoppingRounds`
+      // iterations goes to the expected direction, we should consider these `earlyStoppingRounds`
+      // as `onTrack`
      onTrack |= maximizeEvaluationMetrics ?


Does this |= mean that if the metric moves in the right direction between any two consecutive steps within the last earlyStoppingRounds iterations, this method returns true?

This may not be what people normally expect from early stopping. For example, in a real training run of mine with earlyStoppingSteps set to 20, training should have stopped around iteration 120, since the maximum PR AUC was observed around iteration 100. Instead, the current logic seems to look for any upward segment within earlyStoppingSteps and keeps training.
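To make the concern concrete, here is a minimal sketch of the windowed check as I read it from the diff. It is not the XGBoost4J source: the class name, method name, the `>=`/`<=` comparisons, and the sample metric values are all assumptions made for illustration, since the diff above is cut off after the `?`.

```java
/** Hypothetical sketch of the windowed check discussed above; not the XGBoost4J source. */
final class WindowCheckSketch {

  /**
   * Returns true if any consecutive pair of metric values inside the window of
   * `rounds` iterations ending at `iter` moved in the desired direction.
   * The comparison operators are an assumption; the original diff is truncated.
   */
  static boolean anyPairImproved(float[] metric, int iter, int rounds, boolean maximize) {
    boolean onTrack = false;
    for (int shift = 0; shift < rounds - 1; shift++) {
      int cur = iter - shift;
      int prev = cur - 1;
      if (prev < 0) {
        break; // not enough history yet
      }
      onTrack |= maximize ? metric[cur] >= metric[prev] : metric[cur] <= metric[prev];
    }
    return onTrack;
  }

  public static void main(String[] args) {
    // Made-up PR AUC values: the peak is at index 2, followed by a noisy decline.
    float[] prAuc = {0.50f, 0.60f, 0.70f, 0.65f, 0.66f, 0.62f, 0.63f, 0.60f};
    // The best value is 5 iterations old, yet a small uptick (index 5 to 6)
    // inside the 4-round window keeps the check true.
    System.out.println(anyPairImproved(prAuc, 7, 4, true)); // prints "true"
  }
}
```

With a check of this shape, any noisy uptick inside the window counts as progress, even when the metric is well past its best value.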

In the python-package (see here), training stops if the current iteration is earlyStoppingSteps away from the best iteration. Should the Spark version be consistent with the Python implementation?
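For comparison, the best-iteration rule described above could be sketched roughly as follows. This is only an illustration of the rule as stated in this comment, not code from either package; the class name, method name, and sample values are hypothetical.

```java
/** Hypothetical sketch of a best-iteration early stopping rule; not library code. */
final class BestIterationSketch {

  /**
   * Walks the metric history and returns the 0-based iteration at which training
   * would stop, i.e. the first iteration that is `rounds` or more iterations past
   * the best one seen so far. Returns -1 if the rule never triggers.
   */
  static int stopIteration(float[] metric, int rounds, boolean maximize) {
    int bestIter = 0;
    for (int iter = 1; iter < metric.length; iter++) {
      boolean improved = maximize
          ? metric[iter] > metric[bestIter]
          : metric[iter] < metric[bestIter];
      if (improved) {
        bestIter = iter; // new best value, restart the patience counter
      } else if (iter - bestIter >= rounds) {
        return iter; // no new best for `rounds` iterations
      }
    }
    return -1;
  }

  public static void main(String[] args) {
    // Made-up PR AUC values: the peak is at index 2, followed by a noisy decline.
    float[] prAuc = {0.50f, 0.60f, 0.70f, 0.65f, 0.66f, 0.62f, 0.63f, 0.60f};
    System.out.println(stopIteration(prAuc, 4, true)); // prints "6"
  }
}
```

On this sample series the rule stops at iteration 6, four iterations after the best value at index 2, regardless of the small upticks in between.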

CodingCat · 2019-01-08

Good catch. Are you interested in filing a PR or an issue?

mingyang · 2019-01-08

I don't really write Scala, so I created this issue instead.

CodingCat and others added 11 commits November 21, 2018 15:17

add back train method but mark as deprecated

d46d344

fix scalastyle error

57f8461

add back train method but mark as deprecated

af2fcba

fix scalastyle error

c44508f

add back train method but mark as deprecated

743de90

fix scalastyle error

0a06fa6

add back train method but mark as deprecated

b987b0d

fix scalastyle error

feba970

update version

d32a5f9

0.82

17db3ed

fix early stopping condition

19b6fb7

CodingCat mentioned this pull request Nov 21, 2018

[jvm-packages] Early Stopping Broken with xgboost4j-spark 0.81 #3927

Closed

remove unused

6d1cafd

yanboliang approved these changes Nov 24, 2018

Nan Zhu added 3 commits November 23, 2018 21:37

update comments

345daba

udpate comments

b437576

update test

fd08758

CodingCat merged commit 9c4ff50 into dmlc:master Nov 24, 2018

CodingCat deleted the fix_early_stopping branch November 24, 2018 08:18

mingyang reviewed Jan 8, 2019

hcho3 mentioned this pull request Mar 4, 2019

[RFC] Version 0.82 release candidate #4201

Merged

lock bot locked as resolved and limited conversation to collaborators Apr 8, 2019
