fix calculation of weighted gamma loss (fixes #4174) #4283

Merged · 8 commits merged into microsoft:master on May 21, 2021

Conversation

@mayer79 (Contributor) commented May 13, 2021

Fixes #4174

@mayer79 changed the title from "fixes https://github.com/microsoft/LightGBM/issues/4174" to "fixes issue #4174" on May 13, 2021
@jameslamb added the fix label on May 13, 2021
@jameslamb changed the title from "fixes issue #4174" to "fix calculation of weighted gamma loss (fixes #4174)" on May 13, 2021
@jameslamb (Collaborator) commented:

Thanks for this! I've edited the pull request title to be more descriptive. Pull request titles become changelog entries in this project (please see https://github.com/microsoft/LightGBM/releases/tag/v3.2.1).

@jameslamb (Collaborator) commented:

I don't think the CUDA build failures are related to these changes. All of the Dask tests are failing there with the following error:

>       from distributed.protocol.core import dumps_msgpack
E       ImportError: cannot import name 'dumps_msgpack' from 'distributed.protocol.core' (/root/miniconda/envs/test-env/lib/python3.7/site-packages/distributed/protocol/core.py)

I can look into this in a few hours. I suspect it's the result of a new version of a dependency being released; I'll check the dependencies later to confirm.

@lorentzenchr (Contributor) left a comment:


This fixed the bug. What about a unit test for this?

@@ -695,7 +695,7 @@ class RegressionGammaLoss : public RegressionPoissonLoss {
     } else {
       #pragma omp parallel for schedule(static)
       for (data_size_t i = 0; i < num_data_; ++i) {
-        gradients[i] = static_cast<score_t>(1.0 - label_[i] / std::exp(score[i]) * weights_[i]);
+        gradients[i] = static_cast<score_t>((1.0 - label_[i] / std::exp(score[i])) * weights_[i]);
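For context on why the parenthesization matters: in C++, / and * have equal precedence and associate left to right, so the original line scaled only label_[i] / std::exp(score[i]) by the weight rather than the whole gradient. A minimal standalone check, using made-up values purely for illustration:

    #include <cmath>
    #include <cstdio>

    int main() {
      // Hypothetical label, raw score, and weight values, chosen only to show the precedence difference.
      const double label = 2.0, score = 0.5, weight = 3.0;
      // Old code: due to left-to-right evaluation, only label / exp(score) is scaled by the weight.
      const double old_grad = 1.0 - label / std::exp(score) * weight;
      // Fixed code: the whole (1 - label / exp(score)) gradient is scaled by the weight.
      const double new_grad = (1.0 - label / std::exp(score)) * weight;
      std::printf("old: %f, fixed: %f\n", old_grad, new_grad);
      return 0;
    }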
Contributor:


Suggested change:
-        gradients[i] = static_cast<score_t>((1.0 - label_[i] / std::exp(score[i])) * weights_[i]);
+        gradients[i] = static_cast<score_t>((1.0 - label_[i] * std::exp(-score[i])) * weights_[i]);

Could be a tiny little bit faster.
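Not part of this PR, but a rough standalone sketch one could use to time the two forms (results will depend heavily on compiler flags and hardware; the array size and values here are arbitrary):

    #include <chrono>
    #include <cmath>
    #include <cstdio>
    #include <vector>

    // Compares y / exp(s) against y * exp(-s) over a large array. Compile with optimizations, e.g. -O2.
    int main() {
      const int n = 10000000;
      std::vector<double> label(n, 2.0), score(n, 0.5);

      auto time_it = [&](const char* name, auto&& body) {
        const auto start = std::chrono::steady_clock::now();
        double sink = 0.0;  // accumulate results so the loop is not optimized away
        for (int i = 0; i < n; ++i) sink += body(i);
        const auto stop = std::chrono::steady_clock::now();
        const auto ms = std::chrono::duration_cast<std::chrono::milliseconds>(stop - start).count();
        std::printf("%s: %lld ms (sink=%f)\n", name, static_cast<long long>(ms), sink);
      };

      time_it("division form      ", [&](int i) { return 1.0 - label[i] / std::exp(score[i]); });
      time_it("multiplication form", [&](int i) { return 1.0 - label[i] * std::exp(-score[i]); });
      return 0;
    }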

@mayer79 (Contributor, Author):


@lorentzenchr
Regarding timing: the same pattern (i.e. y/exp(p) instead of y*exp(-p)) appears multiple times in the same file. My suggestion would be to open a new issue/PR that changes this consistently, along with 1-2 timing experiments.

Regarding unit tests: I added some in a new R script. On the current master it fails; with the gamma weight fix it passes.

Contributor:


The exp call dominates the timing by far.

Collaborator:


I agree that we should make the formulas in this file consistent. If we want to change division into multiplication, we should modify all of these:

    if (weights_ == nullptr) {
      #pragma omp parallel for schedule(static)
      for (data_size_t i = 0; i < num_data_; ++i) {
        gradients[i] = static_cast<score_t>(1.0 - label_[i] / std::exp(score[i]));
        hessians[i] = static_cast<score_t>(label_[i] / std::exp(score[i]));
      }
    } else {
      #pragma omp parallel for schedule(static)
      for (data_size_t i = 0; i < num_data_; ++i) {
        gradients[i] = static_cast<score_t>(1.0 - label_[i] / std::exp(score[i]) * weights_[i]);
        hessians[i] = static_cast<score_t>(label_[i] / std::exp(score[i]) * weights_[i]);
      }
    }

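For illustration only, a sketch of what that block might look like with each division replaced by a single std::exp(-score[i]) per iteration; the actual change was made separately in #4289 and may differ in detail:

    if (weights_ == nullptr) {
      #pragma omp parallel for schedule(static)
      for (data_size_t i = 0; i < num_data_; ++i) {
        const double exp_neg_score = std::exp(-score[i]);  // reused by gradient and hessian
        gradients[i] = static_cast<score_t>(1.0 - label_[i] * exp_neg_score);
        hessians[i] = static_cast<score_t>(label_[i] * exp_neg_score);
      }
    } else {
      #pragma omp parallel for schedule(static)
      for (data_size_t i = 0; i < num_data_; ++i) {
        const double exp_neg_score = std::exp(-score[i]);
        gradients[i] = static_cast<score_t>((1.0 - label_[i] * exp_neg_score) * weights_[i]);
        hessians[i] = static_cast<score_t>(label_[i] * exp_neg_score * weights_[i]);
      }
    }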
Contributor:


Done in #4289.

@mayer79 mayer79 requested review from jameslamb and Laurae2 as code owners May 14, 2021 08:24
@jameslamb (Collaborator) commented:

Ok @mayer79, now that we've merged #4288, if you update to the latest master I think the failing checks should pass. Sorry for the inconvenience!

@jameslamb (Collaborator) left a comment:


Tests and changes look good to me! But another maintainer who's more familiar with gamma loss should probably review as well, so I'm just leaving a "comment" review.

@@ -0,0 +1,66 @@
context("Case weights are respected")
Collaborator:


This test looks good to me, thank you! This might be the first unit test we've added that covers weighted training at all in the R package, so really appreciate it.

It's ok with me for this to live in a new test file as you've set it up. test_basic.R (where most of the other lgb.train() tests live) has gotten kind of big, and there are already uses of lgb.train() outside of that file in their own files (such as https://github.com/microsoft/LightGBM/blob/c629cb0b6bd2830894b710cbd4d8241b82ac3105/R-package/tests/testthat/test_learning_to_rank.R or https://github.com/microsoft/LightGBM/blob/c629cb0b6bd2830894b710cbd4d8241b82ac3105/R-package/tests/testthat/test_custom_objective.R).

@lorentzenchr (Contributor) commented:

@shiyu1994 After having reviewed #4289, maybe you'd like to have another look here as it fixes an actual bug.

@jameslamb jameslamb mentioned this pull request May 20, 2021
@shiyu1994 (Collaborator) left a comment:


The changes LGTM.

@StrikerRUS StrikerRUS merged commit 4b1b412 into microsoft:master May 21, 2021
@github-actions (bot) commented:

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Aug 23, 2023
Successfully merging this pull request may close these issues.

Regression Gamma Loss Gradient calculation is incorrect with weight