Add decomposition for ONNXSoftmaxCrossEntropyLossOp #2968

Draft
srcarroll wants to merge 6 commits into main
Conversation

srcarroll
Contributor

No description provided.

Signed-off-by: Sam <srcarroll314@gmail.com>
@jenkins-droid
Collaborator

Can one of the admins verify this patch?

@srcarroll
Contributor Author

srcarroll commented Oct 7, 2024

I'm not really familiar with them, so I'll have to look into it, but should I add to test/backend/inference_backend.py, or do relevant tests for this op already exist (I only see shape inference tests)? Would such a test belong somewhere else?

@srcarroll
Contributor Author

srcarroll commented Oct 7, 2024

Also, I'm not considering the ignore_index attribute here and will leave it as a TODO. But I don't know of a good way to match-fail this case. Any suggestions?

Edit: I actually just realized it is an optional, not defaulted, attribute, so I know how to check for it. But if anyone has ideas on how to extend this to support it, I'm all ears. The simplest thing I can think of is to just subtract the slice at ignore_index from the result of ReduceSum.
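For reference, a minimal numpy sketch of the ignore_index semantics being discussed (assuming the usual masked-sum interpretation for the unweighted case; names and values are illustrative, not code from this patch):

```python
# Illustrative sketch (not from this patch): entries whose label equals
# ignore_index contribute neither to the loss sum nor to the mean's normalizer.
import numpy as np

ignore_index = -1
labels = np.array([2, 0, -1, 1])                    # one ignored entry
per_entry_loss = np.array([0.3, 1.2, 9.9, 0.5])     # value at the ignored slot is irrelevant

mask = (labels != ignore_index).astype(per_entry_loss.dtype)
loss_sum = np.sum(per_entry_loss * mask)            # masked ReduceSum
loss_mean = loss_sum / np.sum(mask)                 # mean over non-ignored entries
```

Masking before the sum gives the same result as subtracting the ignored entries' contribution from the full ReduceSum.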

Signed-off-by: Sam <srcarroll314@gmail.com>
@jenkins-droid
Collaborator

Can one of the admins verify this patch?

@AlexandreEichenberger
Collaborator

@jenkins-droid test this please

Signed-off-by: Sam <srcarroll314@gmail.com>
@jenkins-droid
Collaborator

Can one of the admins verify this patch?

@srcarroll
Contributor Author

srcarroll commented Oct 8, 2024

I think I misunderstood the mean reduction with weights case. Looking into it now (my recent "fix" commit is wrong).

@srcarroll
Contributor Author

> I think I misunderstood the mean reduction with weights case. Looking into it now (my recent "fix" commit is wrong).

So I was just summing over the original weights tensor, but I should have been summing over W[n][d1][d2]...[dk] = weights[labels[n][d1][d2]...[dk]]. The simplest way I can think of to produce W is onnx.Einsum(one_hot_labels, weights) {equation = "ij...,j->i..."} (or just a matmul if the input is 2D). Any objection to this, or any other suggestions? It probably wouldn't be very performant, as one_hot_labels is a sparse matrix, but the other solutions I can think of involve nasty indexing that I would like to avoid unless insisted upon.
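A minimal numpy sketch of that einsum-based gather (shapes and names are illustrative only, not code from this patch):

```python
# Illustrative sketch (not from this patch): recover
# W[n][d1] = weights[labels[n][d1]] from one-hot labels via einsum.
import numpy as np

N, C, D1 = 4, 3, 5                                        # batch, classes, one spatial dim
labels = np.random.randint(0, C, size=(N, D1))
weights = np.random.rand(C)

# one_hot_labels[n, c, d1] == 1 iff labels[n, d1] == c, i.e. the layout
# ONNX OneHot with axis=1 would produce.
one_hot_labels = np.moveaxis(np.eye(C)[labels], -1, 1)    # (N, C, D1)

# "ij...,j->i..." contracts the class axis against the weight vector.
W = np.einsum("ij...,j->i...", one_hot_labels, weights)   # (N, D1)

assert np.allclose(W, weights[labels])                    # same as a direct gather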

@AlexandreEichenberger
Collaborator

@jenkins-droid test this please

Signed-off-by: Sam <srcarroll314@gmail.com>
@jenkins-droid
Collaborator

Can one of the admins verify this patch?

@srcarroll
Contributor Author

> @jenkins-droid test this please

@AlexandreEichenberger you might want to cancel and rerun since I just pushed. Also, is there a rule of thumb for running tests? Since this is still in draft, I might push commits that shouldn't be tested yet. Maybe I just shouldn't have this drafted yet? Sorry if I'm not following some etiquette for this.

@jenkins-droid
Collaborator

Can one of the admins verify this patch?

@AlexandreEichenberger
Collaborator

@jenkins-droid test this please

@AlexandreEichenberger
Collaborator

@jenkins-droid test this please

@srcarroll
Contributor Author

@AlexandreEichenberger should I undraft this to get some feedback on the testing I asked about? I figured I should get feedback first, but I don't know of a good person to ping for this.
