Refactor onnx math #187

Merged · 4 commits · Oct 27, 2021
Conversation

JackSullivan (Member)

Description

As we add ONNXExportable to new models, a lot of logic for creating OnnxMl.TensorProto instances is duplicated. This PR adds (hopefully) fully general methods for generating TensorProtos and replaces the existing implementations with them.

@JackSullivan JackSullivan requested a review from Craigacp October 23, 2021 03:17
@Craigacp Craigacp (Member) left a comment:

That's a nice tidy up. I've got a few comments about the typing, and some suggestions that would make it easier (for me) to figure out what's going on.

@@ -307,7 +307,7 @@ private ONNXOperators(String value, int numInputs, int numOptionalInputs, int nu
         for (String o : outputs) {
             nodeBuilder.addOutput(o);
         }
-        nodeBuilder.setName(context.generateUniqueName(opName));
+        nodeBuilder.setName(context.generateUniqueName(opName) + ":" + outputs[0]);
Comment (Member):

ONNX names are required by the spec to be alphanumeric + underscores.
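To illustrate the constraint the reviewer is pointing at, here is a minimal sketch of a name sanitizer. `OnnxNameSanitizer` is a hypothetical helper, not Tribuo code; it simply rewrites any character outside the spec-allowed set before the name would be set on a node builder.

```java
// Hypothetical helper sketching the reviewer's point: ONNX graph/node names
// should contain only alphanumerics and underscores, so a separator such as
// ':' must be rewritten before it is written into the proto.
public final class OnnxNameSanitizer {
    private OnnxNameSanitizer() {}

    // Replaces every character outside [A-Za-z0-9_] with '_'.
    public static String sanitize(String name) {
        return name.replaceAll("[^A-Za-z0-9_]", "_");
    }

    public static void main(String[] args) {
        System.out.println(sanitize("Gemm_0:output"));
    }
}
```

With this in place, a name like `Gemm_0:output` becomes `Gemm_0_output` and stays within the spec's character set.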

    return OnnxMl.TensorProto.newBuilder()
            .setName(context.generateUniqueName(name))
            .setDataType(OnnxMl.TensorProto.DataType.FLOAT.getNumber())
            .addAllDims(() -> dims.stream().map(Integer::longValue).iterator())
Comment (Member):

I think I'd prefer this to just collect(Collectors.toList()) rather than making a lambda to fabricate the Iterable. Plus on Java 17 we can replace it with .toList(), which is shorter.

Comment (Member):

Or even just make it accept List<Long> as that's what ONNX is expecting, and the upcast on the calling side when creating the list might be done automatically?
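The two alternatives under discussion can be sketched side by side. `DimsConversion` is a stand-in class for illustration only; in Tribuo the consumer would be the proto builder's `addAllDims`, not a plain list.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Illustration of the two idioms in the review thread (hypothetical class).
public final class DimsConversion {
    private DimsConversion() {}

    // The idiom in the diff: wrap the stream in a lambda so that an
    // Iterable<Long>-accepting API can consume it lazily.
    public static Iterable<Long> asIterable(List<Integer> dims) {
        return () -> dims.stream().map(Integer::longValue).iterator();
    }

    // The reviewer's suggestion: materialise a List<Long> up front.
    // (On Java 17+, .collect(Collectors.toList()) can become .toList().)
    public static List<Long> asList(List<Integer> dims) {
        return dims.stream().map(Integer::longValue).collect(Collectors.toList());
    }

    public static void main(String[] args) {
        System.out.println(asList(Arrays.asList(3, 4)));
    }
}
```

Note that simply changing the parameter type to `List<Long>` would still require the caller to box `long` values when building the list; `int` elements do not widen automatically inside generics, which is likely why the reviewer phrases it as a question.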

        ? floatTensorBuilder(context, name, Collections.singletonList(parameters.length),
                fb -> Arrays.stream(parameters).forEachOrdered(d -> fb.put((float) d)))
        : doubleTensorBuilder(context, name, Collections.singletonList(parameters.length),
                db -> Arrays.stream(parameters).forEachOrdered(db::put));
Comment (Member):

No 5 line ternary operators. An if statement with multiple returns is fine.
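The shape the reviewer is asking for can be sketched like this. `TernaryRewrite` and its two builder methods are simplified stand-ins filling NIO buffers; in the real code both branches return an `OnnxMl.TensorProto`.

```java
import java.nio.Buffer;
import java.nio.DoubleBuffer;
import java.nio.FloatBuffer;

// Hypothetical sketch: the five-line conditional expression rewritten as an
// if statement with multiple returns, per the review comment.
public final class TernaryRewrite {
    private TernaryRewrite() {}

    // Stand-in for floatTensorBuilder: narrows each double to float.
    public static FloatBuffer floatTensor(double[] parameters) {
        FloatBuffer fb = FloatBuffer.allocate(parameters.length);
        for (double d : parameters) {
            fb.put((float) d);
        }
        return fb;
    }

    // Stand-in for doubleTensorBuilder: keeps full double precision.
    public static DoubleBuffer doubleTensor(double[] parameters) {
        return DoubleBuffer.wrap(parameters.clone());
    }

    // An if with two returns instead of a multi-line ternary.
    public static Buffer tensorFor(boolean singlePrecision, double[] parameters) {
        if (singlePrecision) {
            return floatTensor(parameters);
        }
        return doubleTensor(parameters);
    }
}
```

The behaviour is identical; the gain is purely readability, since each branch now reads as a plain statement rather than one arm of a large expression.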

*/
public static OnnxMl.TensorProto floatVectorBuilder(ONNXContext context, String name, SGDVector vector) {
    return floatTensorBuilder(context, name, Collections.singletonList(vector.size()),
            fb -> vector.forEach(vt -> fb.put(vt.index, (float) vt.value)));
Comment (Member):

I think it might be nicer to put the type on fb here. (FloatBuffer fb) -> ... means it's immediately obvious what's going on, otherwise you have to go look at floatTensorBuilder to figure out the inferred type. Ditto for the rest of the times this idiom appears in this file.
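A minimal illustration of the difference, using a hypothetical `fill` method in place of `floatTensorBuilder` (the callback-accepting shape is the same):

```java
import java.nio.FloatBuffer;
import java.util.function.Consumer;

// Hypothetical stand-in for the builder-with-callback idiom in the PR.
public final class TypedLambdaExample {
    private TypedLambdaExample() {}

    // Allocates a buffer and hands it to the caller's writer callback.
    public static FloatBuffer fill(int size, Consumer<FloatBuffer> writer) {
        FloatBuffer fb = FloatBuffer.allocate(size);
        writer.accept(fb);
        return fb;
    }

    public static void main(String[] args) {
        // Untyped: the reader must look at fill() to learn what fb is.
        fill(2, fb -> fb.put(0, 1.0f));
        // Reviewer's suggestion: spell the type out at the call site, so the
        // lambda is self-documenting.
        FloatBuffer out = fill(2, (FloatBuffer fb) -> fb.put(1, 2.0f));
        System.out.println(out.get(1));
    }
}
```

Both forms compile to the same thing; the explicit `(FloatBuffer fb)` just saves the reader a trip to the builder's signature.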

    DenseVector[] denseWeights = new DenseVector[weights.length];
    for (int i = 0; i < denseWeights.length; i++) {
        denseWeights[i] = weights[i].densify();
    }
Comment (Member):

I think denseWeights should be pulled out of this lambda and be a local variable. It feels like a lot of magic to do in the lambda. Or even just make it into a DenseSparseMatrix and pass that in to floatMatrixBuilder. It would be a little easier to see what's going on then.
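The hoisting refactor can be sketched as follows. `HoistDensify` and `densify` are placeholders standing in for Tribuo's `SparseVector.densify()`, since the real types are not available here; the point is only the structure: do the densification once, in a plainly named local, before any lambda is involved.

```java
import java.util.Arrays;

// Hypothetical sketch of the suggested refactor: densification is pulled out
// of the tensor-building lambda into an ordinary local variable.
public final class HoistDensify {
    private HoistDensify() {}

    // Placeholder for SparseVector.densify(); here it just copies the array.
    public static double[] densify(double[] sparse) {
        return sparse.clone();
    }

    // The loop from the diff, now a named helper producing a local value that
    // can be passed straight into a matrix builder.
    public static double[][] densifyAll(double[][] weights) {
        double[][] dense = new double[weights.length][];
        for (int i = 0; i < weights.length; i++) {
            dense[i] = densify(weights[i]);
        }
        return dense;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.deepToString(densifyAll(new double[][]{{1.0, 2.0}})));
    }
}
```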

@Craigacp (Member):

Could you rebase this PR to get rid of all the factorization machines commits it pulled in again?

@Craigacp Craigacp added Oracle employee This PR is from an Oracle employee squash-commits Squash the commits when merging this PR labels Oct 25, 2021
@Craigacp Craigacp (Member) left a comment:

LGTM.

@Craigacp Craigacp merged commit 0dea62a into oracle:main Oct 27, 2021