TensorPrimitives improvements in .NET 10.0 #93286
Tagging subscribers to this area: @dotnet/area-system-numerics-tensors

Concrete proposal to follow.
Could you please elaborate on the advantages of having these APIs in the BCL rather than in a specialized NuGet package (like numpy in Python)? This could provide a valuable perspective for further discussion.
It is a NuGet package today; it's currently not part of netcoreapp. If it were to be pulled into netcoreapp as well, it would be because we'd be using it from elsewhere in netcoreapp, e.g. from APIs like Enumerable.Average, BitArray.And, ManagedWebSocket.ApplyMask, etc., which we very well may do in the future (that has no impact on it continuing to be available as a NuGet package).
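To make the comment above concrete, here is a minimal sketch of consuming the package directly. This assumes a project referencing the `System.Numerics.Tensors` NuGet package (8.0 or later); `Add` and `Dot` are existing `TensorPrimitives` methods, while the variable names are just for illustration.

```csharp
using System;
using System.Numerics.Tensors;

class Demo
{
    static void Main()
    {
        float[] x = { 1f, 2f, 3f, 4f };
        float[] y = { 10f, 20f, 30f, 40f };
        float[] dest = new float[4];

        // Vectorized element-wise addition: dest[i] = x[i] + y[i]
        TensorPrimitives.Add(x, y, dest);
        Console.WriteLine(string.Join(", ", dest)); // 11, 22, 33, 44

        // Vectorized dot product: 1*10 + 2*20 + 3*30 + 4*40
        float dot = TensorPrimitives.Dot(x, y);
        Console.WriteLine(dot); // 300
    }
}
```

The same span-based calls work whether the package is consumed standalone or, hypothetically, from inside netcoreapp as described above.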
Hey @stephentoub, would it be possible to expose the low-level parts of the API instead of only providing Span versions? e.g.

```csharp
public static Vector128<float> Log2(Vector128<float> value);
public static Vector256<float> Log2(Vector256<float> value);
public static Vector512<float> Log2(Vector512<float> value);
// ...etc.
```

I did that in a prototype for a similar API and it's working great. These APIs can then be used for other kinds of custom Span batching (not related to tensors), where the packing of the vector is different (e.g. 4x float chunked).
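To illustrate the "custom Span batching" idea from the comment above: given a per-vector `Log2`, a caller can drive their own loop over any layout. The `Log2` helper below is a hypothetical scalar stand-in for the proposed intrinsic (the real API would be properly vectorized); `Vector128.Create(ReadOnlySpan<T>)` and `CopyTo` are existing .NET APIs.

```csharp
using System;
using System.Runtime.Intrinsics;

static class CustomBatch
{
    // Hypothetical stand-in for the proposed Vector128-level Log2;
    // a real implementation would use SIMD instructions, not scalar math.
    static Vector128<float> Log2(Vector128<float> v) =>
        Vector128.Create(
            MathF.Log2(v.GetElement(0)), MathF.Log2(v.GetElement(1)),
            MathF.Log2(v.GetElement(2)), MathF.Log2(v.GetElement(3)));

    public static void Log2All(ReadOnlySpan<float> src, Span<float> dst)
    {
        int i = 0;
        // Main loop: process one Vector128<float> (4 floats) at a time.
        for (; i <= src.Length - Vector128<float>.Count; i += Vector128<float>.Count)
        {
            Vector128<float> v = Vector128.Create(src.Slice(i, Vector128<float>.Count));
            Log2(v).CopyTo(dst.Slice(i));
        }
        // Scalar tail for any remaining elements.
        for (; i < src.Length; i++)
            dst[i] = MathF.Log2(src[i]);
    }
}
```

The point is that the batching strategy (stride, layout, tail handling) stays in the caller's hands, which span-only APIs don't allow.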
Yes, but it needs to be its own proposal, cover all 5 vector types (Vector, Vector64/128/256/512), and consider whether it's applicable to Vector2/3/4 as well.
Cool, I will try to write something. |
Follow-up, created the proposal #93513 |
If brought into the BCL, wouldn't it make sense to rename it?
@msedi, that would be a breaking change. Additionally, the intent is to expand it to the full set of BLAS support, so Tensor is a very apt and appropriate name that was already scrutinized, reviewed, and approved by API review.
@tannergooding: Sure, you're right; I was just under the impression that there could be something more primitive. The tensor is, let's say, something higher level, whereas the vector/array methods are on a lower level. But I'm completely fine with it as long as I know where to find it. BTW, looking at the code and the effort behind TensorPrimitives: are there any efforts toward the JIT some day doing the SIMD vectorization for us?
The JIT is unlikely to get auto-vectorization in the near future, as such support is complex and quite expensive to do. Additionally, outside of particular domains, such support does not often light up, and it has a measurable impact on real-world apps even less frequently. Especially for small workloads it can often have the opposite effect and slow down your code. In the domains where it does light up, and particularly where it would be beneficial, you are often going to get better perf by writing your own SIMD code directly. It is therefore my opinion that our efforts are better spent providing APIs from the BCL that provide this acceleration for you, such as all the APIs on TensorPrimitives.
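The trade-off described above can be seen side by side: a hand-written `Vector<T>` loop that you must get right (main loop, horizontal reduction, scalar tail) versus a single BCL call that does the same work. This is an illustrative sketch, not code from the thread; it assumes the `System.Numerics.Tensors` package for the `TensorPrimitives.Sum` call.

```csharp
using System;
using System.Numerics;
using System.Numerics.Tensors;

static class SumDemo
{
    // Hand-rolled SIMD sum: the kind of code you'd otherwise write yourself.
    static float ManualSum(ReadOnlySpan<float> span)
    {
        var acc = Vector<float>.Zero;
        int i = 0;
        for (; i <= span.Length - Vector<float>.Count; i += Vector<float>.Count)
            acc += new Vector<float>(span.Slice(i));   // widen-and-accumulate
        float sum = Vector.Sum(acc);                   // horizontal reduction
        for (; i < span.Length; i++)                   // scalar tail
            sum += span[i];
        return sum;
    }

    static void Main()
    {
        float[] data = { 1, 2, 3, 4, 5, 6, 7, 8, 9 };
        Console.WriteLine(ManualSum(data));            // 45
        Console.WriteLine(TensorPrimitives.Sum(data)); // 45, with no SIMD code to maintain
    }
}
```

Both produce the same result; the BCL version also picks the widest instruction set available at runtime, which the manual `Vector<T>` loop only partially does.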
@tannergooding: Thanks for the info, that makes sense. For our case we wrote source generators to generate all the array primitives, currently with Vector, but I wanted to benchmark against your implementations. I assume yours is better ;-)
Remaining work is for .NET 10.
Regardless of any additional types we may want to add to `System.Numerics.Tensors`, we would like to expand the set of APIs exposed on the `TensorPrimitives` static class in a few ways (beyond the work done in .NET 8 in #92219):

- […] `ConvertXx`, `CosineSimilarity`, `IndexOfMin`, `IndexOfMax`, `IndexOfMinMagnitude`, `IndexOfMaxMagnitude`
- Add to `TensorPrimitives` the missing functionality from the `CpuMath` class from ML.NET, e.g. `Add` (with indices), `AddScale` (with indices), `DotProductSparse`, `MatrixTimesSource`, `ScaleAdd` improvement via `AddMultiply` or `MultipleAdd` overloads, `SdcaL1UpdateDense`, `SdcaL1UpdateSparse`, and `ZeroMatrixItems` (might exist in System.Memory)
- […] `main` after all of the alignment […] `Min`, `Max`, `MinMagnitude`, `MaxMagnitude` with relation to NaN handling
- […] `0` if we want to throw or return `NaN` (we consistently throw today when non-0 is required; ML.NET apparently returns 0?) - @tannergooding
- […] `Math{F}` APIs that don't currently have representation on `TensorPrimitives`, e.g. `CopySign`, `Reciprocal{Sqrt}{Estimate}`, `Sqrt`, `Ceiling`, `Floor`, `Truncate`, `Log10`, `Log(x, y)` (with y as both span and scalar), `Pow(x, y)` (with y as both span and scalar), `Cbrt`, `IEEERemainder`, `Acos`, `Acosh`, `Cos`, `Asin`, `Asinh`, `Sin`, `Atan`. This unmerged commit has a sketch, but it's out-of-date with improvements that have been made to the library since, and all of the operations should be vectorized.
- […] `TensorPrimitives`, e.g. `BitwiseAnd`, `BitwiseOr`, `BitwiseXor`, `Exp10`, `Exp10M1`, `Exp2`, `Exp2M1`, `ExpM1`, `Atan2`, `Atan2Pi`, `ILogB`, `Lerp`, `ScaleB`, `Round`, `Log10P1`, `Log2P1`, `LogP1`, `Hypot`, `RootN`, `AcosPi`, `AsinPi`, `AtanPi`, `CosPi`, `SinPi`, `TanPi`
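Some of the APIs named in the list above have since shipped in the package, so their intended shape can be shown concretely. This sketch assumes a `System.Numerics.Tensors` version that includes `CosineSimilarity` (available since 8.0) and `IndexOfMax` (added in a later release); the data values are just for illustration.

```csharp
using System;
using System.Numerics.Tensors;

class ProposalDemo
{
    static void Main()
    {
        // CosineSimilarity: 1 for vectors pointing in the same direction.
        float[] a = { 1f, 0f, 0f };
        float[] b = { 1f, 0f, 0f };
        Console.WriteLine(TensorPrimitives.CosineSimilarity(a, b)); // 1

        // IndexOfMax: index of the largest element, vectorized.
        float[] scores = { 0.1f, 0.9f, 0.3f };
        Console.WriteLine(TensorPrimitives.IndexOfMax(scores));     // 1
    }
}
```

Both operate on spans, consistent with the rest of the `TensorPrimitives` surface area discussed in this issue.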
We plan to update the System.Numerics.Tensors package alongside .NET 8 servicing releases. When there are bug fixes and performance improvements only, the patch number part of the version will be incremented. When there are new APIs added, the minor version will be bumped. For guidance on how we bump minor/major package versions, see this example.