
Handling broadcasts #31

Closed
MikeInnes opened this issue May 12, 2017 · 17 comments

@MikeInnes
Member

MikeInnes commented May 12, 2017

Using broadcasting operators in 0.6 gives deprecation warnings, and soon won't work at all once the .+ etc. function objects are removed. We also need a more general way to handle arbitrary f.(xs) applications.

I suggest that DataFlow lowers any broadcast f.(xs...) to Broadcast(f)(xs...), where Broadcast(f) is simply a wrapper around f. Calls to the wrapper can be overloaded as appropriate, both in Julia code and in conversions to backends, and can be made to generate dot calls again when lowered back to syntax.
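
For illustration, a minimal sketch of what such a wrapper might look like (the names here are hypothetical, not DataFlow's actual implementation; BroadcastCall is used instead of Broadcast only to avoid clashing with Base's Broadcast module):

struct BroadcastCall{F}
    f::F
end

# By default, calling the wrapper is just an ordinary broadcast of the wrapped function...
(b::BroadcastCall)(xs...) = broadcast(b.f, xs...)

# ...but a backend can overload calls to it, e.g. to emit a single vectorised
# library op instead of mapping a Julia function over elements.

BroadcastCall(+)([1, 2], [3, 4])  # => [4, 6]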

DataFlow now just creates explicit broadcast calls as part of desugaring.

@staticfloat
Contributor

This is an unusually annoying depwarn, because it seems to be triggered on every Flux call. :)

@MikeInnes
Member Author

Yup. I've just been using --depwarn=no (as per usual) but should probably sort this out ASAP.

@stevengj

You should just stop overloading .+ etcetera in 0.6.

@MikeInnes
Member Author

That's not a reasonable requirement in any case where we can't broadcast arbitrary Julia functions. As it turns out, there are quite a lot of cases like that, including TensorFlow, MXNet, and many of the GPU libraries. This is a real use case, and it's unfortunate that the discussions in Base didn't take it into account at all.

@stevengj

Why can't you broadcast arbitrary Julia functions?

@MikeInnes
Member Author

Many libraries that provide an array abstraction do so in a numpy-like fashion – you get a set of "vectorised" operations like +, *, etc., but anything that accesses individual elements either breaks the abstraction barrier or is unusably slow.

@stevengj

stevengj commented May 25, 2017

If you only support a small set of operators on your data, there are plenty of binary operators to choose from that you can define. You don't have to use .+. Dot operators now carry with them an expectation of fusion and support for in-place operations like x .+= foo.(x.^2) .- 3 without temporary arrays.
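
To make the fusion point concrete, here is roughly what that expression corresponds to after fusion (a sketch only; foo is a placeholder function, and the exact lowered form in 0.6+ differs in detail):

foo(y) = 2y                       # placeholder definition
x = [1.0, 2.0, 3.0]

# x .+= foo.(x.^2) .- 3 fuses into (roughly) a single in-place broadcast, so the
# whole right-hand side is computed element-by-element with no temporary arrays:
broadcast!(xi -> xi + (foo(xi^2) - 3), x, x)
# x is now [0.0, 7.0, 18.0]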

In the longer term, the whole Matlab/numpy-like style, where only certain vectorized operations are fast (at the cost of lots of temporary arrays), kind of defeats the point of Julia.

@stevengj

I also don't see how that applies to Flux and DataFlow, which are pure-Julia packages as far as I can tell.

@MikeInnes
Member Author

MikeInnes commented May 25, 2017

In the long term, yes, I'd love to have this stuff all implemented in Julia and compile GPU code on the fly etc. But that isn't going to happen immediately, so interop with existing libraries is the only reasonable option right now.

Can you elaborate on how, say, broadcasting +, sin, etc. should be written over GPU arrays, if not with .+ and sin.(...)? (Bearing in mind we want to be generic over array types.)

I expect it would be possible to implement broadcasting syntax in a trait-like way in which the container can choose whether to fuse, which would solve the problem for us.

@oxinabox
Member

I also don't see how that applies to Flux and DataFlow, which are pure-Julia packages as far as I can tell.

Flux has a lazy dependency on TensorFlow and/or MXNet.

@stevengj

stevengj commented May 25, 2017

TensorFlow allows you to define efficient custom operations in C++, and it's also possible in MXNet; why couldn't you do that from Julia?

Anyway, basically .+ now means a fusing broadcast in Julia, so if you want something that is not a broadcast call you should use a different symbol or function name. (I'd like to update Julia so that you can use e.g. +′ as an operator.)

@MikeInnes
Member Author

It's technically possible; it's just a big project, given that we need robust GPU compilation among other things. The right solution to this is not "wait until 2025".

The thing is, I do want broadcast. The semantics are all the same, and changing the user API (especially for something so common) for an implementation detail is not reasonable.

Not being able to write generic code that works over a range of implementation strategies kind of defeats the point of Julia.

@stevengj

stevengj commented May 25, 2017

Not having fusion for user-defined container types and operations in Julia would be a much bigger sacrifice than saying that you need to rename if you explicitly want non-fusing operations.

@MikeInnes
Member Author

I'm not arguing we should trade one for the other, but I'm repeating myself now.

I expect it would be possible to implement broadcasting syntax in a trait-like way in which the container can choose whether to fuse, which would solve the problem for us.

@stevengj

stevengj commented May 25, 2017

I expect it would be possible to implement broadcasting syntax in a trait-like way in which the container can choose whether to fuse, which would solve the problem for us.

Nope, because fusion happens at a syntactic level (at lowering time), before types are known.

Changing fusion to a compile-time optimization that depends on inference is a complete redesign (and would also result in semantics that depend on inference). It's something that's been tried many times in many languages and has always failed to achieve genericity for user-defined types and functions. That is a "wait until 2050" solution.

@MikeInnes
Member Author

if should_fuse(x, y)
  broadcast((x, y) -> x + y, x, y)
else
  broadcast(+, x, y)
end

This is still a syntactic transformation that doesn't depend on inference. should_fuse can default to true and be compiled away in the base case (just like promotion rules), leaving you with code identical to the current output. But overriding it for GPUArray etc. would solve our problem.
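
As a sketch, here is how that lowering could look for a two-operator expression like x .+ y .* z (should_fuse is the hypothetical trait from above, not an existing Base function):

should_fuse(xs...) = true   # default: fuse; compiled away like promotion rules

# Hypothetical lowering of `x .+ y .* z` under this scheme:
function lowered_add_mul(x, y, z)
    if should_fuse(x, y, z)
        # fused: one pass over the data, no temporaries
        broadcast((a, b, c) -> a + b * c, x, y, z)
    else
        # unfused: one call per operator, which a TensorFlow/MXNet/GPU array
        # type can map onto its own vectorised kernels
        broadcast(+, x, broadcast(*, y, z))
    end
end

lowered_add_mul([1, 2], [3, 4], [5, 6])  # => [16, 26]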

@stevengj

I see, yes, that would be possible.
