`generated function body is not pure` when using dot notation #21094

gasagna · 2017-03-19T21:15:40Z

I have a generated function in which I rely on Julia's new dot notation for "vectorised" operations on arrays. This results in an error shown below. The minimal working example is

versioninfo()

@generated function foo(out, x)
    quote
        out .= x .+ x
    end
end

foo([0.0], [1.0])

The output is

Julia Version 0.6.0-pre.alpha.146
Commit c399558* (2017-03-15 15:20 UTC)
Platform Info:
  OS: macOS (x86_64-apple-darwin15.6.0)
  CPU: Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
  WORD_SIZE: 64
  BLAS: libopenblas (USE64BITINT DYNAMIC_ARCH NO_AFFINITY Haswell)
  LAPACK: libopenblas64_
  LIBM: libopenlibm
  LLVM: libLLVM-3.9.1 (ORCJIT, haswell)
ERROR: LoadError: generated function body is not pure. this likely means it contains a closure or comprehension.
Stacktrace:
 [1] include_from_node1(::String) at ./loading.jl:539
 [2] include(::String) at ./sysimg.jl:14
 [3] process_options(::Base.JLOptions) at ./client.jl:305
 [4] _start() at ./client.jl:371
while loading /Users/davide/Codes/IMEXRK.jl/src/deleteme.jl, in expression starting on line 10
[Finished in 2.1s with exit code 1]
[cmd: ['/Users/davide/Software/julia-0.6/usr/bin/julia', '/Users/davide/Codes/IMEXRK.jl/src/deleteme.jl']]
[dir: /Users/davide/Codes/IMEXRK.jl/src]
[path: /usr/bin:/bin:/usr/sbin:/sbin]

The only reference I could find similar to what I get is #19299. This issue seems different to me tough.

The text was updated successfully, but these errors were encountered:

yuyichao · 2017-03-19T21:38:14Z

This is the same issue. The broadcast syntax generates anonymous function. I think we should remove this limitation at some point but for now this should be closed as a dup.

gasagna · 2017-03-19T21:49:03Z

Since #19299 was marked as a won't fix, I reckon it will never be possible to use the dot syntax in generated functions, which is a pity as the syntax is extremely convenient. Can that issue be reopened or reconsidered to keep track of this?

gasagna · 2017-06-30T17:26:39Z

Is there a time scale for this issue to be somehow solved, 0.7, 1.0? I find the dot broadcast notation very useful to express complex arithmetic operations on generic types, with in place operations. This issue however prevents me from using it in generated functions.

Any thoughts on how one would rewrite the original MWE with something that can be currently implemented? Thanks

yuyichao · 2017-06-30T18:01:27Z

Move the expression to a normal function.

gasagna · 2017-06-30T19:20:06Z

Not the best of the solutions as I need to abandon the generated function.

yuyichao · 2017-06-30T20:38:51Z

You don't. Just put that part in the normal function and call that function in the generated function.

gasagna · 2017-07-02T00:43:24Z

I have several expressions that would use the dot notation in my generated function. Moving all of those in different functions is not feasible, and not very transparent.

I have somehow solved my issue by creating a macro @over_i that generate a loop so that the example in the original post becomes:

@generated function foo(out::T, x::T)
    quote
       @over_i out[i] = x[i] + x[i]
    end
end

and type T needs to implement a basic indexing behaviour.

fix #21094 fix #22060 fix #22053 replaces #22063

marius311 · 2017-12-09T23:16:42Z

@gasagna your solution is arguably already more readable, but if you really wanted to keep the loop fusion but not move the function definition in another place, you can always do stuff like,

@generated function foo(out, x)
    quote
        $((out,x)->(out .= x .+ x))(out,x)
    end
end

gasagna · 2017-12-10T21:28:30Z

Thanks @marius311 but I hope #23692 will land for v0.7!

Among other things, this supports returning AbstractRanges for appropriate inputs. Fixes #21094, fixes #22053

@lower

This patch represents the combined efforts of four individuals, over 60 commits, and an iterated design over (at least) three pull requests that spanned nearly an entire year (closes #22063, #23692, #25377 by superceding them). This introduces a pure Julia data structure that represents a fused broadcast expression. For example, the expression `2 .* (x .+ 1)` lowers to: ```julia julia> Meta.@lower 2 .* (x .+ 1) :($(Expr(:thunk, CodeInfo(:(begin Core.SSAValue(0) = (Base.getproperty)(Base.Broadcast, :materialize) Core.SSAValue(1) = (Base.getproperty)(Base.Broadcast, :make) Core.SSAValue(2) = (Base.getproperty)(Base.Broadcast, :make) Core.SSAValue(3) = (Core.SSAValue(2))(+, x, 1) Core.SSAValue(4) = (Core.SSAValue(1))(*, 2, Core.SSAValue(3)) Core.SSAValue(5) = (Core.SSAValue(0))(Core.SSAValue(4)) return Core.SSAValue(5) end))))) ``` Or, slightly more readably as: ```julia using .Broadcast: materialize, make materialize(make(*, 2, make(+, x, 1))) ``` The `Broadcast.make` function serves two purposes. Its primary purpose is to construct the `Broadcast.Broadcasted` objects that hold onto the function, the tuple of arguments (potentially including nested `Broadcasted` arguments), and sometimes a set of `axes` to include knowledge of the outer shape. The secondary purpose, however, is to allow an "out" for objects that _don't_ want to participate in fusion. For example, if `x` is a range in the above `2 .* (x .+ 1)` expression, it needn't allocate an array and operate elementwise — it can just compute and return a new range. Thus custom structures are able to specialize `Broadcast.make(f, args...)` just as they'd specialize on `f` normally to return an immediate result. `Broadcast.materialize` is identity for everything _except_ `Broadcasted` objects for which it allocates an appropriate result and computes the broadcast. It does two things: it `initialize`s the outermost `Broadcasted` object to compute its axes and then `copy`s it. Similarly, an in-place fused broadcast like `y .= 2 .* (x .+ 1)` uses the exact same expression tree to compute the right-hand side of the expression as above, and then uses `materialize!(y, make(*, 2, make(+, x, 1)))` to `instantiate` the `Broadcasted` expression tree and then `copyto!` it into the given destination. All-together, this forms a complete API for custom types to extend and customize the behavior of broadcast (fixes #22060). It uses the existing `BroadcastStyle`s throughout to simplify dispatch on many arguments: * Custom types can opt-out of broadcast fusion by specializing `Broadcast.make(f, args...)` or `Broadcast.make(::BroadcastStyle, f, args...)`. * The `Broadcasted` object computes and stores the type of the combined `BroadcastStyle` of its arguments as its first type parameter, allowing for easy dispatch and specialization. * Custom Broadcast storage is still allocated via `broadcast_similar`, however instead of passing just a function as a first argument, the entire `Broadcasted` object is passed as a final argument. This potentially allows for much more runtime specialization dependent upon the exact expression given. * Custom broadcast implmentations for a `CustomStyle` are defined by specializing `copy(bc::Broadcasted{CustomStyle})` or `copyto!(dest::AbstractArray, bc::Broadcasted{CustomStyle})`. * Fallback broadcast specializations for a given output object of type `Dest` (for the `DefaultArrayStyle` or another such style that hasn't implemented assignments into such an object) are defined by specializing `copyto(dest::Dest, bc::Broadcasted{Nothing})`. As it fully supports range broadcasting, this now deprecates `(1:5) + 2` to `.+`, just as had been done for all `AbstractArray`s in general. As a first-mover proof of concept, LinearAlgebra uses this new system to improve broadcasting over structured arrays. Before, broadcasting over a structured matrix would result in a sparse array. Now, broadcasting over a structured matrix will _either_ return an appropriately structured matrix _or_ a dense array. This does incur a type instability (in the form of a discriminated union) in some situations, but thanks to type-based introspection of the `Broadcasted` wrapper commonly used functions can be special cased to be type stable. For example: ```julia julia> f(d) = round.(Int, d) f (generic function with 1 method) julia> @inferred f(Diagonal(rand(3))) 3×3 Diagonal{Int64,Array{Int64,1}}: 0 ⋅ ⋅ ⋅ 0 ⋅ ⋅ ⋅ 1 julia> @inferred Diagonal(rand(3)) .* 3 ERROR: return type Diagonal{Float64,Array{Float64,1}} does not match inferred return type Union{Array{Float64,2}, Diagonal{Float64,Array{Float64,1}}} Stacktrace: [1] error(::String) at ./error.jl:33 [2] top-level scope julia> @inferred Diagonal(1:4) .+ Bidiagonal(rand(4), rand(3), 'U') .* Tridiagonal(1:3, 1:4, 1:3) 4×4 Tridiagonal{Float64,Array{Float64,1}}: 1.30771 0.838589 ⋅ ⋅ 0.0 3.89109 0.0459757 ⋅ ⋅ 0.0 4.48033 2.51508 ⋅ ⋅ 0.0 6.23739 ``` In addition to the issues referenced above, it fixes: * Fixes #19313, #22053, #23445, and #24586: Literals are no longer treated specially in a fused broadcast; they're just arguments in a `Broadcasted` object like everything else. * Fixes #21094: Since broadcasting is now represented by a pure Julia datastructure it can be created within `@generated` functions and serialized. * Fixes #26097: The fallback destination-array specialization method of `copyto!` is specifically implemented as `Broadcasted{Nothing}` and will not be confused by `nothing` arguments. * Fixes the broadcast-specific element of #25499: The default base broadcast implementation no longer depends upon `Base._return_type` to allocate its array (except in the empty or concretely-type cases). Note that the sparse implementation (#19595) is still dependent upon inference and is _not_ fixed. * Fixes #25340: Functions are treated like normal values just like arguments and only evaluated once. * Fixes #22255, and is performant with 12+ fused broadcasts. Okay, that one was fixed on master already, but this fixes it now, too. * Fixes #25521. * The performance of this patch has been thoroughly tested through its iterative development process in #25377. There remain [two classes of performance regressions](#25377) that Nanosoldier flagged. * #25691: Propagation of constant literals sill lose their constant-ness upon going through the broadcast machinery. I believe quite a large number of functions would need to be marked as `@pure` to support this -- including functions that are intended to be specialized. (For bookkeeping, this is the squashed version of the [teh-jn/lazydotfuse](#25377) branch as of a1d4e7e. Squashed and separated out to make it easier to review and commit) Co-authored-by: Tim Holy <tim.holy@gmail.com> Co-authored-by: Jameson Nash <vtjnash@gmail.com> Co-authored-by: Andrew Keller <ajkeller34@users.noreply.github.com>

@lower

This patch represents the combined efforts of four individuals, over 60 commits, and an iterated design over (at least) three pull requests that spanned nearly an entire year (closes #22063, #23692, #25377 by superceding them). This introduces a pure Julia data structure that represents a fused broadcast expression. For example, the expression `2 .* (x .+ 1)` lowers to: ```julia julia> Meta.@lower 2 .* (x .+ 1) :($(Expr(:thunk, CodeInfo(:(begin Core.SSAValue(0) = (Base.getproperty)(Base.Broadcast, :materialize) Core.SSAValue(1) = (Base.getproperty)(Base.Broadcast, :make) Core.SSAValue(2) = (Base.getproperty)(Base.Broadcast, :make) Core.SSAValue(3) = (Core.SSAValue(2))(+, x, 1) Core.SSAValue(4) = (Core.SSAValue(1))(*, 2, Core.SSAValue(3)) Core.SSAValue(5) = (Core.SSAValue(0))(Core.SSAValue(4)) return Core.SSAValue(5) end))))) ``` Or, slightly more readably as: ```julia using .Broadcast: materialize, make materialize(make(*, 2, make(+, x, 1))) ``` The `Broadcast.make` function serves two purposes. Its primary purpose is to construct the `Broadcast.Broadcasted` objects that hold onto the function, the tuple of arguments (potentially including nested `Broadcasted` arguments), and sometimes a set of `axes` to include knowledge of the outer shape. The secondary purpose, however, is to allow an "out" for objects that _don't_ want to participate in fusion. For example, if `x` is a range in the above `2 .* (x .+ 1)` expression, it needn't allocate an array and operate elementwise — it can just compute and return a new range. Thus custom structures are able to specialize `Broadcast.make(f, args...)` just as they'd specialize on `f` normally to return an immediate result. `Broadcast.materialize` is identity for everything _except_ `Broadcasted` objects for which it allocates an appropriate result and computes the broadcast. It does two things: it `initialize`s the outermost `Broadcasted` object to compute its axes and then `copy`s it. Similarly, an in-place fused broadcast like `y .= 2 .* (x .+ 1)` uses the exact same expression tree to compute the right-hand side of the expression as above, and then uses `materialize!(y, make(*, 2, make(+, x, 1)))` to `instantiate` the `Broadcasted` expression tree and then `copyto!` it into the given destination. All-together, this forms a complete API for custom types to extend and customize the behavior of broadcast (fixes #22060). It uses the existing `BroadcastStyle`s throughout to simplify dispatch on many arguments: * Custom types can opt-out of broadcast fusion by specializing `Broadcast.make(f, args...)` or `Broadcast.make(::BroadcastStyle, f, args...)`. * The `Broadcasted` object computes and stores the type of the combined `BroadcastStyle` of its arguments as its first type parameter, allowing for easy dispatch and specialization. * Custom Broadcast storage is still allocated via `broadcast_similar`, however instead of passing just a function as a first argument, the entire `Broadcasted` object is passed as a final argument. This potentially allows for much more runtime specialization dependent upon the exact expression given. * Custom broadcast implmentations for a `CustomStyle` are defined by specializing `copy(bc::Broadcasted{CustomStyle})` or `copyto!(dest::AbstractArray, bc::Broadcasted{CustomStyle})`. * Fallback broadcast specializations for a given output object of type `Dest` (for the `DefaultArrayStyle` or another such style that hasn't implemented assignments into such an object) are defined by specializing `copyto(dest::Dest, bc::Broadcasted{Nothing})`. As it fully supports range broadcasting, this now deprecates `(1:5) + 2` to `.+`, just as had been done for all `AbstractArray`s in general. As a first-mover proof of concept, LinearAlgebra uses this new system to improve broadcasting over structured arrays. Before, broadcasting over a structured matrix would result in a sparse array. Now, broadcasting over a structured matrix will _either_ return an appropriately structured matrix _or_ a dense array. This does incur a type instability (in the form of a discriminated union) in some situations, but thanks to type-based introspection of the `Broadcasted` wrapper commonly used functions can be special cased to be type stable. For example: ```julia julia> f(d) = round.(Int, d) f (generic function with 1 method) julia> @inferred f(Diagonal(rand(3))) 3×3 Diagonal{Int64,Array{Int64,1}}: 0 ⋅ ⋅ ⋅ 0 ⋅ ⋅ ⋅ 1 julia> @inferred Diagonal(rand(3)) .* 3 ERROR: return type Diagonal{Float64,Array{Float64,1}} does not match inferred return type Union{Array{Float64,2}, Diagonal{Float64,Array{Float64,1}}} Stacktrace: [1] error(::String) at ./error.jl:33 [2] top-level scope julia> @inferred Diagonal(1:4) .+ Bidiagonal(rand(4), rand(3), 'U') .* Tridiagonal(1:3, 1:4, 1:3) 4×4 Tridiagonal{Float64,Array{Float64,1}}: 1.30771 0.838589 ⋅ ⋅ 0.0 3.89109 0.0459757 ⋅ ⋅ 0.0 4.48033 2.51508 ⋅ ⋅ 0.0 6.23739 ``` In addition to the issues referenced above, it fixes: * Fixes #19313, #22053, #23445, and #24586: Literals are no longer treated specially in a fused broadcast; they're just arguments in a `Broadcasted` object like everything else. * Fixes #21094: Since broadcasting is now represented by a pure Julia datastructure it can be created within `@generated` functions and serialized. * Fixes #26097: The fallback destination-array specialization method of `copyto!` is specifically implemented as `Broadcasted{Nothing}` and will not be confused by `nothing` arguments. * Fixes the broadcast-specific element of #25499: The default base broadcast implementation no longer depends upon `Base._return_type` to allocate its array (except in the empty or concretely-type cases). Note that the sparse implementation (#19595) is still dependent upon inference and is _not_ fixed. * Fixes #25340: Functions are treated like normal values just like arguments and only evaluated once. * Fixes #22255, and is performant with 12+ fused broadcasts. Okay, that one was fixed on master already, but this fixes it now, too. * Fixes #25521. * The performance of this patch has been thoroughly tested through its iterative development process in #25377. There remain [two classes of performance regressions](#25377) that Nanosoldier flagged. * #25691: Propagation of constant literals sill lose their constant-ness upon going through the broadcast machinery. I believe quite a large number of functions would need to be marked as `@pure` to support this -- including functions that are intended to be specialized. (For bookkeeping, this is the squashed version of the [teh-jn/lazydotfuse](#25377) branch as of a1d4e7e. Squashed and separated out to make it easier to review and commit) Co-authored-by: Tim Holy <tim.holy@gmail.com> Co-authored-by: Jameson Nash <vtjnash@gmail.com> Co-authored-by: Andrew Keller <ajkeller34@users.noreply.github.com>

yuyichao closed this as completed Mar 19, 2017

mbauman mentioned this issue Mar 19, 2017

"generated function body is not pure" being too greedy? #19299

Closed

vtjnash reopened this Apr 20, 2017

vtjnash added a commit that referenced this issue Sep 12, 2017

introduce runtime representation of broadcast fusion

a13aacd

fix #21094 fix #22060 fix #22053 replaces #22063

vtjnash added a commit that referenced this issue Sep 12, 2017

introduce runtime representation of broadcast fusion

bfd0a3e

fix #21094 fix #22060 fix #22053 replaces #22063

vtjnash added a commit that referenced this issue Sep 13, 2017

introduce runtime representation of broadcast fusion

535fefa

fix #21094 fix #22060 fix #22053 replaces #22063

vtjnash mentioned this issue Sep 13, 2017

RFC: introduce runtime representation of broadcast fusion #23692

Closed

timholy added a commit that referenced this issue Jan 3, 2018

Test fixes to #21094, #22053

53b3d14

timholy added a commit that referenced this issue Jan 3, 2018

Integrate lazy broadcast representation into new broadcast machinery

a049a70

Among other things, this supports returning AbstractRanges for appropriate inputs. Fixes #21094, fixes #22053

timholy mentioned this issue Jan 3, 2018

RFC: Customizable lazy broadcasting with options for pure-Julia fusion and eager evaluation #25377

Closed

timholy added a commit that referenced this issue Jan 7, 2018

Integrate lazy broadcast representation into new broadcast machinery

9115e1f

Among other things, this supports returning AbstractRanges for appropriate inputs. Fixes #21094, fixes #22053

timholy added a commit that referenced this issue Jan 7, 2018

Integrate lazy broadcast representation into new broadcast machinery

0698edc

Among other things, this supports returning AbstractRanges for appropriate inputs. Fixes #21094, fixes #22053

mbauman mentioned this issue Apr 24, 2018

Customizable lazy fused broadcasting in pure Julia #26891

Merged

mbauman added the broadcast Applying a function over a collection label Apr 24, 2018

mbauman closed this as completed in #26891 Apr 26, 2018

blegat mentioned this issue Sep 29, 2018

[release-0.18] Segfault on Julia v0.6 jump-dev/JuMP.jl#1501

Closed

c42f mentioned this issue Sep 11, 2019

Simplify lowering of typed comprehension #32709

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`generated function body is not pure` when using dot notation #21094

`generated function body is not pure` when using dot notation #21094

gasagna commented Mar 19, 2017

yuyichao commented Mar 19, 2017

gasagna commented Mar 19, 2017

gasagna commented Jun 30, 2017

yuyichao commented Jun 30, 2017

gasagna commented Jun 30, 2017

yuyichao commented Jun 30, 2017

gasagna commented Jul 2, 2017

marius311 commented Dec 9, 2017

gasagna commented Dec 10, 2017

generated function body is not pure when using dot notation #21094

generated function body is not pure when using dot notation #21094

Comments

gasagna commented Mar 19, 2017

yuyichao commented Mar 19, 2017

gasagna commented Mar 19, 2017

gasagna commented Jun 30, 2017

yuyichao commented Jun 30, 2017

gasagna commented Jun 30, 2017

yuyichao commented Jun 30, 2017

gasagna commented Jul 2, 2017

marius311 commented Dec 9, 2017

gasagna commented Dec 10, 2017

`generated function body is not pure` when using dot notation #21094

`generated function body is not pure` when using dot notation #21094