Some broadcast performance tweaks #19879

pabloferz · 2017-01-05T17:59:36Z

This allows dot-calls to fully specialize on the first argument when it is a type. The solution here does not extend to type arguments in other positions. For that, something like #19829 for type arguments would be needed, but I'm not sure that is possible in general (except for built-in types).

This PR also allows to remove shape checking in broadcast! when applying an @inbounds to it.

Should fix #19849

pabloferz · 2017-01-05T18:17:29Z

With this patch, on my machine, I get

julia> A = collect(0.0:0.1:100.0);

julia> @benchmark (x->trunc(Int,x)).($A)
BenchmarkTools.Trial: 
  memory estimate:  8.00 kb
  allocs estimate:  1
  --------------
  minimum time:     2.635 μs (0.00% GC)
  median time:      2.738 μs (0.00% GC)
  mean time:        3.113 μs (6.93% GC)
  maximum time:     124.929 μs (92.14% GC)
  --------------
  samples:          10000
  evals/sample:     9
  time tolerance:   5.00%
  memory tolerance: 1.00%

julia> @benchmark trunc.(Int, $A)
BenchmarkTools.Trial: 
  memory estimate:  8.00 kb
  allocs estimate:  1
  --------------
  minimum time:     2.662 μs (0.00% GC)
  median time:      2.798 μs (0.00% GC)
  mean time:        3.061 μs (6.39% GC)
  maximum time:     69.634 μs (90.93% GC)
  --------------
  samples:          10000
  evals/sample:     9
  time tolerance:   5.00%
  memory tolerance: 1.00%

tkelman · 2017-01-05T21:42:49Z

Nice. A little off-topic, but I sent you an email (at the address you use in your gitconfig) on new year's eve, did it go to spam?

pabloferz · 2017-01-05T22:07:44Z

I just had a look at the email you sent me (it got lost among a bunch of other emails). That's a really nice new year's surprise. Thank you!

pabloferz · 2017-01-05T22:17:31Z

Let's give my new granted powers a run: @nanosoldier runbenchmarks(ALL, vs = ":master")

tkelman · 2017-01-05T22:21:20Z

I think you need to click a confirm button somewhere, github likes to hide these things

pabloferz · 2017-01-05T22:31:53Z

Found it. Let's try again. @nanosoldier runbenchmarks(ALL, vs = ":master")

nanosoldier · 2017-01-06T02:34:12Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

martinholters · 2017-01-06T15:39:52Z

I had another idea the other day: To make the outer layer broadcast[!] inspect its argument types, and if any of them are scalar, inline them in the function argument, so that e.g. with T=Int and xs=rand(5), broadcast(f, T, xs) would call broadcast(a -> f(T, a), xs). I'm not sure whether that is at all possible without loosing inferability. The problem boils down to being able to write something like

function fixarg{i}(f, ::Type{Val{i}}, v)
    (args...) -> begin
        f(args[1:i-1]..., v, args[i:end]...)
    end
end

so that foo() = fixarg(^, Val{2}, 2)(3) could be inferred. I think this is impossible as it would need to instantiate a new closure type based on the Val{i} type, i.e. during inference. Can someone confirm this or is the general idea worth pursuing?

tkelman · 2017-01-06T16:08:32Z

like #19724?

martinholters · 2017-01-06T16:22:35Z

Yes, of course, thanks for the pointer! Let me see whether I can make something useful out of that...

pabloferz · 2017-01-06T16:31:34Z

There's also some discussion of other possible approaches here

stevengj · 2017-01-07T16:14:06Z

We can't do #19829 for type objects, because there's no "type literal" in Julia... something like Int is just an identifier, and it is only resolved to be a type much later in the compilation process (long after lowering).

broadcast performance tweaks

49859b9

KristofferC added performance Must go faster broadcast Applying a function over a collection labels Jan 5, 2017

JeffBezanson merged commit 236a8af into JuliaLang:master Jan 6, 2017

pabloferz deleted the pz/bc-typearg branch January 6, 2017 22:18

pabloferz mentioned this pull request Jan 9, 2017

broadcast[!] over combinations of scalars and sparse vectors/matrices #19724

Merged

KristofferC mentioned this pull request Feb 26, 2017

fast broadcast over combinations of tuples and scalars #20817

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some broadcast performance tweaks #19879

Some broadcast performance tweaks #19879

pabloferz commented Jan 5, 2017 •

edited

Loading

pabloferz commented Jan 5, 2017

tkelman commented Jan 5, 2017

pabloferz commented Jan 5, 2017

pabloferz commented Jan 5, 2017 •

edited

Loading

tkelman commented Jan 5, 2017

pabloferz commented Jan 5, 2017

nanosoldier commented Jan 6, 2017

martinholters commented Jan 6, 2017

tkelman commented Jan 6, 2017

martinholters commented Jan 6, 2017

pabloferz commented Jan 6, 2017 •

edited

Loading

stevengj commented Jan 7, 2017

Some broadcast performance tweaks #19879

Some broadcast performance tweaks #19879

Conversation

pabloferz commented Jan 5, 2017 • edited Loading

pabloferz commented Jan 5, 2017

tkelman commented Jan 5, 2017

pabloferz commented Jan 5, 2017

pabloferz commented Jan 5, 2017 • edited Loading

tkelman commented Jan 5, 2017

pabloferz commented Jan 5, 2017

nanosoldier commented Jan 6, 2017

martinholters commented Jan 6, 2017

tkelman commented Jan 6, 2017

martinholters commented Jan 6, 2017

pabloferz commented Jan 6, 2017 • edited Loading

stevengj commented Jan 7, 2017

pabloferz commented Jan 5, 2017 •

edited

Loading

pabloferz commented Jan 5, 2017 •

edited

Loading

pabloferz commented Jan 6, 2017 •

edited

Loading