Reduce number of `getindex(::Type, ...)` methods #44127

martinholters · 2022-02-11T11:38:37Z

Previously, there were special cases for T[], T[a], T[a,b] and T[a,b,c]. Together with the general case for more elements, that meant five methods to consider in cases like T[x...] where the length of x was not known at compile time. That was beyond the inference limit and such a call would be inferred as Any. So this change gets rid of all the special cases.

The loop-based general case worked well if all arguments were of the same type, but otherwise suffered from type-instability inside the loop. Without the special cases for low element count this would be hit more often, so the loop is replaced with a call to afoldl that basically unrolls the loop for up to 32 elements.

julia> f(x) = Int[x...];

julia> @code_typed(f([1]))[2]
Any # master
Vector{Int64} (alias for Array{Int64, 1}) # PR

julia> @btime Float64[1, 2.0, 0x3];
  35.326 ns (1 allocation: 80 bytes) # master
  35.215 ns (1 allocation: 80 bytes) # PR

julia> @btime Float64[1, 2.0, 0x3, 4.0f0];
  255.610 ns (7 allocations: 256 bytes) # master
  36.245 ns (1 allocation: 96 bytes) # PR

julia> @btime Float64[1, 2, 3, 4];
  35.578 ns (1 allocation: 96 bytes) # master
  35.143 ns (1 allocation: 96 bytes) # PR

Motivated by JuliaMath/FFTW.jl#231.

base/array.jl

JeffBezanson · 2022-02-11T16:59:27Z

This is good; I think the only downside is that we lose the vararg tuple elision after 32:

julia> @btime Int[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33]
  32.142 ns (1 allocation: 336 bytes)

julia> @btime gi(Int,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33)
  351.113 ns (4 allocations: 656 bytes)

(gi is the method from this PR)
Maybe manually inlining afoldl would fix it? Declaring the call site @inline doesn't seem to work; maybe it is too big.

N5N3 · 2022-02-11T18:03:25Z

I guess we can use for loop if vals isa NTuple?
Similar to #44063, that PR also wants to resolve instability in loop.

martinholters · 2022-02-15T10:40:27Z

Updated with @N5N3's idea to keep the for loop in the NTuple case. Now Int[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33] looks good (=as in master), too. CI failure on linux64 is a timeout I'd assume to be unrelated.

Previously, there were special cases for `T[]`, `T[a]`, `T[a,b]` and `T[a,b,c]`. Together with the general case for more elements, that meant five methods to consider in cases like `T[x...]` where the length of `x` was not known at compile time. That was beyond the inference limit and such a call would be inferred as `Any`. So this change gets rid of all the special cases. The loop-based general case worked well if all arguments were of the same type, but otherwise suffered from type-instability inside the loop. Without the special cases for low element count this would be hit more often, so for the non-homogenous case, the loop is replaced with a call to `afoldl` that basically unrolls the loop for up to 32 elements.

vtjnash

SGTM. Seems a clever solution to this

martinholters · 2022-02-17T07:15:45Z

CI is all green after another rebase, feedback is generally positive. I'm going to merge tomorrow unless anyone objects (or beats me to it).

martinholters · 2022-02-17T18:54:33Z

CI is all green after another rebase

I spoke too soon, buildbot/tester_linux64 timed out again. But I've seen that in other PRs, too, so let me declare it unrelated.

oscardssmith · 2022-02-18T20:15:40Z

Why is this getting backported?

KristofferC · 2022-02-18T20:33:01Z

There is some leeway for PRs that were more or less finished when feature freeze hit and just needed review/CI to go through.

Previously, there were special cases for `T[]`, `T[a]`, `T[a,b]` and `T[a,b,c]`. Together with the general case for more elements, that meant five methods to consider in cases like `T[x...]` where the length of `x` was not known at compile time. That was beyond the inference limit and such a call would be inferred as `Any`. So this change gets rid of all the special cases. The loop-based general case worked well if all arguments were of the same type, but otherwise suffered from type-instability inside the loop. Without the special cases for low element count this would be hit more often, so for the non-homogeneous case, the loop is replaced with a call to `afoldl` that basically unrolls the loop for up to 32 elements. (cherry picked from commit b8e5d7e)

Previously, there were special cases for `T[]`, `T[a]`, `T[a,b]` and `T[a,b,c]`. Together with the general case for more elements, that meant five methods to consider in cases like `T[x...]` where the length of `x` was not known at compile time. That was beyond the inference limit and such a call would be inferred as `Any`. So this change gets rid of all the special cases. The loop-based general case worked well if all arguments were of the same type, but otherwise suffered from type-instability inside the loop. Without the special cases for low element count this would be hit more often, so for the non-homogeneous case, the loop is replaced with a call to `afoldl` that basically unrolls the loop for up to 32 elements.

dkarrasch reviewed Feb 11, 2022

View reviewed changes

base/array.jl Outdated Show resolved Hide resolved

martinholters force-pushed the mh/getindex-type-with-afoldl branch from 71c4298 to b8bbc31 Compare February 14, 2022 13:30

KristofferC added the backport 1.8 Change should be backported to release-1.8 label Feb 16, 2022

martinholters force-pushed the mh/getindex-type-with-afoldl branch from b8bbc31 to f9db5d9 Compare February 16, 2022 13:40

vtjnash reviewed Feb 16, 2022

View reviewed changes

KristofferC mentioned this pull request Feb 18, 2022

release-1.8: Backports for julia 1.8-beta1/2 #44237

Merged

33 tasks

martinholters merged commit b8e5d7e into master Feb 18, 2022

martinholters deleted the mh/getindex-type-with-afoldl branch February 18, 2022 08:47

KristofferC removed the backport 1.8 Change should be backported to release-1.8 label Feb 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce number of `getindex(::Type, ...)` methods #44127

Reduce number of `getindex(::Type, ...)` methods #44127

martinholters commented Feb 11, 2022

JeffBezanson commented Feb 11, 2022 •

edited

Loading

N5N3 commented Feb 11, 2022

martinholters commented Feb 15, 2022 •

edited

Loading

vtjnash left a comment

martinholters commented Feb 17, 2022

martinholters commented Feb 17, 2022

oscardssmith commented Feb 18, 2022

KristofferC commented Feb 18, 2022

Reduce number of getindex(::Type, ...) methods #44127

Reduce number of getindex(::Type, ...) methods #44127

Conversation

martinholters commented Feb 11, 2022

JeffBezanson commented Feb 11, 2022 • edited Loading

N5N3 commented Feb 11, 2022

martinholters commented Feb 15, 2022 • edited Loading

vtjnash left a comment

Choose a reason for hiding this comment

martinholters commented Feb 17, 2022

martinholters commented Feb 17, 2022

oscardssmith commented Feb 18, 2022

KristofferC commented Feb 18, 2022

Reduce number of `getindex(::Type, ...)` methods #44127

Reduce number of `getindex(::Type, ...)` methods #44127

JeffBezanson commented Feb 11, 2022 •

edited

Loading

martinholters commented Feb 15, 2022 •

edited

Loading