Fix #17970 (output of map over BitArrays) #18198

pabloferz · 2016-08-23T13:19:41Z

With this patch the output of mapping over BitArrays is properly decided based on the return type of the function (if Bool the return type will be a BitArray and an Array of the appropriate type otherwise).

@martinholters Can you take a look?

Cc: @JeffBezanson

pabloferz · 2016-08-23T13:22:06Z

Also, can anyone check that there are not performance regressions (once tests pass)?

JeffBezanson · 2016-08-23T14:35:35Z

I think this is too inference-dependent; our current rule is that the result can only depend on inference for empty arrays. I would prefer the simple and predictable behavior of only returning a BitArray for the set of boolean functions listed in the code (~, !, &, |, etc.)

pabloferz · 2016-08-23T15:17:19Z

~~Ok, I changed so it returns BitArray only for the special boolean functions ~, !, &, etc. (returning, for any other case, an Array of the appropriate type).~~

EDIT: It turns out that the fall-back map (which relies on collect) works properly with BitArrays, so in the end, the solution was to remove some methods and reorganize the specialized ones.

tkelman · 2016-08-23T16:15:02Z

@nanosoldier runbenchmarks(ALL, vs = ":master")

JeffBezanson · 2016-08-23T16:22:53Z

Ah yes, that's a good solution.

nanosoldier · 2016-08-23T18:50:23Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

martinholters · 2016-08-24T07:41:13Z

LGTM - hooray, BitChunkFunctor is gone!
I doesn't look to me like any of the performance regressions are caused by this. But I also wonder if mapping over BitArrays is exercised at all in the benchmarks. Did you do some specific benchmarking locally? Thinking about specialization, would bit_map!{F<:Function}(f::F, ...) be better?

pabloferz · 2016-08-24T13:09:57Z

AFAICS, I also believe that the performance regressions are unrelated. And I think you're right, it doesn't seem like there are benchmarks for mapping over BitArrays. Locally I checked that the special cases (&, ~, etc.) weren't affected.

As for the specialization, that might help a bit in some cases, but I'm not sure.

pabloferz · 2016-08-24T13:17:20Z

our current rule is that the result can only depend on inference for empty arrays

@JeffBezanson Is there any consideration of using inference when the result type is concrete as an optimization and falling back to use the values otherwise (or that is problematic in any way)?

yuyichao · 2016-08-24T13:19:54Z

Is there any consideration of using inference when the result type is concrete as an optimization

Type inference is supposed to handle that automatically.

pabloferz · 2016-08-24T13:27:46Z

@yuyichao I should have been more precise, I was particularly referring to doing it here https://github.com/JuliaLang/julia/blob/master/base/array.jl#L312. We currently have

function _collect(c, itr, ::EltypeUnknown, isz::Union{HasLength,HasShape})
    st = start(itr)
    if done(itr,st)
        return _similar_for(c, _default_eltype(typeof(itr)), itr, isz)
    end
    v1, st = next(itr, st)
    collect_to_with_first!(_similar_for(c, typeof(v1), itr, isz), v1, itr, st)
end

but this could be improved a bit like this

function _collect(c, itr, ::EltypeUnknown, isz::Union{HasLength,HasShape})
    et = _default_eltype(typeof(itr))
    isleaftype(et) && return copy!(_similar_for(c, et, itr, isz), itr)
    st = start(itr)
    if done(itr,st)
        return _similar_for(c, et, itr, isz)
    end
    v1, st = next(itr, st)
    collect_to_with_first!(_similar_for(c, typeof(v1), itr, isz), v1, itr, st)
end

yuyichao · 2016-08-24T13:31:12Z

Yes and I'm under the impression that the type inference should handle the first case just fine if the callback is inferrable. Is that not the case?

pabloferz · 2016-08-24T13:38:03Z

Well, it is handled as it should. What I should have probably had to say was that copy! is faster than collect_to_with_first! when the type can be inferred.

yuyichao · 2016-08-24T13:50:13Z

collect_to_with_first! looks pretty efficient to me.

We can certainly tweak the implementation at many points if benchmarking different cases suggests that there's a good reason to do so. I don't see a reason we have to special case the leaf inferred type to get the same performance though.

JeffBezanson · 2016-08-24T14:45:42Z

It should be possible to make collect_to_with_first! where the type is known produce the same code as any other efficient loop. I'm pretty sure it does already, at least in some cases.

pabloferz · 2016-08-24T15:11:02Z

I see, good to know. Sorry for the noise, then.

EDIT: The reason I brought it up the first time was because I was under the impression that it was not the case for some example I was testing. But after benchmarking it properly I see it works a you point out.

pabloferz · 2016-08-25T13:42:32Z

Should this one be backported?

martinholters · 2016-08-25T13:52:08Z

IIUC, this does not change behavior for anything that works in master (or 0.5), but enables something that used to work in 0.4 (and really should work). So backporting sounds like a good idea.

pabloferz · 2016-10-22T03:40:15Z

Bump. This should be good to go as well as back-portable.

(cherry picked from commit a6fec5b) ref #18198

pabloferz force-pushed the pz/mapbitarray branch from 1d79105 to 3959f77 Compare August 23, 2016 13:33

pabloferz force-pushed the pz/mapbitarray branch from 3959f77 to 12c5b38 Compare August 23, 2016 15:13

pabloferz force-pushed the pz/mapbitarray branch from 12c5b38 to 7e86c19 Compare August 23, 2016 15:52

tkelman added the potential benchmark Could make a good benchmark in BaseBenchmarks label Aug 24, 2016

pabloferz mentioned this pull request Aug 29, 2016

[release-0.5] Backports for 0.5.0-rc4 #18276

Merged

nalimilan mentioned this pull request Aug 29, 2016

Behavior of map() and broadcast() JuliaStats/NullableArrays.jl#144

Open

pabloferz force-pushed the pz/mapbitarray branch from 7e86c19 to 0faf311 Compare September 6, 2016 10:33

Base the output of map over BitArrays on the function

a6fec5b

pabloferz force-pushed the pz/mapbitarray branch from 0faf311 to a6fec5b Compare October 22, 2016 02:12

JeffBezanson merged commit 7b41e72 into JuliaLang:master Nov 4, 2016

JeffBezanson added the backport pending 0.5 label Nov 4, 2016

pabloferz deleted the pz/mapbitarray branch November 4, 2016 19:46

tkelman pushed a commit that referenced this pull request Feb 22, 2017

Base the output of map over BitArrays on the function

7da02da

(cherry picked from commit a6fec5b) ref #18198

tkelman removed the backport pending 0.5 label Mar 5, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix #17970 (output of map over BitArrays) #18198

Fix #17970 (output of map over BitArrays) #18198

pabloferz commented Aug 23, 2016 •

edited

Loading

pabloferz commented Aug 23, 2016 •

edited

Loading

JeffBezanson commented Aug 23, 2016

pabloferz commented Aug 23, 2016 •

edited

Loading

tkelman commented Aug 23, 2016

JeffBezanson commented Aug 23, 2016

nanosoldier commented Aug 23, 2016

martinholters commented Aug 24, 2016

pabloferz commented Aug 24, 2016 •

edited

Loading

pabloferz commented Aug 24, 2016

yuyichao commented Aug 24, 2016

pabloferz commented Aug 24, 2016

yuyichao commented Aug 24, 2016

pabloferz commented Aug 24, 2016 •

edited

Loading

yuyichao commented Aug 24, 2016 •

edited

Loading

JeffBezanson commented Aug 24, 2016

pabloferz commented Aug 24, 2016 •

edited

Loading

pabloferz commented Aug 25, 2016

martinholters commented Aug 25, 2016

pabloferz commented Oct 22, 2016

Fix #17970 (output of map over BitArrays) #18198

Fix #17970 (output of map over BitArrays) #18198

Conversation

pabloferz commented Aug 23, 2016 • edited Loading

pabloferz commented Aug 23, 2016 • edited Loading

JeffBezanson commented Aug 23, 2016

pabloferz commented Aug 23, 2016 • edited Loading

tkelman commented Aug 23, 2016

JeffBezanson commented Aug 23, 2016

nanosoldier commented Aug 23, 2016

martinholters commented Aug 24, 2016

pabloferz commented Aug 24, 2016 • edited Loading

pabloferz commented Aug 24, 2016

yuyichao commented Aug 24, 2016

pabloferz commented Aug 24, 2016

yuyichao commented Aug 24, 2016

pabloferz commented Aug 24, 2016 • edited Loading

yuyichao commented Aug 24, 2016 • edited Loading

JeffBezanson commented Aug 24, 2016

pabloferz commented Aug 24, 2016 • edited Loading

pabloferz commented Aug 25, 2016

martinholters commented Aug 25, 2016

pabloferz commented Oct 22, 2016

pabloferz commented Aug 23, 2016 •

edited

Loading

pabloferz commented Aug 23, 2016 •

edited

Loading

pabloferz commented Aug 23, 2016 •

edited

Loading

pabloferz commented Aug 24, 2016 •

edited

Loading

pabloferz commented Aug 24, 2016 •

edited

Loading

yuyichao commented Aug 24, 2016 •

edited

Loading

pabloferz commented Aug 24, 2016 •

edited

Loading