RFC: Do not consider iterators as scalars in broadcast #25356

nalimilan · 2018-01-02T12:34:40Z

Consider that all types implementing start are collections, and throw an error for SizeUnknown and IsInfinite iterators. This makes broadcast fail by default for most iterators, since the current fallback functions assume that collections support indexing. Custom iterators could implement their own methods, but the default ones should probably be improved to collect iterators without requring indexing.

Possible fix for #18618.

This change is not terribly appealing as-is as it does not add any new feature, it just throws an error when calling broadcast on iterators, where they were previously treated as scalar. Treating iterators as scalars isn't very useful, as can be seen from the fast that only one test relied on this. In the perspective of the feature freeze, throwing errors is good since it will allow supporting broadcast on iterators without breaking existing code. But I'm not yet sure what the fallback implementation can look like: it would probably have to process one element at a time, but if multiple iterators are passed one of them would have to be collected temporarily somewhere (since repeated indexing is not possible in general).

Consider that all types implementing start() are collections, and throw an error for SizeUnknown and IsInfinite iterators. This makes broadcast() fail by default for most iterators, since the current fallback functions assume that collections support indexing. Custom iterators could implement their own methods, but the default ones should probably be improved to collect iterators without requring indexing.

timholy

Sorry I didn't notice the original discussion. I think that with a small change this may be workable. Nanosoldier will be critical.

Needs tests, though.

timholy · 2018-01-02T13:15:05Z

base/broadcast.jl

-BroadcastStyle(::Type) = Scalar()
+hasshape_ndims(::Base.HasShape{N}) where {N} = N
+function BroadcastStyle(::Type{T}) where T
+    if method_exists(start, Tuple{T})


This is not inferrable, and thus might be pretty bad for code like

function foo(x) y = Float64.(x) # now do something with y end

Unfortunately #16422 wouldn't help.

At the same time, I recognize that any problematic type can be optimized by adding a specific defintion.

At a minimum we may have to change the typeof calls in collect_styles to Core.Typeof, and then specialize

BroadcastStyle(::Type{Type{T}}) where T = Scalar()

We could require iterators to define a trait. That would be more consistent with what we do elsewhere, and that wouldn't be a terrible burden either. I had contemplated adding an NotIterable type to Base.iteratorsize, but that could also be a separate function like isiterable.

On balance I think it's better to require iterators to define a trait, and make scalars the default.

One slightly-crazy thought is that the trait name could be the output of BroadcastStyle. But on balance I'm not sure this is a good idea, because there may be reasons to have things that act like scalars that don't return Scalar(). It's probably better to have a separate isiterable trait.

@vtjnash mentioned here that method_exists could be made inferable now, presumably circumnavigating the problem mentioned in #16422.

It could, but it hasn't been in the past since we don't want the coupling. Also, #25261 will break this.

timholy · 2018-01-02T13:21:27Z

base/generator.jl

@@ -53,7 +53,7 @@ end
 abstract type IteratorSize end
 struct SizeUnknown <: IteratorSize end
 struct HasLength <: IteratorSize end
-struct HasShape <: IteratorSize end
+struct HasShape{N} <: IteratorSize end


nalimilan · 2018-01-02T13:45:10Z

base/traits.jl

@@ -57,3 +57,20 @@ struct RangeStepRegular   <: TypeRangeStep end # range with regular step
 struct RangeStepIrregular <: TypeRangeStep end # range with rounding error

 TypeRangeStep(instance) = TypeRangeStep(typeof(instance))
+
+## iterable trait


This code isn't used in the PR currently, but it illustrates the alternative approach based on a trait rather than o method_exists(start, Tuple{T}). It fixes the type inference issue (provided the fallback doesn't call method_exists as it currently does).

BTW, I've noted an inconsistency in the naming of traits: we have iteratorsize, iteratoreltype, but IndexStyle, TypeRangeStep, TypeArithmetic and TypeOrder. Looks like the CamelCase variants are more numerous and more recent, so maybe we should adopt that convention everywhere? Added to #20402.

Uppercase makes the most sense when what will be returned is a type-instance---for example, IndexStyle(a) will return T() where T<:IndexStyle. With better constant-prop it's less obvious that we need to return a dedicated type-instance, although that does have some advantage in clarity.

👎 Adding a new thing every iterable type needs to define is not ideal.

Maybe not "ideal", but not the end of the world either IMHO given that you need to define several methods anyway. And if we really don't want to add another trait, we can add a new type to iteratorsize (and rename it), which will have the advantage of making the choice of the type more explicit.

Anyway I'm all ears if somebody has a better solution that works (i.e. is inferrable, see @timholy's comment above).

JeffBezanson · 2018-01-02T19:25:44Z

base/iterators.jl

+prod_iteratorsize(::HasLength, ::HasLength) = HasShape{2}()
+prod_iteratorsize(::HasLength, ::HasShape{N}) where {N} = HasShape{N+1}()
+prod_iteratorsize(::HasShape{N}, ::HasLength) where {N} = HasShape{N+1}()
+prod_iteratorsize(::HasShape{M}, ::HasShape{N}) where {M,N} = HasShape{M+N}()


I don't think these will be inferrable.

@code_warntype seems to be happy with a similar example:

f(::AbstractArray{S,M}, ::AbstractArray{T,N}) where {S,T,M,N} = Array{M+N}() @code_warntype f([1], [1 2])

Oh, you're right.

mbauman · 2018-03-22T16:01:08Z

Superseded by #26435.

timholy reviewed Jan 2, 2018

View reviewed changes

nalimilan added the broadcast label Jan 2, 2018

nalimilan commented Jan 2, 2018

View reviewed changes

This was referenced Jan 2, 2018

Broadcast had one job (e.g. broadcasting over iterators and generator) #18618

Closed

API consistency review #20402

Closed

JeffBezanson reviewed Jan 2, 2018

View reviewed changes

This was referenced Jan 16, 2018

Change findfirst/findlast/findnext/findprev to return the same index type as keys() #25577

Merged

Change findfirst and findlast to return cartesian indices with HasShape iterators #25655

Merged

mbauman closed this Mar 22, 2018

mbauman added the broadcast Applying a function over a collection label Apr 24, 2018

nalimilan deleted the nl/iterable branch December 1, 2018 17:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: Do not consider iterators as scalars in broadcast #25356

RFC: Do not consider iterators as scalars in broadcast #25356

nalimilan commented Jan 2, 2018

timholy left a comment •

edited

Loading

timholy Jan 2, 2018

nalimilan Jan 2, 2018

timholy Jan 2, 2018

mauro3 Jan 3, 2018

vtjnash Jan 3, 2018

timholy Jan 2, 2018

nalimilan Jan 2, 2018

timholy Jan 2, 2018

JeffBezanson Jan 2, 2018

nalimilan Jan 2, 2018

JeffBezanson Jan 2, 2018 •

edited

Loading

nalimilan Jan 2, 2018

JeffBezanson Jan 2, 2018

mbauman commented Mar 22, 2018

RFC: Do not consider iterators as scalars in broadcast #25356

RFC: Do not consider iterators as scalars in broadcast #25356

Conversation

nalimilan commented Jan 2, 2018

timholy left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

JeffBezanson Jan 2, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mbauman commented Mar 22, 2018

timholy left a comment •

edited

Loading

JeffBezanson Jan 2, 2018 •

edited

Loading