WIP:Add Transpose immutable #6837

andreasnoack · 2014-05-14T08:07:42Z

Here is a sketch of how a Transpose type could work. Before continuing with redefining all the methods to the new type, I would like to hear what people think about it. Basically, I am just introducing the immutable

immutable Transpose{T, N, S<:AbstractArray{T, N}, Conj} <: AbstractArray{T, N}
    data::S
end

and although there is an N there, this proposal deals with arrays of dimension one and two. That is what is possible to transpose right now and I think it is beyond the scope of this proposal to handle higher dimensions.

This is basically a kind of lazy transpose which we can use for dispatch to optimised methods. If the idea is accepted, we can remove all the Ax_mul_Bx! methods and just have mul! and similarly for ldiv/rdiv, see #5332.

With this proposal transpose(Vector{T}) is a VectorTranspose{T,Vector{T},false}, which is still a one dimensional thing, but with this we can have

julia> A=randn(3,3);x=randn(3);

julia> x'A*x
-3.0853326038138893

julia> x'x
3.0143512744908234

which was discussed in #4774. Another useful consequence would be that we can evaluate something like A*B*C' more efficiently. Right now B*C' calls A_mul_Bc without actually forming C', but A*B*C' calls *(A,B,ctranspose(C)) so for more complicated matrix expression there is more copying than necessary.

cc: @jiahao, @toivoh

tkelman · 2014-05-14T09:38:04Z

I like the general idea of lazy transpose. The tricky part is how do you propose to handle setindex, does writing into a transpose object change the original matrix? Guess you just document really clearly if transpose returns a view, and you use B = full(A') (or should that be B = copy(A')?) when you need a copy? Same concern as for slices, great for performance but a temporary disruption that could cause bugs in existing code.

Properly supporting sparse here basically means introducing a SparseMatrixCSR type, transpose outputs the same data as the opposite type and the current sparse transpose implementation becomes the conversion operation between the two types.

jiahao · 2014-05-14T17:14:42Z

What does transpose mean for N>2?

toivoh · 2014-05-14T17:23:07Z

Since we want a transposed vector to be a row vector (as opposed to a row matrix) I guess thst the only reasonable definition of transposing higher dimensional arrays would be to swap the first two dimensions. But I agree that starting out with N <= 2 makes a lot of sense.

Also, thanks for trying this out!

jiahao · 2014-05-14T17:32:32Z

I guess thst the only reasonable definition of transposing higher dimensional arrays would be to swap the first two dimensions.

I don't see a unique definition for N>2; you could easily argue for swapping the last two dimensions, or reversing the order of all the dimensions.

kmsquire · 2014-05-14T17:34:29Z

Python Pandas defines a version of transpose for N=3 that allows arbitrary swapping of dimensions, which seems (to me) like a reasonable definition.

Do LAPACK routines allow for arbitrary strides along each dimension, or only the major stride?

mbauman · 2014-05-14T17:40:20Z

Numpy's stock transpose (x.T) reverses the dimensions by default. Matlab refuses and punts to permute in its error message. I think erroring on N>2 is sensible.

tknopp · 2014-05-14T17:44:51Z

+1 for reversing the dimensions. This basically gives us row major arrays.

jiahao · 2014-05-14T17:53:42Z

Matlab refuses and punts to permute in its error message.

That's really my point, that for Transpose{N>2} to be sensible, it needs to store a permutation (or similar combinatorial object) to distinguish between all the possible generalizations.

StefanKarpinski · 2014-05-14T17:57:56Z

Reversing the dimensions is ok as long as Transpose{Vector} is treated like a row matrix the same way that Vector is treated like a column matrix. Making all of that consistent is going to take some effort.

toivoh · 2014-05-14T18:29:36Z

Reversing the dimensions would mean that the transpose of a column vector
would be the same column vector. What we want is that a transposed vector
should behave like the first and second dimensions were swapped, just as
with a matrix. Swapping the last two dimensions on a column vector would
swap the nonexistant zeroth dimension with the first, again unlike the
matrix case.

But I agree that transpose in more than two dimensions d
oesn't really make sense unless you store a permutation (that seems to be
hard to unify with the row vector kind of transposition)

On Wed, May 14, 2014 at 7:58 PM, Stefan Karpinski
notifications@github.comwrote:

Reversing the dimensions is ok as long as Transpose{Vector} is treated
like a row matrix the same way that Vector is treated like a column matrix.
Making all of that consistent is going to take some effort.

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/6837#issuecomment-43114985
.

andreasnoack · 2014-05-14T19:55:56Z

@tkelman I/We better document this well, but the general shift to views will probably give some surprises in terms of breaking code, but hopefully also in faster code.

@kmsquire LAPACK requires the storage to be contiguous along the major stride. BLAS allows arbitrary strides for vector arguments i.e. BLAS 1 & 2.

@jiahao I'm not familiar with transposing arrays with N>2. In this sketch, I have made transpose and ctranspose return an error when N>2 although the type is actually defined in general. Based on the discussion in this thread, it is my feeling that we could start out as it is now and then consider adding a permutation property to the type at some point when we feel ready for higher dimensions.

I think the permutation vector approach can be combined with the covector behaviour. Although the permutation of [1] might feel a little trivial, we would still have Transpose type to dispatch on to control how the transposed vector should behave.

But again, right now we only support transposing when N<3 and I think this proposal is an improvement for that, maybe limited, case. If anyone find it problematic to change the way we handle transposes now, please raise your voice or I'll continue to work on this proposal.

Finally, I would be happy if someone with Scheme skills could have a look at the lines I have commented out in julia-syntax.scm.

simonbyrne · 2014-05-21T14:38:45Z

I think this is an excellent idea (I particularly like that u' v could replace dot(u,v)), and a very elegant solution to the A_mul_B mess.

I think it is worth considering making conjugate-transpose a distinct type, say CTranspose{T<:Complex, N, S <: AbstractArray{T,N}}, as I feel that would make dispatch a little cleaner.

StefanKarpinski · 2014-05-21T16:05:17Z

I agree about conjugate transpose. If we're going to have Transpose it seems like we'd really want CTranspose as well.

simonbyrne · 2014-05-21T16:40:58Z

There is one other issue with the A_mul_B! methods that this doesn't address: at the moment the 2-argument form of A_mul_B!(A,B) still doesn't specify which matrix should be overwritten. Most methods overwrite B in this case, but there are some exceptions.

Some possible solutions:

Define mul!l and mul!r (for overwriting the left and right argument, respectively)
Create an Inplace{N} type, which could be the first argument of mul!, i.e. mul!(Inplace{1},A,B) would overwrite A, and mul!(Inplace{2},A,B) would overwrite B.

simonbyrne · 2014-05-22T14:47:35Z

To follow up, I threw together a package for providing a convenient syntax for in-place operations, e.g.

@in2! X' \ y

will call Ac_mul_B!(X,y). It duplicates some of the proposed functionality proposed in this PR, so will reduce in size considerably if this gets accepted.

Jutho · 2014-05-22T15:19:38Z

While I like the general attitude of switching to views where possible, and also see the advantage of having a Transpose type for the matrix multiplication zoo, one might want to consider how to deal with the corresponding CTranspose type? Since this involves an additional complex conjugation, it is not simply a view. If the CTranspose type is used outside the context of matrix multiplication, when will the conjugation be applied? Every time the user accesses a specific entry of the CTranspose object. If this happens many times, it could become much more inefficient then just computing the complex conjugated matrix or Hermitian conjugated matrix once.

StefanKarpinski · 2014-05-22T15:31:40Z

The CTranspose type isn't entirely necessary. ctranspose of real arrays could apply the lazy Transpose construction, while ctranspose of complex arrays could do the actual eager ctranspose. Of course, that introduces a problem generic programming in that for some array types a' is a view of a and modifying it will affect a whereas for others it creates a copy and you can safely modify a' without affecting a. If you weren't allowed to modify lazy transposes, this would be less problematic, but I think that having a CTranspose type that does this might be best:

setindex!{M<:Matrix}(t::CTranspose{M}, v, i, j) = (t.array[j,i] = v')

toivoh · 2014-05-22T18:39:47Z

I would be inclined to think that negating the imaginary part of a complex
number is pretty cheap compared to loading from / storing to memory.

StefanKarpinski · 2014-05-23T20:14:34Z

Yes, I suspect that's true.

Jutho · 2014-05-25T09:50:06Z

That's probably true indeed. I just started wondering about this since I noticed that the current code for getindex in this PR does not take the complex conjugation into account. But that's perfectly fine, it is WIP after all.

simonbyrne · 2014-05-27T08:37:53Z

I was playing around with this idea over the weekend, and while it makes things drastically simpler in general (linalg/matmul.jl can be reduced to a handful of methods), some new problems are introduced, all due to having Transpose{T,1} <: AbstractVector{T}.

Whereas Transpose{T,2} still behaves exactly like an Matrix{T}, a Transpose{T,1} can act differently than an Vector{T} (e.g. *(a::Vector{T}, b::Tranpose{T,1}) and *(a::Tranpose{T,1}, b::Vector) return very different objects) and so almost any time an AbstractVector occurs, additional methods are required to differentiate between the two behaviours, and I don't think this approach is sustainable over the whole julia ecosystem.

As a result, we would probably need to split Transpose into separate types, say TransposeVector and TransposeMatrix. There are two options that I've thought of:

Make TransposeVector{T} <: AbstractMatrix{T}: if we were to follow this strictly, it would mean that we would no longer be able to have u' v == dot(u,v).
Make TransposeVector{T] <: AbstractArray{T,-1} (or some other non-standard array type), to emphasise that its behaviour is different than a standard Array.

simonbyrne · 2014-05-28T14:04:15Z

To follow up, here's my attempt: 1919d59 (branched from @andreasnoackjensen's code). It defines separate types Transpose/ConjTranspose (for matrices), and Covector/ConjCovector (for vectors).

It almost halves the number of lines in matmul.jl, and significantly reduces duplicated code (e.g. each BLAS call appears only once).

StefanKarpinski · 2014-05-28T14:20:02Z

@simonbyrne – that seems like a fairly sane, targeted approach. I guess it feels a bit limited not to make the transpose construction more general but since we a) can't agree on what higher order transpose should mean and b) it causes all kinds of generic programming problems as you outlined, maybe the targeted approach for just vectors and matrices is best.

It also occurs to me that we could use Array{T,-n} as the actual transposed types rather than using a wrapper. Arrays with negative dimensionality would count their dimensions downward instead of upward. Then Array{T,-1} could be covector, Array{T,-2} a "comatrix", and so on. A comatrix would basically be a row-major matrix.

simonbyrne · 2014-05-28T15:04:17Z

@StefanKarpinski That's perhaps worth considering, though it would mean that every AbstractMatrix would need to define a corresponding type for its transpose. And we would still need a Conjugate wrapper type for dealing with complex-transpose.

lindahua · 2014-05-28T15:13:00Z

We should be careful about the idea of Array{T,-N}. Many functions defined over Array are not supposed to work with such types.

The Transpose type is more clean and explicit.

jiahao · 2014-05-28T18:05:27Z

I'm not a fan of Array{T,-N}: it doesn't generalize correctly beyond N>2 since for higher ranks you can have nontrivial behaviors associated with different dimensions being "up" or "down". I'd rather just have TransposeVector be its own thing rather than try to shoehorn it into the type hierarchy.

StefanKarpinski · 2014-05-28T18:10:17Z

I generally agree that the Transpose and Covector parametric types are probably a safer way to go.

On May 28, 2014, at 11:13 AM, Dahua Lin notifications@github.com wrote:

We should be careful about the idea of Array{T,-N}. Many functions defined over Array are not supposed to work with such types.

The Transpose type is more clean and explicit.

—
Reply to this email directly or view it on GitHub.

simonbyrne · 2014-05-29T16:27:58Z

One more brainteaser: what should be the (conjugate)transpose of a matrix of matrices? Should we also transpose the individual entries?

timholy · 2016-04-29T09:42:23Z

base/linalg/transpose.jl

+        ctransposeblock!(B,A,m,n-newn,offseti,offsetj+newn)
+    end
+    return B
+end


Boy, there's a lot of near-duplicated code here. I'd really prefer a generic implementation in which you passed transpose or ctranspose as an argument.

timholy · 2016-04-29T09:48:40Z

Nice!

How did the benchmarks work out? Naturally, I'm only concerned about performance for element-by-element access patterns. Does LLVM hoist the conjugated test out of a loop? If not, any chance that changing it to an if statement would cause it to be hoisted? (Without hoisting, your ifelse is much better.) If there's no hoisting, what's the performance hit?

andreasnoack · 2016-04-29T13:42:34Z

Thanks for the comments. I'll look into them and there is still work to do here and more benchmarks are needed to detect if the new methods are efficient. Right now, I wanted to direct the attention to the compile time issue which is quite visible in

https://travis-ci.org/JuliaLang/julia/jobs/126553560#L1648

My feeling is that unless something can be done with the compile time this change is simply not feasible.

timholy · 2016-04-29T14:29:16Z

Yeah, I noticed that. Quite a bummer.

StefanKarpinski · 2016-11-10T17:57:01Z

Bump. Needs to happen soon to go into 0.6.

JeffBezanson · 2016-11-10T18:36:03Z

Is this still blocked on a compiler performance problem? Is there an issue for it?

andreasnoack · 2016-11-10T18:42:44Z

I don't think the compile time is a blocker anymore. The huge number of method compilations could be avoided by making sure that elements in A, B, and C got promoted before calling the actual multiplication routine.

StefanKarpinski · 2016-11-11T16:21:06Z

That sounds promising. It would be great if this could get done in time for the 0.6 feature freeze at end of year.

andyferris · 2016-12-02T01:47:36Z

I said it in the line comments, but I would prefer that we punt entirely on conjugation here. I'm hoping that between my TransposedVector, this Transpose/TransposedMatrix and a new Conj wrapper, we could replace the Ac_mul_Bc class of function with methods on *.

I feel that having the conjugation more modular should be a conceptual advantage (and probably better for developers). For instance, you can have Conj of a higher-dimensional array (but not a transpose).

JeffBezanson · 2017-01-04T16:01:38Z

I believe we're aiming for RowVector in 0.6, and TransposedMatrix later, so changing milestone.

ViralBShah · 2017-11-19T00:20:03Z

@andreasnoack Are we likely to be able to make this by feature freeze?

mbauman · 2017-11-19T04:34:35Z

A much more up-to-date version here is in #23424.

andreasnoack · 2017-11-19T14:02:27Z

Yes. This is not relevant anymore. The current efforts are in #23424.

pygy mentioned this pull request May 24, 2014

Immutable tensors #6960

Closed

simonbyrne mentioned this pull request May 29, 2014

WIP: Sparse CSR Matrices #7029

Closed

timholy reviewed Apr 29, 2016
View reviewed changes

andreasnoack mentioned this pull request May 4, 2016

Infinitely growing compilation recursion #16201

Closed

andreasnoack mentioned this pull request Jun 2, 2016

improvements to A_mul_B! section of the manual #16702

Merged

tkelman mentioned this pull request Jun 5, 2016

document that 2-arg mul! methods trashes all arguments #16772

Closed

andreasnoack added this to the 0.6.0 milestone Sep 14, 2016

andreasnoack self-assigned this Sep 14, 2016

This was referenced Dec 21, 2016

Introduce RowVector as the transpose of a vector #19670

Merged

Reconsider uses of promote_op #19669

Closed

JeffBezanson modified the milestones: 1.x, 0.6.0, 1.0 Jan 4, 2017

ViralBShah added the triage This should be discussed on a triage call label Nov 19, 2017

andreasnoack closed this Nov 19, 2017

andreasnoack deleted the anj/transpose branch November 19, 2017 14:02

JeffBezanson removed the triage This should be discussed on a triage call label Nov 20, 2017

Sacha0 mentioned this pull request Dec 2, 2017

Taking matrix transposes seriously #20978

Closed

WIP:Add Transpose immutable #6837

WIP:Add Transpose immutable #6837

Conversation

andreasnoack commented May 14, 2014

tkelman commented May 14, 2014

jiahao commented May 14, 2014

toivoh commented May 14, 2014

jiahao commented May 14, 2014

kmsquire commented May 14, 2014

mbauman commented May 14, 2014

tknopp commented May 14, 2014

jiahao commented May 14, 2014

StefanKarpinski commented May 14, 2014

toivoh commented May 14, 2014

andreasnoack commented May 14, 2014

simonbyrne commented May 21, 2014

StefanKarpinski commented May 21, 2014

simonbyrne commented May 21, 2014

simonbyrne commented May 22, 2014

Jutho commented May 22, 2014

StefanKarpinski commented May 22, 2014

toivoh commented May 22, 2014

StefanKarpinski commented May 23, 2014

Jutho commented May 25, 2014

simonbyrne commented May 27, 2014

simonbyrne commented May 28, 2014

StefanKarpinski commented May 28, 2014

simonbyrne commented May 28, 2014

lindahua commented May 28, 2014

jiahao commented May 28, 2014

StefanKarpinski commented May 28, 2014

simonbyrne commented May 29, 2014

timholy Apr 29, 2016

Choose a reason for hiding this comment

timholy commented Apr 29, 2016

andreasnoack commented Apr 29, 2016 • edited Loading

timholy commented Apr 29, 2016

StefanKarpinski commented Nov 10, 2016

JeffBezanson commented Nov 10, 2016

andreasnoack commented Nov 10, 2016

StefanKarpinski commented Nov 11, 2016

andyferris commented Dec 2, 2016

JeffBezanson commented Jan 4, 2017

ViralBShah commented Nov 19, 2017 • edited Loading

mbauman commented Nov 19, 2017

andreasnoack commented Nov 19, 2017

andreasnoack commented Apr 29, 2016 •

edited

Loading

ViralBShah commented Nov 19, 2017 •

edited

Loading