LinearAlgebra: adjoint for bidiag/tridiag may preserve structure #54027

jishnub · 2024-04-10T17:49:31Z

After this,

julia> B = Bidiagonal(rand(ComplexF64,3), rand(ComplexF64,2), :U)
3×3 Bidiagonal{ComplexF64, Vector{ComplexF64}}:
 0.0444083+0.405889im  0.982608+0.798427im            ⋅    
           ⋅           0.723202+0.839624im    0.35967+0.482086im
           ⋅                    ⋅           0.0729189+0.835082im

julia> B'
3×3 Bidiagonal{ComplexF64, Base.ReshapedArray{ComplexF64, 1, Adjoint{ComplexF64, Vector{ComplexF64}}, Tuple{}}}:
 0.0444083-0.405889im           ⋅                     ⋅    
  0.982608-0.798427im  0.723202-0.839624im            ⋅    
           ⋅            0.35967-0.482086im  0.0729189-0.835082im

julia> B'' === B # false on master
true

Similar changes for Tridiagonal. This makes matrix multiplication much more optimized, as we hit the O(N) tridiag methods. On master,

julia> T = Tridiagonal(rand(ComplexF64,399), rand(ComplexF64,400), rand(ComplexF64,399));

julia> v = rand(ComplexF64, size(T,2));

julia> @btime $T * $v;
  1.241 μs (3 allocations: 6.31 KiB)

julia> @btime $T' * $v;
  537.109 μs (3 allocations: 6.31 KiB)

This PR

julia> @btime $T' * $v;
  1.304 μs (4 allocations: 6.39 KiB)

This specialized `copyto!` for combinations of banded structured matrix types so that the copy may be O(N) instead of the fallback O(N^2) implementation. E.g.: ```julia julia> T = Tridiagonal(zeros(999), zeros(1000), zeros(999)); julia> B = Bidiagonal(ones(1000), fill(2.0, 999), :U); julia> @Btime copyto!($T, $B); 1.927 ms (0 allocations: 0 bytes) # master 229.870 ns (0 allocations: 0 bytes) # PR ``` This also changes the `copyto!` implementation for mismatched matrix sizes, bringing it closer to the docstring. So, the following works on master: ```julia julia> Ddest = Diagonal(zeros(4)); julia> Dsrc = Diagonal(ones(2)); julia> copyto!(Ddest, Dsrc) 4×4 Diagonal{Float64, Vector{Float64}}: 1.0 ⋅ ⋅ ⋅ ⋅ 1.0 ⋅ ⋅ ⋅ ⋅ 0.0 ⋅ ⋅ ⋅ ⋅ 0.0 ``` but this won't work anymore with this PR. This was inconsistent anyway, as materializing the matrices produces a different result, which shouldn't be the case: ```julia julia> copyto!(Matrix(Ddest), Dsrc) 4×4 Matrix{Float64}: 1.0 0.0 0.0 0.0 0.0 1.0 0.0 0.0 0.0 0.0 0.0 0.0 1.0 0.0 0.0 0.0 ``` After this PR, the way to carry out the copy would be ```julia julia> copyto!(Ddest, CartesianIndices(Dsrc), Dsrc, CartesianIndices(Dsrc)) 4×4 Diagonal{Float64, Vector{Float64}}: 1.0 ⋅ ⋅ ⋅ ⋅ 1.0 ⋅ ⋅ ⋅ ⋅ 0.0 ⋅ ⋅ ⋅ ⋅ 0.0 ``` This change fixes https://github.com/JuliaLang/julia/issues/46005. Also fixes https://github.com/JuliaLang/julia/issues/53997 After this, ```julia julia> @Btime copyto!(C, B) setup=(n = 1_000; B = Bidiagonal(randn(n), randn(n-1), :L); C = Bidiagonal(randn(n), randn(n-1), :L)); 158.405 ns (0 allocations: 0 bytes) julia> @Btime copyto!(C, B) setup=(n = 10_000; B = Bidiagonal(randn(n), randn(n-1), :L); C = Bidiagonal(randn(n), randn(n-1), :L)); 4.706 μs (0 allocations: 0 bytes) julia> @Btime copyto!(C, B) setup=(n = 100_000; B = Bidiagonal(randn(n), randn(n-1), :L); C = Bidiagonal(randn(n), randn(n-1), :L)); 120.880 μs (0 allocations: 0 bytes) ``` which is roughly linear scaling. Taken along with #54027, the speed-ups would also apply to the adjoints of banded matrices.

…iaLang#54027)

This specialized `copyto!` for combinations of banded structured matrix types so that the copy may be O(N) instead of the fallback O(N^2) implementation. E.g.: ```julia julia> T = Tridiagonal(zeros(999), zeros(1000), zeros(999)); julia> B = Bidiagonal(ones(1000), fill(2.0, 999), :U); julia> @Btime copyto!($T, $B); 1.927 ms (0 allocations: 0 bytes) # master 229.870 ns (0 allocations: 0 bytes) # PR ``` This also changes the `copyto!` implementation for mismatched matrix sizes, bringing it closer to the docstring. So, the following works on master: ```julia julia> Ddest = Diagonal(zeros(4)); julia> Dsrc = Diagonal(ones(2)); julia> copyto!(Ddest, Dsrc) 4×4 Diagonal{Float64, Vector{Float64}}: 1.0 ⋅ ⋅ ⋅ ⋅ 1.0 ⋅ ⋅ ⋅ ⋅ 0.0 ⋅ ⋅ ⋅ ⋅ 0.0 ``` but this won't work anymore with this PR. This was inconsistent anyway, as materializing the matrices produces a different result, which shouldn't be the case: ```julia julia> copyto!(Matrix(Ddest), Dsrc) 4×4 Matrix{Float64}: 1.0 0.0 0.0 0.0 0.0 1.0 0.0 0.0 0.0 0.0 0.0 0.0 1.0 0.0 0.0 0.0 ``` After this PR, the way to carry out the copy would be ```julia julia> copyto!(Ddest, CartesianIndices(Dsrc), Dsrc, CartesianIndices(Dsrc)) 4×4 Diagonal{Float64, Vector{Float64}}: 1.0 ⋅ ⋅ ⋅ ⋅ 1.0 ⋅ ⋅ ⋅ ⋅ 0.0 ⋅ ⋅ ⋅ ⋅ 0.0 ``` This change fixes https://github.com/JuliaLang/julia/issues/46005. Also fixes https://github.com/JuliaLang/julia/issues/53997 After this, ```julia julia> @Btime copyto!(C, B) setup=(n = 1_000; B = Bidiagonal(randn(n), randn(n-1), :L); C = Bidiagonal(randn(n), randn(n-1), :L)); 158.405 ns (0 allocations: 0 bytes) julia> @Btime copyto!(C, B) setup=(n = 10_000; B = Bidiagonal(randn(n), randn(n-1), :L); C = Bidiagonal(randn(n), randn(n-1), :L)); 4.706 μs (0 allocations: 0 bytes) julia> @Btime copyto!(C, B) setup=(n = 100_000; B = Bidiagonal(randn(n), randn(n-1), :L); C = Bidiagonal(randn(n), randn(n-1), :L)); 120.880 μs (0 allocations: 0 bytes) ``` which is roughly linear scaling. Taken along with JuliaLang/julia#54027, the speed-ups would also apply to the adjoints of banded matrices.

LinearAlgebra: adjoint for bidiag/tridiag may preserve structure

f834364

jishnub added the linear algebra Linear algebra label Apr 10, 2024

jishnub mentioned this pull request Apr 11, 2024

LinearAlgebra: copyto! between banded matrix types #54041

Merged

Merge branch 'master' into jishnub/bitridiagcomplexadj

ea02113

This comment was marked as outdated.

Sign in to view

dkarrasch merged commit 8ac1db5 into master May 20, 2024
5 of 7 checks passed

dkarrasch deleted the jishnub/bitridiagcomplexadj branch May 20, 2024 15:21

lazarusA pushed a commit to lazarusA/julia that referenced this pull request Jul 12, 2024

LinearAlgebra: adjoint for bidiag/tridiag may preserve structure (Jul…

753e737

…iaLang#54027)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LinearAlgebra: adjoint for bidiag/tridiag may preserve structure #54027

LinearAlgebra: adjoint for bidiag/tridiag may preserve structure #54027

jishnub commented Apr 10, 2024

This comment was marked as outdated.

LinearAlgebra: adjoint for bidiag/tridiag may preserve structure #54027

LinearAlgebra: adjoint for bidiag/tridiag may preserve structure #54027

Conversation

jishnub commented Apr 10, 2024

This comment was marked as outdated.