stricter buffers in SparseMatrixCSC #40523

abraunst · 2021-04-19T05:50:15Z

This is a rebase of #30676, without the addition of a capacity-querying function.

The general idea is about making length(nonzeros(A))==nnz(A) in SparseMatrixCSC, with strict buffer checking on exported functions (a few internal functions still return illegal buffers because the construction is done in several places, in that cases we create an empty matrix and resize the buffers afterwards. They should be clearly marked by comments)

Adresses #30662. See also: #26560, #30435.

Still WIP.

abraunst · 2021-04-19T10:52:10Z

rebasing

abraunst · 2021-04-20T06:13:31Z

Test errors seem unrelated, rebasing.

abraunst · 2021-04-20T09:16:13Z

I'd say the failing check is unrelated. Could we run nanosoldier to look for performance regressions?

KristofferC · 2021-04-20T10:04:22Z

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2021-04-20T13:16:44Z

Something went wrong when running your job:

NanosoldierError: failed to run benchmarks against comparison commit: failed process: Process(`sudo /run/media/system/data/nanosoldier/cset/bin/cset shield -e su nanosoldier -- -c /run/media/system/data/nanosoldier/workdir/jl_CZM7oH/benchscript.sh`, ProcessExited(1)) [1]

Logs and partial data can be found here
cc @christopher-dG

abraunst · 2021-04-20T16:42:23Z

Is he on strike?

abraunst · 2021-04-22T05:57:43Z

can we uhh... retry?

KristofferC · 2021-04-22T06:29:25Z

Looking at https://github.com/JuliaCI/NanosoldierReports/blob/master/benchmark/by_hash/421a07f_vs_d998c7e/logs/d998c7e74c84bb6f78916e71bf4efab936347170_against.out it seems it gets stuck at

loading group "sort"... done (took 0.30277937 seconds)
loading group "sparse"...

That's a bit suspicious since this code touches the sparse stuff.

abraunst · 2021-04-22T06:32:55Z

Looking at https://github.com/JuliaCI/NanosoldierReports/blob/master/benchmark/by_hash/421a07f_vs_d998c7e/logs/d998c7e74c84bb6f78916e71bf4efab936347170_against.out it seems it gets stuck at
loading group "sort"... done (took 0.30277937 seconds)
loading group "sparse"... 
That's a bit suspicious since this code touches the sparse stuff.

ah ... I'll have a look at that

KristofferC · 2021-04-22T06:40:35Z

I tried it locally and it had no problem...

loading group "scalar"... done (took 32.864596103 seconds)
loading group "sort"... done (took 2.51484769 seconds)
loading group "sparse"... done (took 9.814697974 seconds)
loading group "collection"... done (took 13.754587869 seconds)

Let's try again then

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2021-04-22T10:05:07Z

Something went wrong when running your job:

NanosoldierError: failed to run benchmarks against comparison commit: failed process: Process(`sudo /run/media/system/data/nanosoldier/cset/bin/cset shield -e su nanosoldier -- -c /run/media/system/data/nanosoldier/workdir/jl_2biAVI/benchscript.sh`, ProcessExited(1)) [1]

Logs and partial data can be found here
cc @christopher-dG

vtjnash · 2021-04-22T13:56:32Z

Ah, sorry, master is broken right now ("comparison commit") because we need this PR.

@nanosoldier runbenchmarks("sparse" || "problem" || "broadcast" || "linalg", vs="@671bccba27e69b40dabe73409df2b1feba25944c")

nanosoldier · 2021-04-22T15:24:59Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @christopher-dG

abraunst · 2021-04-22T15:56:30Z

If I understand correctly the only things that do not seem noise are 2 memory regressions in broadcast ["broadcast", "sparse", ("(1000, 1000)", 1)] and ["broadcast", "sparse", ("(10000000,)", 1)] (possibly fault of the PR) and one of performance ["sparse", "index", ("spmat", "col", "array", 1000)], but this seems a fluke (it is just getindex, that should not be affected)

vtjnash · 2021-04-22T16:05:08Z

Yeah, LGTM

KristofferC · 2021-04-22T17:20:54Z

stdlib/SuiteSparse/src/cholmod.jl

@@ -1970,4 +1957,58 @@ end
 (*)(A::SparseVecOrMat{Float64,Ti},
    B::Hermitian{Float64,SparseMatrixCSC{Float64,Ti}}) where {Ti} = sparse(Sparse(A)*Sparse(B))

+# Sort all the indices in each column for the construction of a CSC sparse matrix
+# sortBuffers!(A, sortindices = :sortcols)        # Sort each column with sort()
+function sortBuffers!(m, n, colptr::Vector{Ti}, rowval::Vector{Ti}, nzval::Vector{Tv}) where {Ti <: Integer, Tv}


Nit, but this looks a bit non-standard to me, any special reason for the camel case?

You're right, this comes from sortSparseMatrixCSC that was in sparsematrix.jl (which I didn't even remove for the moment), I'll rename it (it should not be exported though).

vtjnash · 2021-04-22T20:07:09Z

I think this can be merged, once the reviewer comments are addressed. Any objections?

abraunst · 2021-04-22T20:36:21Z

the memory regression should not be there, I'll investigate...

* Add sizehint!(::SparseMatrixCSC, args...), * Fix illegal SparseMatrixCSC construction in cholmod and linalg. * Remove tests targetting now illegal buffers * Fix invalid buffer creation in kron and more

…tems

…rt_buffers!

abraunst · 2021-04-23T10:47:38Z

Should be fixed. The problem was that by removing the capacity function from the PR, we lose track of what is the target length of the buffers (which were sizehinted appropriately, but the allocated length was inaccessible). So I resorted to returning invalid buffers from _allocres (which is internal). Can we do another round of benchmarking?

KristofferC · 2021-04-23T11:22:50Z

@nanosoldier runbenchmarks("sparse" || "problem" || "broadcast" || "linalg", vs="@671bccba27e69b40dabe73409df2b1feba25944c")

nanosoldier · 2021-04-23T12:40:10Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @christopher-dG

abraunst · 2021-04-23T14:42:17Z

The memory stuff seems gone. The other performance stuff that appeared seems funky...

Make length(A.nzval)==nnz(A) and add strict buffer checking (#30662) * Add sizehint!(::SparseMatrixCSC, args...), * Fix illegal SparseMatrixCSC construction in cholmod and linalg. * Remove tests targetting now illegal buffers * Fix invalid buffer creation in kron and more * use widelength in sizehint! to cope with large matrices in 32 bit systems

With this patch the output buffers to `sparse!` are resized in order to satisfy the buffer length checks in the `SparseMatrixCSC` constructor that were introduced in JuliaLang/julia#40523. Previously `csccolptr` was never resized, and `cscrowval` and `cscnzval` were only resized if the buffers were too short (i.e. never truncated). The requirement `length(csccolptr) >= n + 1` could be kept, but seems unnecessary since all buffers need to be resized anyway (to pass the constructor checks). In particular this fixes calling `sparse!` with `I`, `J`, `V` as both input and output buffers: `sparse!(I, J, V, m, n, ..., I, J, V)`. Fixes #313.

…314) With this patch the output buffers to `sparse!` are resized in order to satisfy the buffer length checks in the `SparseMatrixCSC` constructor that were introduced in JuliaLang/julia#40523. Previously `csccolptr` was never resized, and `cscrowval` and `cscnzval` were only resized if the buffers were too short (i.e. never truncated). The requirement `length(csccolptr) >= n + 1` could be kept, but seems unnecessary since all buffers need to be resized anyway (to pass the constructor checks). In particular this fixes calling `sparse!` with `I`, `J`, `V` as both input and output buffers: `sparse!(I, J, V, m, n, ..., I, J, V)`. Fixes #313.

…314) With this patch the output buffers to `sparse!` are resized in order to satisfy the buffer length checks in the `SparseMatrixCSC` constructor that were introduced in JuliaLang/julia#40523. Previously `csccolptr` was never resized, and `cscrowval` and `cscnzval` were only resized if the buffers were too short (i.e. never truncated). The requirement `length(csccolptr) >= n + 1` could be kept, but seems unnecessary since all buffers need to be resized anyway (to pass the constructor checks). In particular this fixes calling `sparse!` with `I`, `J`, `V` as both input and output buffers: `sparse!(I, J, V, m, n, ..., I, J, V)`. Fixes #313. (cherry picked from commit 85a381b)

vtjnash mentioned this pull request Apr 19, 2021

Make length(A.nzval)==nnz(A) #30662 #30676

Closed

abraunst force-pushed the strictsparse branch from 729250d to 0840866 Compare April 19, 2021 10:51

dkarrasch added the sparse Sparse arrays label Apr 19, 2021

abraunst force-pushed the strictsparse branch from 0840866 to 46e110f Compare April 20, 2021 06:12

rikhuijzer mentioned this pull request Apr 22, 2021

Improve type stability for tryparse VersionNumber #40557

Merged

vtjnash mentioned this pull request Apr 22, 2021

[SparseArrays] similar on sparse matrix returned uninitialized space #40444

Merged

KristofferC reviewed Apr 22, 2021

View reviewed changes

vtjnash added triage This should be discussed on a triage call and removed triage This should be discussed on a triage call labels Apr 22, 2021

abraunst added 4 commits April 23, 2021 07:49

* Make length(A.nzval)==nnz(A) and add strict buffer checking #30662.

2863cb7

* Add sizehint!(::SparseMatrixCSC, args...), * Fix illegal SparseMatrixCSC construction in cholmod and linalg. * Remove tests targetting now illegal buffers * Fix invalid buffer creation in kron and more

remove two more tests

7b6054d

use widelength in sizehint! to cope with large matrices in 32 bit sys…

48ab6f2

…tems

remove unused method sortSparseMatrixCSC!, rename sortBuffers! -> _so…

c075d00

…rt_buffers!

fix regression in memory allocation

42467d4

abraunst force-pushed the strictsparse branch from 46e110f to 42467d4 Compare April 23, 2021 10:42

vtjnash changed the title ~~WIP: stricter buffers in SparseMatrixCSC~~ stricter buffers in SparseMatrixCSC Apr 23, 2021

vtjnash added the merge me PR is reviewed. Merge when all tests are passing label Apr 23, 2021

vtjnash merged commit 248c02f into JuliaLang:master Apr 24, 2021

dkarrasch removed the merge me PR is reviewed. Merge when all tests are passing label Apr 24, 2021

ChrisRackauckas mentioned this pull request Jul 14, 2021

sparse Jacobians fail with Julia 1.7 beta3 SciML/Sundials.jl#315

Closed

mtfishman mentioned this pull request Sep 17, 2021

[BUG] adjacency_matrix fails for SimpleGraph with self-loops (Julia 1.7) sbromberger/LightGraphs.jl#1594

Open

mkitti mentioned this pull request Dec 14, 2021

Sparse matrix error Julia 1.7, stricter buffers in SparseMatrixCSC JuliaIO/MAT.jl#169

Closed

KristofferC mentioned this pull request Jan 14, 2022

Make length(A.nzval)==nnz(A) JuliaSparse/SparseArrays.jl#44

Closed

abraemer mentioned this pull request Feb 14, 2022

Change type signature of _checkbuffers and _goodbuffers JuliaSparse/SparseArrays.jl#77

Merged

fredrikekre mentioned this pull request Dec 8, 2022

Buffer checking breaks buffer re-use JuliaSparse/SparseArrays.jl#313

Closed

fredrikekre mentioned this pull request Dec 9, 2022

Resize buffers in sparse! to satisfy buffer checks in constructor JuliaSparse/SparseArrays.jl#314

Merged

gdalle mentioned this pull request Aug 6, 2024

Introduce coloring results gdalle/SparseMatrixColorings.jl#38

Merged

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stricter buffers in SparseMatrixCSC #40523

stricter buffers in SparseMatrixCSC #40523

abraunst commented Apr 19, 2021

abraunst commented Apr 19, 2021

abraunst commented Apr 20, 2021

abraunst commented Apr 20, 2021

KristofferC commented Apr 20, 2021

nanosoldier commented Apr 20, 2021

abraunst commented Apr 20, 2021

abraunst commented Apr 22, 2021

KristofferC commented Apr 22, 2021

abraunst commented Apr 22, 2021

KristofferC commented Apr 22, 2021

nanosoldier commented Apr 22, 2021

vtjnash commented Apr 22, 2021

nanosoldier commented Apr 22, 2021

abraunst commented Apr 22, 2021

vtjnash commented Apr 22, 2021

KristofferC Apr 22, 2021 •

edited

Loading

abraunst Apr 22, 2021

vtjnash commented Apr 22, 2021

abraunst commented Apr 22, 2021

abraunst commented Apr 23, 2021

KristofferC commented Apr 23, 2021

nanosoldier commented Apr 23, 2021

abraunst commented Apr 23, 2021

stricter buffers in SparseMatrixCSC #40523

stricter buffers in SparseMatrixCSC #40523

Conversation

abraunst commented Apr 19, 2021

abraunst commented Apr 19, 2021

abraunst commented Apr 20, 2021

abraunst commented Apr 20, 2021

KristofferC commented Apr 20, 2021

nanosoldier commented Apr 20, 2021

abraunst commented Apr 20, 2021

abraunst commented Apr 22, 2021

KristofferC commented Apr 22, 2021

abraunst commented Apr 22, 2021

KristofferC commented Apr 22, 2021

nanosoldier commented Apr 22, 2021

vtjnash commented Apr 22, 2021

nanosoldier commented Apr 22, 2021

abraunst commented Apr 22, 2021

vtjnash commented Apr 22, 2021

KristofferC Apr 22, 2021 • edited Loading

Choose a reason for hiding this comment

abraunst Apr 22, 2021

Choose a reason for hiding this comment

vtjnash commented Apr 22, 2021

abraunst commented Apr 22, 2021

abraunst commented Apr 23, 2021

KristofferC commented Apr 23, 2021

nanosoldier commented Apr 23, 2021

abraunst commented Apr 23, 2021

KristofferC Apr 22, 2021 •

edited

Loading