sparse-constructor does not always purge zeros #9928

Closed · mauro3 opened this issue Jan 26, 2015 · 23 comments
Labels: sparse (Sparse arrays)

Comments

@mauro3 (Contributor) commented Jan 26, 2015

The sparse constructor should purge all zeros, but it does not when two entries at the same position combine to zero:

julia> sparse([1,1], [2,2], [1,-1])
1x2 sparse matrix with 1 Int64 entries:
        [1, 2]  =  0

Expected output:

julia> sparse([1,1], [2,2], [1,-1])
1x2 sparse matrix with 0 Int64 entries:
@mlubin (Member) commented Jan 26, 2015

If handling this case requires a lot of extra work and reallocations, I wouldn't say this is necessarily a bug.

@tkelman (Contributor) commented Jan 26, 2015

What do Matlab, Octave, or SciPy do here?

@mauro3 (Contributor, Author) commented Jan 26, 2015

I think Matlab never stores explicit zeros; it certainly removes them in this case.

SciPy is in the other camp and never removes zeros unless explicitly told to with c.eliminate_zeros().

Looks like Julia is in the middle.
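
For concreteness, a minimal sketch of that middle ground, assuming the 0.3-era behavior reported later in this thread (indexed assignment purges; mutating nzval directly does not):

A = speye(2)     # 2 stored entries
A[1,1] = 0       # indexed assignment purges the entry: nnz(A) == 1
B = speye(2)
B.nzval[1] = 0   # mutating the field directly leaves a stored zero: nnz(B) == 2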

@StefanKarpinski (Member)

Is it just me, or does being in the middle seem like the worst of both worlds? Both of the other behaviors are easy to predict and understand. Currently, when you ask "does this zero end up stored?", the only answer we can give is "it depends".

@tkelman (Contributor) commented Jan 26, 2015

To some extent, yes. I think the idea at the moment is that we're trying to have a high-level, user-friendly interface for sparse matrices that does remove zeros, and a low-level, high-performance way of working with sparse matrices for experts and library authors that doesn't. If we're going to pick a direction and move more consistently one way or the other, I'd very much prefer going in the SciPy direction rather than the Matlab direction.

@mauro3 (Contributor, Author) commented Jan 26, 2015

Yes, I agree, the SciPy direction would be better. It would also align with Julia's philosophy of not being too magical.

A bit off topic: here is what happens in SciPy if one assigns to an unallocated spot:

In [34]: c[1,1] = 4
/usr/lib/python3.4/site-packages/scipy/sparse/compressed.py:730: SparseEfficiencyWarning: Changing the sparsity structure of a csc_matrix is expensive. lil_matrix is more efficient.
  SparseEfficiencyWarning)

@mlubin (Member) commented Jan 27, 2015

I'm not really bothered by the behavior because zeros are harmless, but I also prefer the SciPy approach.

@ViralBShah (Member)

They are not completely harmless. Stored zeros will make some of our solvers trip, IIRC. We do need to remove these zeros in the default user-facing APIs. If someone wants to use stored zeros, I think it is reasonable to expect that they will work with the CSC data structure.

@tkelman (Contributor) commented Jan 27, 2015

I don't think removing them automatically is the right answer here. If there are a handful of places where they cause problems, then those call sites should explicitly canonicalize and remove stored zeros on entry, and these limitations should be documented.
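
As a sketch of that proposal (function names hypothetical; field access as used elsewhere in this thread), a call site that cannot tolerate stored zeros would canonicalize on entry:

# Hypothetical helper: rebuild A without its stored zeros, in one pass over
# the stored entries of the CSC structure.
function purge_stored_zeros{Tv,Ti}(A::SparseMatrixCSC{Tv,Ti})
    I = Ti[]; J = Ti[]; V = Tv[]
    for col in 1:A.n
        for k in A.colptr[col] : (A.colptr[col+1]-1)
            if A.nzval[k] != zero(Tv)
                push!(I, A.rowval[k]); push!(J, col); push!(V, A.nzval[k])
            end
        end
    end
    return sparse(I, J, V, A.m, A.n)
end

# Hypothetical solver entry point: canonicalize here, and document the
# "no stored zeros" limitation at this one call site.
function checked_solve(A::SparseMatrixCSC{Float64}, b::Vector{Float64})
    return purge_stored_zeros(A) \ b
end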

@timholy (Member) commented Jan 27, 2015

Also agreed that SciPy-like is the way to go here, and +1 for @tkelman's proposal.

@ViralBShah (Member)

What do stored zeros mean here? In the example presented by @mauro3, did the user intend to have a stored zero, or expect that the zero be removed?

The current sparse matrix design removes zeros that may appear as a result within an operation, much like Matlab and Octave. There are cases like this one where the stored zero is unintentional, and it should be considered a bug in the current design.

If we choose to go the SciPy route, then we could decide not to check for zeros that appear in the course of an operation, and let them become stored zeros. In that case, sparse() and setindex! could also allow a user to provide zeros in the input and have them stored. However, this is a design decision, and it requires the entire sparse matrix implementation to conform to it. It would eliminate zero checking in many places.

I am guessing that many users are familiar with the Matlab/Octave design, but developers like the SciPy choice, as it is much more flexible. I personally would be ok if we all want to change to the SciPy style. However, calling the current behaviour ok because we allow for stored zeros is confusing, as @StefanKarpinski notes. It should be well defined, consistent across all operations, and predictable.

@zouhairm

As an end user, I want to throw my 2 cents in: it is definitely confusing that I can't figure out whether to expect nzval entries to always be nonzero or not.

As pointed out in my email to the julia-users Google group, this behavior is particularly confusing to an end user, especially since the help for nnz and countnz implies that they are equivalent for sparse matrices:

help?> nnz
Base.nnz(A)
Returns the number of stored (filled) elements in a sparse matrix.

help?> countnz
Base.countnz(A)
Counts the number of nonzero values in array A (dense or sparse).
Note that this is not a constant-time operation. For sparse
matrices, one should usually use "nnz", which returns the number
of stored values.

A = spdiagm(ones(5));
println(nnz(A), countnz(A)) # prints "55": nnz = 5, countnz = 5
A[1,1] = 0
println(nnz(A), countnz(A)) # prints "44": indexed assignment purged the entry
A.nzval[1] = 0
println(nnz(A), countnz(A)) # prints "43": a stored zero remains, so only countnz drops

@tkelman (Contributor) commented Feb 13, 2015

Part of the problem is that when you say A[1,1] = 0 with a sparse A, you could mean a few different things. If A[1,1] is a stored entry, removing it from the sparse matrix requires an expensive reallocation of the entire data structure. If this sparse matrix is, say, the Jacobian in a nonlinear optimization problem where you're reusing the same structure repeatedly with different values, an entry could take a value that happens to equal 0 for a single iteration, and you don't want to reallocate the entire data structure for that.

We might need either different ways to say "0", where one means stored and one means non-stored, or different behaviors for different methods of assignment, which is what we have now. The documentation should certainly be clearer that a "stored nonzero" can have a value equal to 0.
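
A sketch of the reuse pattern described above (matrix and values are illustrative): keep the sparsity structure fixed and overwrite the stored values in place, so an entry passing through 0 never forces a reallocation:

J = sparse([1, 2, 2], [1, 1, 2], [1.0, 2.0, 3.0])  # fixed Jacobian structure
for iter in 1:5
    vals = [iter - 3.0, 2.0 * iter, 1.0]  # first value happens to be 0.0 at iter == 3
    copy!(J.nzval, vals)                  # in place: no structural change, no reallocation
end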

@mlubin (Member) commented Feb 13, 2015

@zouhairm, everyone wants something different from sparse matrices, so it would help to hear a bit about what you're trying to do with them. Currently, sparse matrices will only have stored zeros if the user explicitly puts them there.

@zouhairm

Sure.

I'm using sparse matrices to store the transition probabilities T in an MDP. The state space is huge, but only a few transitions are possible. When it comes to solving the MDP, there is a matrix-vector multiplication T*v to be done (v is not sparse). This is done one row at a time for algorithmic purposes (it speeds up convergence).

In the past, I have simply used dot() and passed it the appropriate row of T and v, but I have an application where the values in v are repeated (i.e. v = [v1, v1, v1, v1, ...]), and rather than storing all of v, I want to just store the subvector v1 and still carry out the dot product. So I just want to iterate over the relevant columns of T and index into v1 appropriately to carry out the dot product.

So if there are zeros stored in the matrix, it doesn't make a difference, since they won't change the result. But the fact that findn needs to check that every entry is != 0 in order to return the non-zero indices seems inefficient to me. I'm fine if findn returned indices to the "structural nonzeros", but of course I can see how this would be an issue for other users who expect the entries to be non-zero.

@mlubin (Member) commented Feb 13, 2015

Could you post some pseudocode on how you plan on extracting a row using findn? There's a fast way which doesn't involve findn and a number of potentially very slow ways. In either case, the overhead of checking for zeros in findn is negligible compared with making sure that the rest of the operation is implemented efficiently.

@stevengj (Member)

See also the discussion of nnz in #6769.

@zouhairm

@mlubin: I realized that Julia uses CSC rather than CSR, so extracting the stored rows of a given column is the cheap operation. So I'm just going to store the transpose of the matrix I care about and use this instead:

# Returns the row indices of the stored entries in a given column
function findn_rows{Tv,Ti}(S::SparseMatrixCSC{Tv,Ti}, colIdx::Integer)
    return S.rowval[S.colptr[colIdx] : (S.colptr[colIdx+1]-1)]
end

@mlubin (Member) commented Feb 13, 2015

@zouhairm, yes, that seems sensible. Be sure to avoid accessing elements as S[i,j] inside a loop; iterate the stored entries directly, as in the sketch below.
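
For instance, the dot product itself can iterate the stored entries of one column directly. A sketch under the assumptions above (row_dot is a hypothetical name; T is stored transposed so its rows become CSC columns, and v = [v1; v1; ...] is folded into v1 with mod1); stored zeros only contribute harmless zero terms:

# Dot product of row i of T (stored transposed as Tt) with the conceptually
# repeated vector [v1; v1; ...], touching only stored entries.
function row_dot{Tv,Ti}(Tt::SparseMatrixCSC{Tv,Ti}, i::Integer, v1::Vector{Float64})
    acc = 0.0
    for k in Tt.colptr[i] : (Tt.colptr[i+1]-1)
        acc += Tt.nzval[k] * v1[mod1(Tt.rowval[k], length(v1))]
    end
    return acc
end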

@ViralBShah (Member)

To fix this, we just need a pass in the sparse constructor to remove any stored zeros that occur from combination. This is pretty straightforward, but if one is feeling lazy, one could even just translate the cs_fkeep routine from CSparse.
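
As a sketch, that pass is a cs_fkeep-style in-place compaction over the CSC arrays (name hypothetical):

# Hypothetical pass: drop every stored zero by compacting rowval/nzval leftward.
function drop_stored_zeros!{Tv,Ti}(A::SparseMatrixCSC{Tv,Ti})
    nz = 1                        # write cursor into rowval/nzval
    colstart = A.colptr[1]
    for col in 1:A.n
        colend = A.colptr[col+1]  # save before overwriting colptr
        A.colptr[col] = nz
        for k in colstart:(colend - 1)
            if A.nzval[k] != zero(Tv)
                A.rowval[nz] = A.rowval[k]
                A.nzval[nz] = A.nzval[k]
                nz += 1
            end
        end
        colstart = colend
    end
    A.colptr[A.n + 1] = nz
    resize!(A.rowval, nz - 1)
    resize!(A.nzval, nz - 1)
    return A
end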

@KristofferC (Member)

Don't we already have fkeep?

function fkeep!{Tv,Ti}(A::SparseMatrixCSC{Tv,Ti}, f, other)

We also have dropzeros, which removes all stored zeros.
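
So, assuming the dropzeros mentioned above behaves as described, the fix for the original report reduces to one extra pass after the constructor combines duplicates; as a usage sketch:

S = sparse([1,1], [2,2], [1,-1])  # currently leaves a stored 0 at (1,2)
S = dropzeros(S)                  # stored 0 removed: 1x2 matrix with 0 stored entries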

@ViralBShah (Member)

Thanks. I was doing this at an airport with my laptop running out of charge. For now, I'll incorporate this, and the larger discussion on stored zeros can continue elsewhere later.

@tkelman (Contributor) commented Mar 5, 2016

Closed by #15242.
