`nan` propagation in matrix multiplication #340

ivirshup · 2020-04-27T07:25:04Z

Describe the bug

Matrix multiplication does not propagate nan like numpy does.

To Reproduce

import numpy as np
import sparse

A = np.eye(4)
A[2, 2] = np.nan

np.eye(4) @ A
# array([[ 1.,  0., nan,  0.],
#        [ 0.,  1., nan,  0.],
#        [ 0.,  0., nan,  0.],
#        [ 0.,  0., nan,  1.]])

(sparse.eye(4) @ sparse.COO(A)).todense()
# array([[ 1.,  0.,  0.,  0.],
#        [ 0.,  1.,  0.,  0.],
#        [ 0.,  0., nan,  0.],
#        [ 0.,  0.,  0.,  1.]])

Expected behavior

I would expect the same values from numpy and sparse. However, I see that nan breaks the expected sparsity pattern, which could make efficient implementation difficult. Maybe a warning or error would be appropriate?

System

OS and version: macOS 10.15.3
sparse version (sparse.__version__) '0.9.1'
NumPy version (np.__version__) '1.18.3'
Numba version (numba.__version__) '0.49.0'

Additional context

Scipy sparse matrices have the same behavior: BUG: scipy.sparse.csc_matrix: matrix multiplication with nan scipy/scipy#7532
I believe a similar topic was touched on in Support Everything that XArray Expects #1, for checking results of operations against the fill value.

The text was updated successfully, but these errors were encountered:

hameerabbasi · 2020-04-27T07:27:16Z

Yes, this is because we (perhaps incorrectly) assume that 0 * x == 0 in matmul, and nowhere else.

hameerabbasi · 2020-04-27T07:34:10Z

Is there a use-case for this behaviour?

ivirshup · 2020-04-27T08:04:08Z

I'm doing outer joins, which I've ended up implementing with matrix multiplication since that works fairly consistently across array types. This was an inconsistency I came across. I've already worked around it, just thought it would be good for this behavior to be defined since I wasn't expecting this.

FWIW, Julia sparse arrays do the same thing.

using LinearAlgebra, SparseArrays

S = sparse(1.0I, 4, 4)
S[3, 3] = NaN
sparse(I, 4, 4) * S
# 4×4 SparseMatrixCSC{Float64,Int64} with 4 stored entries:
#   [1, 1]  =  1.0
#   [2, 2]  =  1.0
#   [3, 3]  =  NaN
#   [4, 4]  =  1.0

hameerabbasi · 2020-06-27T10:02:53Z

A short-term solution here would be to have a warning (like you said), (PRs welcome). If we want a long-term solution, we're looking for #365.

ivirshup added the bug Indicates an unexpected problem or unintended behavior label Apr 27, 2020

hameerabbasi mentioned this issue Jun 19, 2020

Incorrect results with differently ordered coords #360

Closed

sayandip18 mentioned this issue Apr 17, 2021

Add warning in matmul #459

Closed

This was referenced Apr 25, 2021

Added Warnings for NaN propagation. #468

Closed

Added warnings for NaN propagation #469

Merged

hameerabbasi closed this as completed in #469 Apr 25, 2021

ivirshup mentioned this issue Dec 11, 2023

Support for sparse matrices that do not default to 0 but NaN instead scverse/anndata#1236

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`nan` propagation in matrix multiplication #340

`nan` propagation in matrix multiplication #340

ivirshup commented Apr 27, 2020 •

edited

Loading

hameerabbasi commented Apr 27, 2020

hameerabbasi commented Apr 27, 2020

ivirshup commented Apr 27, 2020

hameerabbasi commented Jun 27, 2020

nan propagation in matrix multiplication #340

nan propagation in matrix multiplication #340

Comments

ivirshup commented Apr 27, 2020 • edited Loading

hameerabbasi commented Apr 27, 2020

hameerabbasi commented Apr 27, 2020

ivirshup commented Apr 27, 2020

hameerabbasi commented Jun 27, 2020

`nan` propagation in matrix multiplication #340

`nan` propagation in matrix multiplication #340

ivirshup commented Apr 27, 2020 •

edited

Loading