
Arithmetic between real arrays and Python complex scalars (revisited) #841

Open
mdhaber opened this issue Sep 11, 2024 · 2 comments
Labels: API change (Changes to existing functions or objects in the API), Needs Discussion (Needs further discussion), topic: Complex Data Types (Complex number data types), topic: Type Promotion (Type promotion)
@mdhaber (Contributor) commented Sep 11, 2024

The array API standard is now clear (in Type Promotion Rules):

[screenshot of the standard's Type Promotion Rules text]

This means that the following does not have defined behavior:

```python
x = xp.asarray([1., 2., 3.])
x + 1j
```

However, the behavior is defined consistently in NumPy, CuPy, PyTorch, jax.numpy [1], Dask array, and TensorFlow [2] in the intuitive way: the real and imaginary component dtypes of the complex output array match the dtype of the input array.
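For instance, here is a sketch of NumPy's behavior; the same pattern holds for the other libraries listed, subject to the footnoted caveats:

```python
import numpy as np

x = np.asarray([1., 2., 3.], dtype=np.float32)
y = x + 1j  # Python complex scalar + float32 array

# The result is complex, with component precision matching the input array:
print(y.dtype)  # complex64
```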

This issue was discussed explicitly in gh-478 beginning with #478 (comment). I see several comments that seem supportive of allowing this operation, e.g.

I would hope it at least works for Python scalars. x + 0j seems like the simplest and most obvious way to convert a real array to a complex one.

The comment I see against it (#478 (comment)) is

I know it's very common to do this (also * 1.0 to convert integer to real). However, it doesn't seem like great practice irrespective of the decision here - there's an explicit astype function for converting to other dtypes.

It looked like the summary comments suggested that this would be allowed, but it was not specifically addressed in the PR that closed the issue. (See postscript for more information.)

I thought it might help to add some perspective on how this impacts a developer translating code to the standard: when tests including an operation like

```python
x + 1j
```

begin to fail with array_api_strict only, the developer is faced with the choice of skipping the array_api_strict tests or figuring out what is wrong and changing the code to something like:

```python
xp.astype(x, xp.result_type(x.dtype, xp.complex64)) + 1j
```

If they also want it to preserve the lower-precision types supported by some libraries (e.g. for NumPy), there are additional hurdles. I tested a few libraries, and the array-API-standard-compatible version of the code also notably increases execution time for small arrays. Individually, the inconvenience, complexity, and overhead are small, but they add up. I am excited about adding array API support to SciPy, but not everyone is supportive, and performance regressions and complicated-looking diffs can make it difficult to garner support.
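A sketch of how the precision-preserving workaround behaves, using NumPy concretely (the `astype` method here stands in for the standard's `xp.astype` function):

```python
import numpy as np

x = np.asarray([1., 2., 3.], dtype=np.float32)

# result_type(float32, complex64) -> complex64, so the input's float32
# precision is preserved rather than promoting straight to complex128.
target = np.result_type(x.dtype, np.complex64)
y = x.astype(target) + 1j  # the standard spells this xp.astype(x, target)
print(y.dtype)  # complex64
```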

I would suggest that the operation be defined, even if it means adding an exception to the simply stated rules for array/Python arithmetic.


The more specific language about this not being defined was added in gh-513, which provided the justification:

Given that current guidance requires converting a scalar to the array data type, to accommodate complex scalars and real-valued arrays, we'd need to specify casting rules for complex to real,

I think that is referring to the language:
[screenshot of the quoted guidance text]

That guidance was added in gh-74, but I can't trace the origin further. Perhaps there can be exceptions to that rule?

Footnotes

  1. When higher-precision support is enabled; see the changelog.

  2. With NumPy experimental behavior (https://github.com/data-apis/array-api/issues/478#issuecomment-1272631595). Vanilla TensorFlow doesn't seem to support the other array-scalar operations that are defined, but this was already not deemed sufficient reason to prevent their inclusion in the standard (https://github.com/data-apis/array-api/issues/478#issuecomment-1270409660).

@kgryte kgryte added API change Changes to existing functions or objects in the API. topic: Complex Data Types Complex number data types. topic: Type Promotion Type promotion. Needs Discussion Needs further discussion. labels Sep 11, 2024
@kgryte kgryte added this to the v2024 milestone Sep 11, 2024
@asmeurer (Member) commented:

I still agree that this should be supported. We should be careful to not confuse downcasting a complex array (or scalar) to real float, which is indeed not supported and for very good reasons, with upcasting a real float to complex.

In fact, real and complex floating-point arrays are already supported to promote together:

```python
>>> import array_api_strict as xp
>>> xp.asarray([0., 1.]) + xp.asarray([0j, 1j])
Array([0.+0.j, 1.+1.j], dtype=array_api_strict.complex128)
```

So it seems odd that this doesn't work for Python scalars.

The piece of text that @mdhaber quoted at the top of the issue seems to imply that there is ambiguity about whether the resulting array should be complex64 or complex128. But I think this is already well spelled out for the Python float case: https://data-apis.org/array-api/latest/API_specification/type_promotion.html#mixing-arrays-with-python-scalars

  1. Convert the scalar to zero-dimensional array with the same data type as that of the array used in the expression.

  2. Execute the operation for `array <op> 0-D array` (or `0-D array <op> array` if the scalar was the left-hand argument).

i.e., float32_array + python_float gives float32 and float64_array + python_float gives float64. In other words, the corresponding rule should be that a float32_array + python_complex gives a complex64 array and float64_array + python_complex gives a complex128 array.
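NumPy already implements both the existing float rule and this proposed complex analogue, so it can serve as a quick illustration (a sketch of observed behavior, not normative text):

```python
import numpy as np

x32 = np.asarray([1., 2.], dtype=np.float32)
x64 = np.asarray([1., 2.], dtype=np.float64)

# Existing rule: a Python float adopts the array's precision.
print((x32 + 1.0).dtype)  # float32
print((x64 + 1.0).dtype)  # float64

# Proposed complex analogue: precision is preserved, only the kind widens.
print((x32 + 1j).dtype)   # complex64
print((x64 + 1j).dtype)   # complex128
```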

I think the real question is, what is the idea behind the rule "convert the scalar to zero-dimensional array with the same data type as that of the array"? If the idea is that "when adding a Python scalar to an array, the dtype doesn't change", then indeed real_array + complex_scalar shouldn't be supported. But if it's rather that "the precision doesn't change", then it should be supported.

@rgommers (Member) commented Oct 3, 2024

```python
xp.astype(x, xp.result_type(x.dtype, xp.complex64)) + 1j
```

This is quite clumsy indeed. Independent of the decision on allowing x + 1j, I think we should also make this nicer by supporting dtype kinds in astype, so the above becomes:

```python
xp.astype(x, 'complex')  # or maybe 'complex floating', if we use the isdtype kind specifiers
```

Doing that would also address the problem for unsigned -> signed integers, which cannot be done with a Python scalar equivalent like + 0j.
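A hypothetical sketch of what kind-based casting could look like, written against NumPy. The helper name `astype_kind` and its kind strings are illustrative only, not part of any standard or library:

```python
import numpy as np

def astype_kind(x, kind):
    """Illustrative only: cast `x` to a dtype *kind* while preserving
    its precision (e.g. float32 -> complex64, uint32 -> int32)."""
    if kind in ("complex", "complex floating"):
        return x.astype(np.result_type(x.dtype, np.complex64))
    if kind == "signed integer":
        # Same width, signed: the unsigned -> signed case that has no
        # Python-scalar equivalent like `+ 0j`.
        return x.astype(x.dtype.str.replace("u", "i"))
    raise ValueError(f"unsupported kind: {kind!r}")

print(astype_kind(np.ones(3, dtype=np.float32), "complex").dtype)        # complex64
print(astype_kind(np.ones(3, dtype=np.uint32), "signed integer").dtype)  # int32
```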
