`signal`: Add type stubs to `_waveforms.pyi`. #195

pavyamsiri · 2024-11-24T00:04:46Z

Contributes to closing #99.

Adds type stubs to the file _waveforms.pyi and corresponding type tests to test overload coverage for the function gausspulse.

Notes

I am not used to the new optype.numpy types so tell me if I did anything suboptimally or wrong.

Also for chirp I found that for the float dtypes, float16 and float32 that the output dtype is same as the input dtype but for everything else that is float64. I wanted to make an overload to cover this behaviour but I don't know how to do it without the overloads overlapping.

Technically for some of functions you can pass in complex dtypes and it will work fine, but the functions want times and I don't think they were expecting imaginary times i.e. some functions return just the real part but the array would be complex dtype if the input is complex. For this reason I didn't allow complex arrays.

Funny Bug

Also I discovered a 19 year old bug in scipy. The intention of sawtooth and square is that the output dtype is the same as the input t's dtype if the dtype is float32 or float64 otherwise it is always float64. But the check for float32 can never pass because

if t.dtype.char in ['fFdD']:
    ytype = t.dtype.char
else:
    ytype = 'd'

is checking if a single character string is an element of a list of a single string instead of checking if the character is part of the string like t.dtype.char in 'fFdD'.

This makes the type checking easier because the return type is only a single possible type, but I find it a bit funny that this has been here for so long.

jorenham

I am not used to the new optype.numpy types so tell me if I did anything suboptimally or wrong.

That's of course no problem. I'm actually didn't expect you to notice the new optype at all haha.

You're already using the ToFloat and ToInt correctly 👌🏻.
For the ToInt{1,2,N}D array-like aliases, there's only one (subtle) difference with the np._typing._ArrayLike{}_co analogues which you might have missed, namely that they don't accept "bare" scalars like float or np.float16. That way, the e.g. ToFloat and ToFloatND types don't overload, which can help a lot when writing overloads. It does so by additionally requiring a __len__ method to be implemented in the new onp.CanArrayND protocol, instead of just __array__ like with np._typing._SupportsArray.

Also for chirp I found that for the float dtypes, float16 and float32 that the output dtype is same as the input dtype but for everything else that is float64. I wanted to make an overload to cover this behaviour but I don't know how to do it without the overloads overlapping.

I left a suggestion on a possible way to deal with that. But it's also fine by me without any overloads. Because usually functions that accept e.g. float64 arrays, also accept float16 arrays (although there are a couple of exceptions to this as you know).

Technically for some of functions you can pass in complex dtypes and it will work fine, but the functions want times and I don't think they were expecting imaginary times i.e. some functions return just the real part but the array would be complex dtype if the input is complex. For this reason I didn't allow complex arrays.

Ah yes I commented on one such case before I read this. As I also said there; I have no idea what "complex time" is used for, and have never seen it used in statistical and econometric literature. But if you decide to allow it, be sure to use a separate overload, so that real input results in real output.
Anyway, I'll leave this decision up to you.

scipy-stubs/signal/_waveforms.pyi

tests/signal/test_waveforms.pyi

pavyamsiri · 2024-11-24T03:24:27Z

The gausspulse function has such a weird API. I did all the overloads I believe but this does bloat the stubs quite a bit.

pavyamsiri · 2024-11-24T03:36:05Z

It seems for some reason, the literal "cutoff" overlaps with onp.ToFloatND? Only for mypy it seems, pyright doesn't see an overlap.

pavyamsiri · 2024-11-24T06:04:33Z

I think mypy gets confused because of the definition of ToFloatND

ToFloatND: TypeAlias = _To1D[_f_co, float] | Sequence["ToFloatND"]

The self-referential type is ignored by mypy and it substitutes "ToFloatND" with ... meaning str fits because it is a Sequence[str] which leads to overload overlaps.

jorenham · 2024-11-24T17:15:03Z

I think mypy gets confused because of the definition of ToFloatND
ToFloatND: TypeAlias = _To1D[_f_co, float] | Sequence["ToFloatND"]
The self-referential type is ignored by mypy and it substitutes "ToFloatND" with ... meaning str fits because it is a Sequence[str] which leads to overload overlaps.

Yea you're right:
https://mypy-play.net/?mypy=latest&python=3.12&flags=strict&gist=18fea0a692fc85f5da18d00bc578b2fa

from collections.abc import Sequence
from typing import TypeAlias

FloatVector: TypeAlias = Sequence[float]
FloatTensor: TypeAlias = FloatVector | Sequence["FloatTensor"]

rejected_float_vector: FloatVector = "ok"       # OK: rejected

accepted_float_tensor: FloatTensor = [[[3.14]]] # OK: accepted
rejected_float_vector1: FloatTensor = object()  # OK: rejected
rejected_float_vector2: FloatTensor = "fail"    # FAIL: accepted

So even though recursive types should be supported since mypy 1.7 (python/mypy#731), in this case, it results in a false negative

jorenham · 2024-11-24T18:17:00Z

As it turns out, this isn't limited to float sequences.

I raised it at python/mypy#18184

In the meantime, I'll try to figure out a workaround for this and publish a new optype release.

jorenham · 2024-11-25T01:04:14Z

@pavyamsiri I released a new optype version that includes the workaround. So if you rebase your branch, then the overlapping overload errors should disappear.

…yi`. Some notes: 1. It seems the scipy implementation for `sawtooth` and `square` has a bug. The intention is that the dtype of the `t` array when it is `float32` or `float64` determines the output dtype but because the check wrongly uses a list instead of just a string this code path never runs. Therefore the output dtype is fixed to always be `float64`. This makes typing simpler however. This bug has been in the code for 19 years!

Wanted to add overloads depending on input dtype but because complex is a superclass of float, the overloads overlap and so we can't do it

Add type tests as well to check overloads are complete

float16, float32 preserve their while everything else becomes float64. Not sure how to represent this though.

In `_waveforms.pyi`

Ruff complains that we can only use simple values and `float` (the type) is not a simple value.

param

In `_waveforms.pyi`

`t` can be either an array or a scalar or "cutoff". Add corresponding tests

This doesn't fix `stubtest` however but it does fix `typetest`.

pavyamsiri · 2024-11-25T05:32:20Z

@pavyamsiri I released a new optype version that includes the workaround. So if you rebase your branch, then the overlapping overload errors should disappear.

Thanks for the help with the mypy bug. There's an issue remaining with the overloads for chirp however. I guess the new change has caused pyright to think onp.ToFloatND to overlap with numpy._typing._ArrayLike and so we get conflicting overlaps.

pavyamsiri · 2024-11-25T05:33:14Z

The simplest fix would be to just leave one of the overloads out, unless you know of a way to avoid this.

jorenham · 2024-11-25T11:47:14Z

I guess the new change has caused pyright to think onp.ToFloatND to overlap with numpy._typing._ArrayLike and so we get conflicting overlaps.

Well, I suppose that'd be correct; they actually overlap. I'm not sure why before this it wasn't detected though 🤷🏻. I had to change quite a lot of overloads because of this as well in #197. So maybe you a similar approach I used there to solve it in this case.

scipy-stubs/signal/_waveforms.pyi

pavyamsiri · 2024-11-25T12:49:31Z

I managed to avoid the overlap but I am not convinced the overloads work as intended. I tried making some type tests like passing np.zeros(4, dtype=np.float16) as t and there was no suitable overload for chirp.

jorenham · 2024-11-25T14:09:45Z

I managed to avoid the overlap but I am not convinced the overloads work as intended. I tried making some type tests like passing np.zeros(4, dtype=np.float16) as t and there was no suitable overload for chirp.

hmmm 🤔
Well in this case you could also brute force it by overloading each of the float{16,32,64} cases.
But I also don't mind if you just return an np.floating[Any] there, or keep it as it currently is. I'd be happy to merge either way. So let me know if I should

pavyamsiri · 2024-11-25T15:21:28Z

Yeah I think I will go with the Any approach because I fear the brute force approach will be combinatorially explosive due to the other parameters.

I added an overload to differentiate between scalar and non scalar array likes. Should be able to be merged now.

jorenham · 2024-11-25T15:22:35Z

Thanks Pavadol!

jorenham added the scipy.signal label Nov 24, 2024

jorenham reviewed Nov 24, 2024

View reviewed changes

jorenham mentioned this pull request Nov 24, 2024

optype.numpy.To{}ND false negatives in mypy jorenham/optype#194

Closed

pavyamsiri added 17 commits November 25, 2024 16:09

signal: Add type stubs for sweep_poly in _waveforms.pyi

157b713

Wanted to add overloads depending on input dtype but because complex is a superclass of float, the overloads overlap and so we can't do it

signal: Add type stubs to unit_impulse in _waveforms.pyi

810224f

signal+tests: Add type stubs for gausspulse in _waveforms.pyi.

dbe336e

Add type tests as well to check overloads are complete

signal: Add type stubs to chirp in _waveforms.pyi.

7a63f34

float16, float32 preserve their while everything else becomes float64. Not sure how to represent this though.

signal: Don't allow onp.ToComplexND because time should be real-valued

7c8aa77

signal: Use a type alias for npt.NDArray[np.float64]

d416367

In `_waveforms.pyi`

signal: Sort imports in _waveforms.pyi

cfb7713

signal: Omit default float value in _waveforms.

2d60ea0

Ruff complains that we can only use simple values and `float` (the type) is not a simple value.

signal: Fix up some mistakes in _waveforms.pyi.

f2a870b

signal: Allow scalars as t in gausspulse

bb7ec09

tests: Update test_waveforms

d656dd6

signal: t in gausspulse accepts the string "cutoff" not the tpr

7524442

param

tests: Update test_waveforms

7b975ec

signal: Type annotate chirp so that the output dtype can be inferred

647b1c3

In `_waveforms.pyi`

signal+tests: Add all possible overloads for gausspulse.

3278292

`t` can be either an array or a scalar or "cutoff". Add corresponding tests

signal: Move gausspulse overloads around to satisfy mypy

4d74dc8

This doesn't fix `stubtest` however but it does fix `typetest`.

pavyamsiri force-pushed the improve/signal/waveforms branch from 633310c to 4d74dc8 Compare November 25, 2024 05:30

jorenham reviewed Nov 25, 2024

View reviewed changes

scipy-stubs/signal/_waveforms.pyi Outdated Show resolved Hide resolved

signal: Fix chirp overload overlap in _waveforms.pyi

60570a0

signal: Simplify overloads for chirp in _waveforms.pyi

76785f5

jorenham self-requested a review November 25, 2024 15:20

jorenham approved these changes Nov 25, 2024

View reviewed changes

jorenham merged commit 5559ee0 into jorenham:master Nov 25, 2024
2 checks passed

jorenham modified the milestone: 1.14.1.4 Nov 25, 2024

jorenham mentioned this pull request Nov 25, 2024

🌕 complete scipy.signal #99

Open

21 tasks

jorenham added this to the 1.14.1.5 milestone Nov 25, 2024

jorenham added the stubs: improvement Improve or refactor existing annotations label Nov 25, 2024

pavyamsiri deleted the improve/signal/waveforms branch November 26, 2024 01:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`signal`: Add type stubs to `_waveforms.pyi`. #195

`signal`: Add type stubs to `_waveforms.pyi`. #195

pavyamsiri commented Nov 24, 2024 •

edited

Loading

jorenham left a comment

pavyamsiri commented Nov 24, 2024

pavyamsiri commented Nov 24, 2024 •

edited

Loading

pavyamsiri commented Nov 24, 2024

jorenham commented Nov 24, 2024 •

edited

Loading

jorenham commented Nov 24, 2024 •

edited

Loading

jorenham commented Nov 25, 2024 •

edited

Loading

pavyamsiri commented Nov 25, 2024

pavyamsiri commented Nov 25, 2024

jorenham commented Nov 25, 2024

pavyamsiri commented Nov 25, 2024

jorenham commented Nov 25, 2024

pavyamsiri commented Nov 25, 2024

jorenham commented Nov 25, 2024

signal: Add type stubs to _waveforms.pyi. #195

signal: Add type stubs to _waveforms.pyi. #195

Conversation

pavyamsiri commented Nov 24, 2024 • edited Loading

Notes

Funny Bug

jorenham left a comment

Choose a reason for hiding this comment

pavyamsiri commented Nov 24, 2024

pavyamsiri commented Nov 24, 2024 • edited Loading

pavyamsiri commented Nov 24, 2024

jorenham commented Nov 24, 2024 • edited Loading

jorenham commented Nov 24, 2024 • edited Loading

jorenham commented Nov 25, 2024 • edited Loading

pavyamsiri commented Nov 25, 2024

pavyamsiri commented Nov 25, 2024

jorenham commented Nov 25, 2024

pavyamsiri commented Nov 25, 2024

jorenham commented Nov 25, 2024

pavyamsiri commented Nov 25, 2024

jorenham commented Nov 25, 2024

`signal`: Add type stubs to `_waveforms.pyi`. #195

`signal`: Add type stubs to `_waveforms.pyi`. #195

pavyamsiri commented Nov 24, 2024 •

edited

Loading

pavyamsiri commented Nov 24, 2024 •

edited

Loading

jorenham commented Nov 24, 2024 •

edited

Loading

jorenham commented Nov 24, 2024 •

edited

Loading

jorenham commented Nov 25, 2024 •

edited

Loading