
TST: tests for maybe_promote (precursor to #23982) #25637

Merged
merged 26 commits into from
Jun 21, 2019

Conversation

h-vetinari
Contributor

First step towards #23833, resp. precursor to #23982.

TL;DR: maybe_promote is quite broken. #23982 tries to come up with tests that it should pass, and #25425 tries to fix the implementation. However, #23982 is quite big, so @jreback asked for a smaller version that (mostly) just tests existing behaviour.

@pep8speaks

pep8speaks commented Mar 10, 2019

Hello @h-vetinari! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2019-06-21 06:15:45 UTC

@codecov

codecov bot commented Mar 10, 2019

Codecov Report

Merging #25637 into master will increase coverage by <.01%.
The diff coverage is n/a.


@@            Coverage Diff             @@
##           master   #25637      +/-   ##
==========================================
+ Coverage   91.26%   91.27%   +<.01%     
==========================================
  Files         173      173              
  Lines       52968    52968              
==========================================
+ Hits        48339    48344       +5     
+ Misses       4629     4624       -5
Flag                         Coverage Δ
#multiple                    89.84% <ø> (ø) ⬆️
#single                      41.71% <ø> (ø) ⬆️

Impacted Files               Coverage Δ
pandas/core/dtypes/cast.py   89% <0%> (+0.83%) ⬆️

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 26cfa28...ca8f96b. Read the comment docs.

@codecov

codecov bot commented Mar 10, 2019

Codecov Report

Merging #25637 into master will decrease coverage by <.01%.
The diff coverage is n/a.


@@            Coverage Diff             @@
##           master   #25637      +/-   ##
==========================================
- Coverage   91.97%   91.97%   -0.01%     
==========================================
  Files         180      180              
  Lines       50756    50756              
==========================================
- Hits        46685    46683       -2     
- Misses       4071     4073       +2
Flag                         Coverage Δ
#multiple                    90.57% <ø> (ø) ⬆️
#single                      41.84% <ø> (-0.07%) ⬇️

Impacted Files               Coverage Δ
pandas/io/gbq.py             88.88% <0%> (-11.12%) ⬇️
pandas/core/frame.py         96.89% <0%> (-0.12%) ⬇️
pandas/util/testing.py       90.94% <0%> (+0.1%) ⬆️
pandas/core/dtypes/cast.py   90.69% <0%> (+0.16%) ⬆️

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b9b081d...636b1f1. Read the comment docs.

Contributor

@jreback jreback left a comment


just started glancing at this. I find the box & box_dtype parametrization args very confusing. This is testing a scalar and an array, but when / where can this actually be an object dtype of a scalar? I would like to keep the scope of this way, way down. This means: don't over-parametrize.


# try/except as numpy dtypes (i.e. if result_dtype is np.object_) do not
# know some expected dtypes like DatetimeTZDtype, and hence raise TypeError
try:
Contributor

I find this very error-prone and would rather make this explicit

Contributor Author

I don't understand what you mean here. PandasDtypes (which will not be known by a numpy dtype) can come in on either side of the comparison. If you have a way to write this better, let me know.

Member

@gfyoung gfyoung May 13, 2019

By more explicit, I think you need to check the type of result_dtype and expected_dtype. That way, you know which way to compare.
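To illustrate the "check the type first" suggestion, here is a hedged, pandas-free sketch; the helper name `dtypes_equal` is hypothetical, not pandas API, and stands in for the comparison that the try/except currently guards:

```python
import numpy as np

def dtypes_equal(result_dtype, expected_dtype):
    # Hypothetical helper: instead of wrapping == in try/except, check the
    # types first. np.dtype.__eq__ can raise TypeError when the other
    # operand is something numpy cannot coerce to a dtype (e.g. a pandas
    # ExtensionDtype such as DatetimeTZDtype), so only use == once we know
    # which way the comparison goes.
    result_is_np = isinstance(result_dtype, np.dtype)
    expected_is_np = isinstance(expected_dtype, np.dtype)
    if result_is_np and expected_is_np:
        return result_dtype == expected_dtype
    if result_is_np != expected_is_np:
        # one side is a numpy dtype, the other is not: in this sketch we
        # treat them as unequal rather than letting numpy raise
        return False
    return result_dtype == expected_dtype
```

This keeps the comparison explicit about which branch handles numpy dtypes and which handles extension dtypes, rather than catching whatever TypeError happens to surface.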

pandas/tests/dtypes/cast/test_promote.py
@pytest.mark.parametrize('boxed, box_dtype', [
(True, None), # fill_value wrapped in array with auto-dtype
(True, object), # fill_value wrapped in array with object dtype
(False, None) # fill_value directly
Contributor

what does box_dtype of None even mean?

Contributor Author

I'm trying to explain just that with the comment next to it.

Contributor Author

Side note here: None takes no extra meaning, but is passed directly to np.array(some_value, dtype=None), which corresponds to numpy's default behaviour.
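A quick numpy-only illustration of that default behaviour (no pandas involved):

```python
import numpy as np

# box_dtype=None is not special-cased by the tests: it is handed straight
# to np.array(fill_value, dtype=None), i.e. numpy's own dtype inference,
# while box_dtype=object forces an object array.
auto_boxed = np.array(1, dtype=None)   # numpy infers a native integer dtype
obj_boxed = np.array(1, dtype=object)  # element stays a Python int

assert np.issubdtype(auto_boxed.dtype, np.integer)
assert obj_boxed.dtype == np.dtype(object)
```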

Member

Incorporate this side-note as a comment. That will help to explain the parameterized values better.

Contributor Author

@h-vetinari h-vetinari left a comment

@jreback: just started glancing at this. I find the box & box_dtype parametrization args very confusing. This is testing a scalar and an array, but when / where can this actually be an object dtype of a scalar? I would like to keep the scope of this way, way down. This means: don't over-parametrize.

Thanks for starting to take a look at this. I intentionally kept these tests together, as the actual tests are the same in 95% of the cases, and it would lead to a huge amount of duplication to separate testing the scalar case vs. the array case.

I've tried to annotate at every step what boxed and box_dtype mean, if we can find a better way to write this I'm all ears, but I don't think breaking up the tests is a good approach.



@gfyoung gfyoung added Testing pandas testing functions or related to the test suite Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff labels Mar 16, 2019
@h-vetinari h-vetinari mentioned this pull request Mar 17, 2019
3 tasks
@jreback
Contributor

jreback commented May 12, 2019

though a good idea, closing as stale

@jreback jreback closed this May 12, 2019
@h-vetinari
Contributor Author

@jreback: though a good idea, closing as stale

I've spent an inordinate amount of time on #25425, and although I prepared two subsets of it (#23982 and this one), it never got any review. I understand the realities of how scarce reviewing resources are, but it's still disheartening after putting in so much work.

Long story short, all three of those PRs (#23982, #25425, #25637) should be reopened, please. I'm ready to keep working on them.

@gfyoung
Member

gfyoung commented May 13, 2019

@h-vetinari : I'll re-open this one first. I agree with @jreback that the scope of what you're trying to do is massive. Thus, it would be great if you focused on this PR alone.

We can re-open other ones as the work progresses.

@gfyoung gfyoung reopened this May 13, 2019
@gfyoung
Member

gfyoung commented May 13, 2019

it would lead to huge amount of duplication to separate testing the scalar case vs. the array case.

In a separate commit, separate these out as requested. The best way to make the point here is to show the code side-by-side. Are you sure you can't abstract out setup methods for some of the functionality that would be de-duplicated by any chance?

Also, if you could address the merge conflict.

@h-vetinari
Contributor Author

@gfyoung
Thanks for reopening.

In a separate commit, separate these out as requested.

I can do this, but essentially every test would be duplicated. I'm testing the same logic in both cases, and actually more than that: that the same promotion logic applies in both scalar and array case.

If I start splitting this, it would make most sense to have one test for scalars and one for arrays, but from that point on, it would be very easy for those tests to diverge. Does this sound like a reasonable trade-off to you? While trivial, it will be a fair bit of work to actually duplicate these, so I'd rather be sure you understand the consequences beforehand.

Are you sure you can't abstract out setup methods for some of the functionality that would be de-duplicated by any chance?

That's exactly what I was aiming for with _check_promote. All the "differences" between the array and scalar case are taken care of by the wrapper, so that each test only has to specify the promotion logic that's being tested.

@gfyoung
Member

gfyoung commented May 13, 2019

If I start splitting this, it would make most sense to have one test for scalars and one for arrays, but from that point on

Why don't you try duplicating for one test only. Then we can evaluate before and after on that one test and see whether it makes sense for you to continue forward with the duplication.

That's exactly what I was aiming for with _check_promote.

Hmm... I see. It sounds, though (from the comments, IIUC), like that might have impacted readability because there were so many different cases to check. Is it possible to maybe have two or three different check-promote functions that each handle a subset of cases?

@h-vetinari
Contributor Author

@gfyoung

Finally got around to coming back to this:

That's exactly what I was aiming for with _check_promote.

Hmm...I see. It sounds though (from the comments IIUC) that that might have impacted readability because there were so many different cases to check. Is it possible to maybe have two or three different check promote functions that each handle a subset of cases?

I added a docstring and some comments to make it clearer. The function essentially only contains one branch depending on whether fill_value should be passed as scalar or as an array. And that's exactly the part I want to abstract away from the tests further on.

All of those tests have a very simple format:

def test_maybe_promote_XXX_with_YYY(fixtures..., boxed, box_dtype):
    dtype = some_dtype(XXX)
    fill_dtype = some_dtype(YYY)

    if some_condition:
        pytest.xfail('reason')

    # define fill_value (usually implicitly through fill_dtype)
    fill_value = some_scalar(fill_dtype)

    # define expected values
    expected_dtype = some_dtype
    exp_val_for_scalar = some_scalar_value
    exp_val_for_array = some_missing_value_marker

    _check_promote(dtype, fill_value, boxed, box_dtype, expected_dtype,
                   exp_val_for_scalar, exp_val_for_array)

This is about as simple as I can imagine it being (while unifying the treatment of the scalar and array cases). Short of a new idea (and the docstring), I don't know how to make this clearer.
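For a concrete, pandas-free instance of the contract such a test encodes (names like `maybe_promote_sketch` are illustrative, not the actual pandas implementation, which handles far more cases), consider integer data filled with a float:

```python
import numpy as np

def maybe_promote_sketch(dtype, fill_value):
    # Rough sketch of one promotion rule under test: a float fill_value
    # cannot be held losslessly in an integer array, so the dtype must be
    # upcast to float64; otherwise the dtype is kept as-is.
    if np.issubdtype(dtype, np.integer) and isinstance(fill_value, float):
        return np.dtype("float64"), fill_value
    return dtype, fill_value

expected_dtype, exp_val_for_scalar = maybe_promote_sketch(np.dtype("int64"), 0.5)
assert expected_dtype == np.dtype("float64")
assert exp_val_for_scalar == 0.5
```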

(speaking of new ideas: I incorporated your typecheck instead of the try/except for PandasExtensionDtype)

DATETIME_DTYPES = ['datetime64[ns]', 'M8[ns]']
TIMEDELTA_DTYPES = ['timedelta64[ns]', 'm8[ns]']
DATETIME64_DTYPES = ['datetime64[ns]', 'M8[ns]']
TIMEDELTA64_DTYPES = ['timedelta64[ns]', 'm8[ns]']
Member

Rationale for renames?

Is this necessary given how massive the diff is already?

Contributor Author

I can split off the conftest-related things, of course.

Regarding your specific point: The rename is really important IMO, because the DATETIME_DTYPES (as-is) have nothing to do with datetime.datetime or other such things.

However, when reading a test that is being fed by a datetime_dtype fixture, one could easily think that this tests datetime.datetime.

The current state in conftest specifically contains only DATETIME64_DTYPES, and I believe this distinction is used in many parts of the code base as well.
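The naming point is easy to verify with numpy alone: 'M8[ns]' and 'm8[ns]' are mere spelling variants of the 64-bit numpy dtypes, unrelated to datetime.datetime:

```python
import numpy as np

# 'M8[ns]' / 'm8[ns]' are numpy shorthand for 'datetime64[ns]' /
# 'timedelta64[ns]'; neither has anything to do with datetime.datetime,
# which is what motivates the DATETIME64_* / TIMEDELTA64_* names.
assert np.dtype("M8[ns]") == np.dtype("datetime64[ns]")
assert np.dtype("m8[ns]") == np.dtype("timedelta64[ns]")
assert np.dtype("M8[ns]").kind == "M" and np.dtype("m8[ns]").kind == "m"
```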

Member

@gfyoung gfyoung May 31, 2019

Got it. I'm thinking that we should separate this out. The more we can condense this PR, the better.

But hold off on making that PR for the moment (just one more question re: some of the other things you've changed for conftest.py).

Contributor Author

I can change the DATETIME_DTYPES -> DATETIME64_DTYPES in a precursor of course, but the actual change that precipitates the necessity is the introduction of datetime64_dtype, which wouldn't be used before this PR. Would you want me to introduce the fixtures with the conftest-precursor, or in this PR?

h-vetinari added a commit to h-vetinari/pandas that referenced this pull request May 31, 2019
@h-vetinari
Contributor Author

h-vetinari commented Jun 1, 2019

@gfyoung
This is now slightly reduced in size after #26596. Also, could you please restart the azure CI?

Edit: Nevermind, I had some linting to commit anyway.

@gfyoung
Member

gfyoung commented Jun 1, 2019

@jreback : Could you have another look at this

@jreback
Contributor

jreback commented Jun 1, 2019

at some point

@jreback
Contributor

jreback commented Jun 6, 2019

so why are you having every int type against a timedelta64, for example; this is just way overkill. These should all react the same, but having this expanse of tests is just too confusing. Please limit this.

Obviously, the same holds true for datetime, for example.

XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint8-timedelta64[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint8-timedelta64[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint8-m8[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint8-m8[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint16-timedelta64[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint16-timedelta64[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint16-m8[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint16-m8[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint32-timedelta64[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint32-timedelta64[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint32-m8[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint32-m8[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint64-timedelta64[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint64-timedelta64[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint64-m8[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[uint64-m8[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[int-timedelta64[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[int-timedelta64[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[int-m8[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[int-m8[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[int8-timedelta64[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[int8-timedelta64[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[int8-m8[ns]-box0-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[int8-m8[ns]-box2-np.timedelta64]
  reason: does not upcast correctly
XFAIL pandas/tests/dtypes/cast/test_promote.py::test_maybe_promote_any_with_timedelta64[int16-timedelta64[ns]-box0-np.timedelta64]
  reason: does not upcast correctly

@h-vetinari
Contributor Author

@jreback: so why are you having every int type against a timedelta64, for example; this is just way overkill. These should all react the same, but having this expanse of tests is just too confusing. Please limit this.

The simple answer is: fixtures - they do exactly what I wanted here: test every type against (inputs of) every other type, as exhaustively as possible. I get the argument that some tests can be overkill, but I can't see why it would be confusing...?

I'm guessing you're opposed specifically to the number of xfails? Otherwise, once one starts introducing special cases in this module, it becomes less understandable, more fragile, and more complex.

@jreback
Contributor

jreback commented Jun 8, 2019

The simple answer is: fixtures - they do exactly what I wanted here: exhaustively (as possible) test every type against (inputs of) every other type. I get the argument that some tests can be overkill, but I can't see why it would be confusing...?

@h-vetinari this adds quite a bit of duplicative testing for many types. Please use more selective fixtures. If it's obvious that something is doing the same exact cast vs. int, int8, int16, ..., then a single int is enough for a fixture, with a nice comment.
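A hedged sketch of what such a "more selective" fixture could look like (assuming pytest; the fixture name and the chosen dtypes are hypothetical, not the ones eventually merged):

```python
import pytest

# One representative width per numeric kind: int8/int16/int32 etc. hit the
# same promotion branch, so a single int dtype per kind keeps the coverage
# while cutting the parametrization explosion jreback objects to.
REDUCED_NUMERIC_DTYPES = ["int64", "uint64", "float64", "complex128"]

@pytest.fixture(params=REDUCED_NUMERIC_DTYPES)
def any_reduced_numeric_dtype(request):
    # tests receive one dtype string per parametrized run
    return request.param
```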

@h-vetinari
Contributor Author

@jreback
What do you think of the reduced fixture I introduced following your review?

@jreback
Contributor

jreback commented Jun 21, 2019

ok @h-vetinari, what's 3000+ new tests, lol.

merge master once again and ping on green.

@jreback jreback added this to the 0.25.0 milestone Jun 21, 2019
@h-vetinari
Contributor Author

merge master once again and ping on green.

@jreback, this is green. :)

@jreback
Contributor

jreback commented Jun 21, 2019

k thanks @h-vetinari

@jreback jreback merged commit f2aea09 into pandas-dev:master Jun 21, 2019
@h-vetinari
Contributor Author

@jreback @gfyoung
Thanks for reviewing this big hunk. :)

We can re-open other ones as the work progresses.

Can you please reopen #25425?

I'd also suggest that we reopen #23982 anyway - I don't think it has a chance of getting merged (because it basically just adds a bunch of failing tests), but once rebased it would provide good insight into the value of the refactor in #25425 - namely, all the things that are currently broken and would be fixed by that PR.

@h-vetinari h-vetinari deleted the tst_maybe_promote_precursor branch June 21, 2019 16:34
@h-vetinari
Contributor Author

@jreback @gfyoung

@h-vetinari: Can you please reopen #25425?

Labels
Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff Testing pandas testing functions or related to the test suite