CLN: Prune unnecessary indexing code #27576

jbrockmendel · 2019-07-24T21:52:24Z

No description provided.

jbrockmendel · 2019-07-25T00:02:38Z

pandas/core/indexing.py

@@ -2077,6 +2072,8 @@ def _getitem_tuple(self, tup: Tuple):

            # if the dim was reduced, then pass a lower-dim the next time
            if retval.ndim < self.ndim:
+                # TODO: this is never reached in tests; can we confirm that
+                #  it is impossible?


@toobaz any thoughts on showing this is unreachable? If we can confirm this is the case, then this chunk of the code can be shared with the other _getitem_tuple method above

(pinging you since you wrote the wiki page on simplifying this code)

looks like this also wasnt hit in 0.24.2

And for context, example case which triggers on that commit but not on master

Code

import numpy as np from pandas import * mi_int = DataFrame( np.random.randn(3, 3), columns=[[2, 2, 4], [6, 8, 10]], index=[[4, 4, 8], [8, 10, 12]], ) # this triggers the code path when testing against 9f0dc3befb rs = mi_int.iloc[:, 2]

I don't think that path is ever reached. It should be reached by something like df.loc[1, [0, 1]], which however gets captured in the call above to self._getitem_lowerdim(tup) (which doesn't make sense, because it is named "lowerdim", but then it resorts to _is_nested_tuple_indexer). So to sum up: I don't know exactly what the aim of that axis -= 1 is, I don't think it is ever reached, but I think it should be reached in principle.

(but no major objections to removing for now)

@pilkibun thanks for tracking that down

@toobaz how about we fix getitem_lowerdim to make sense before removing this check. simplifying this code is going to take a few passes

@pilkibun there is another un-hit block on lines 440-450. any idea what that was intended to catch?

this was related to Panel indexing, can probably go.

@toobaz how about we fix getitem_lowerdim to make sense before removing this check. simplifying this code is going to take a few passes

No objection

jbrockmendel · 2019-07-25T00:05:30Z

pandas/core/indexing.py

@@ -2152,7 +2149,7 @@ def _convert_to_indexer(
            )


-class _ScalarAccessIndexer(_NDFrameIndexer):
+class _ScalarAccessIndexer(_NDFrameIndexerBase):


The only _NDFrameIndexer method these use is _tuplify, which is why this PR takes that out and makes it a function instead of a method. Much easier to reason about these classes now

jbrockmendel · 2019-07-25T00:08:36Z

pandas/core/series.py

-                    pass
-                elif is_timedelta64_dtype(self.dtype):
-                    # reassign a null value to iNaT
-                    if is_valid_nat_for_dtype(value, self.dtype):


this is no longer reachable since we fixed a bug in the _set_with_engine call above in #27536.

pandas/core/frame.py

jreback · 2019-07-25T17:03:13Z

pandas/core/indexes/multi.py

@@ -3464,3 +3462,22 @@ def _sparsify(label_list, start=0, sentinel=""):

 def _get_na_rep(dtype):
    return {np.datetime64: "NaT", np.timedelta64: "NaT"}.get(dtype, "NaN")
+
+
+def maybe_droplevels(index, key):


can you add a doc-string

jreback · 2019-07-25T17:03:38Z

pandas/core/indexing.py

@@ -2077,6 +2072,8 @@ def _getitem_tuple(self, tup: Tuple):

            # if the dim was reduced, then pass a lower-dim the next time
            if retval.ndim < self.ndim:
+                # TODO: this is never reached in tests; can we confirm that
+                #  it is impossible?


this was related to Panel indexing, can probably go.

jreback · 2019-07-25T17:04:01Z

pandas/core/indexing.py

@@ -2320,6 +2314,12 @@ def _convert_key(self, key, is_setter: bool = False):
        return key


+def _tuplify(ndim: int, loc) -> tuple:
+    tup = [slice(None, None) for _ in range(ndim)]


can you add a doc-string

docstrings added. will take another look at removing the discussed lines in the next pass

…ndex

jreback · 2019-07-25T22:11:47Z

pandas/core/indexes/multi.py

@@ -2750,7 +2747,8 @@ def get_loc_level(self, key, level=0, drop_level=True):
        (1, None)
        """

-        def maybe_droplevels(indexer, levels, drop_level):
+        # different name to distinguish from maybe_droplevels
+        def maybe_mi_droplevels(indexer, levels, drop_level: bool):


in future maybe pull this out to a module level function

jreback · 2019-07-25T22:12:51Z

thanks

jbrockmendel added 9 commits July 24, 2019 10:25

disable unused getitem

ebd9d03

remove unhit branch

e0e039f

move maybe_droplevels

330f8af

make tuplify non-method

ca16db2

remove _has_valid_setitem_indexer

9dafe60

self.obj.ndim->self.ndim

f21e7d5

Comment

dd5fc2d

remove case handled by set_engine above

59e3935

remove unused import

e165eca

jbrockmendel commented Jul 25, 2019

View reviewed changes

jreback reviewed Jul 25, 2019

View reviewed changes

pandas/core/frame.py Show resolved Hide resolved

jreback requested changes Jul 25, 2019

View reviewed changes

jreback added Clean Indexing Related to indexing on series/frames, not to indexes themselves labels Jul 25, 2019

jbrockmendel added 3 commits July 25, 2019 10:49

Merge branch 'master' of https://github.com/pandas-dev/pandas into mi…

743e7c0

…ndex

docstrings

5c4927e

dummy commit to force ci

f0559a1

jreback added this to the 1.0 milestone Jul 25, 2019

jreback reviewed Jul 25, 2019

View reviewed changes

jreback approved these changes Jul 25, 2019

View reviewed changes

jreback merged commit ebcfee4 into pandas-dev:master Jul 25, 2019

jbrockmendel deleted the mindex branch July 25, 2019 22:36

quintusdias pushed a commit to quintusdias/pandas_dev that referenced this pull request Aug 16, 2019

CLN: Prune unnecessary indexing code (pandas-dev#27576)

a2f670c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLN: Prune unnecessary indexing code #27576

CLN: Prune unnecessary indexing code #27576

jbrockmendel commented Jul 24, 2019

jbrockmendel Jul 25, 2019

jbrockmendel Jul 25, 2019

ghost Jul 25, 2019 •

edited by ghost

Loading

toobaz Jul 25, 2019

toobaz Jul 25, 2019

jbrockmendel Jul 25, 2019

jbrockmendel Jul 25, 2019

jreback Jul 25, 2019

toobaz Jul 25, 2019

jbrockmendel Jul 25, 2019

jbrockmendel Jul 25, 2019

jreback Jul 25, 2019

jbrockmendel Jul 25, 2019

jreback Jul 25, 2019

jreback Jul 25, 2019

jbrockmendel Jul 25, 2019

jbrockmendel Jul 25, 2019

jreback Jul 25, 2019

jreback commented Jul 25, 2019

CLN: Prune unnecessary indexing code #27576

CLN: Prune unnecessary indexing code #27576

Conversation

jbrockmendel commented Jul 24, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ghost Jul 25, 2019 • edited by ghost Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jreback commented Jul 25, 2019

ghost Jul 25, 2019 •

edited by ghost

Loading