BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index #23524

jbrockmendel · 2018-11-06T01:55:16Z

Also fixes bug with DateOffset == "infer" incorrectly raising instead of returning False.

Also fixes bug(?) with pd.Index(dtindex, dtype=object) returning an index of datetimes instead of Timestamps, potentially losing nanoseconds.

closes BUG: DatetimeIndex cast to object dtype raises/wrong for tzaware #23491
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

…r Index

pep8speaks · 2018-11-06T01:55:28Z

Hello @jbrockmendel! Thanks for submitting the PR.

There are no PEP8 issues in the file pandas/core/arrays/datetimes.py !
There are no PEP8 issues in the file pandas/core/indexes/base.py !
There are no PEP8 issues in the file pandas/tests/arrays/test_datetimelike.py !
There are no PEP8 issues in the file pandas/tests/indexes/test_base.py !
There are no PEP8 issues in the file pandas/tests/tseries/offsets/test_offsets.py !
There are no PEP8 issues in the file pandas/tests/tseries/offsets/test_ticks.py !
There are no PEP8 issues in the file pandas/tseries/offsets.py !

pandas/core/arrays/datetimes.py

pandas/core/indexes/base.py

codecov · 2018-11-06T02:56:05Z

Codecov Report

Merging #23524 into master will decrease coverage by <.01%.
The diff coverage is 91.3%.

@@            Coverage Diff             @@
##           master   #23524      +/-   ##
==========================================
- Coverage   92.25%   92.25%   -0.01%     
==========================================
  Files         161      161              
  Lines       51262    51278      +16     
==========================================
+ Hits        47292    47304      +12     
- Misses       3970     3974       +4

Flag	Coverage Δ
#multiple	`90.63% <91.3%> (-0.01%)`	⬇️
#single	`42.32% <43.47%> (-0.02%)`	⬇️

Impacted Files	Coverage Δ
pandas/core/arrays/datetimes.py	`98.44% <100%> (+0.02%)`	⬆️
pandas/core/indexes/base.py	`96.45% <100%> (ø)`	⬆️
pandas/tseries/offsets.py	`97.07% <85.71%> (-0.15%)`	⬇️
pandas/io/pytables.py	`92.34% <0%> (-0.09%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update efd1844...cc7f5cd. Read the comment docs.

pandas/core/arrays/datetimes.py

…dex_bugs

TomAugspurger · 2018-11-06T16:29:31Z

Also fixes bug(?) with pd.Index(dtindex, dtype=object) returning an index of datetimes instead of Timestamps, potentially losing nanoseconds.

Seems like a fine change, but needs a release note.

jbrockmendel · 2018-11-06T17:52:19Z

Seems like a fine change, but needs a release note.

Good point; done.

pandas/tests/arrays/test_datetimelike.py

…dex_bugs

doc/source/whatsnew/v0.24.0.txt

pandas/_libs/tslibs/offsets.pyx

pandas/core/arrays/datetimes.py

pandas/core/indexes/base.py

pandas/tseries/offsets.py

…dex_bugs

doc/source/whatsnew/v0.24.0.txt

pandas/core/arrays/datetimes.py

pandas/core/indexes/base.py

…dex_bugs

jorisvandenbossche

Small comment

jorisvandenbossche · 2018-11-08T15:01:39Z

pandas/core/arrays/datetimes.py

+
+        # TODO: warn that dtype is not used?
+        #  warn that conversion may be lossy?
+        return self._data.view(np.ndarray)  # follow Index.__array__


Other question: what is the .view(np.ndarray) part doing if it is already an array? Can we remove it?

It can probably be removed; this is taken directly from the Index.__array__ implementation, so I think the maybe-removing this should be done at the same time those methods are overhauled (ill be opening an Issue shortly)

jorisvandenbossche · 2018-11-08T15:04:01Z

pandas/tests/arrays/test_datetimelike.py

@@ -57,6 +57,54 @@ def timedelta_index(request):

 class TestDatetimeArray(object):

+    def test_array_object_dtype(self, tz_naive_fixture):


It think it would be good to add such a test to the base extension tests as well?

I don't know those tests well enough to have an informed opinion. AFAIK ExtensionArray doesn't implement __array__, so it isn't clear that this is supported in the general case.

EA implements __iter__, which should be sufficient.

This test would be slightly opinionated for a base test, in case an EA wants to be converted to a specific NumPy type, but I think it's OK.

I guess we have base.interface.BaseInterfaceTests.test_array_interface which checks

def test_array_interface(self, data): result = np.array(data) assert result[0] == data[0]

Ah, yes, that's already a generic test. OK, since that does not actually test the return dtype, it's good to have more explicit tests here.

Should we expect from EA that np.array(EA, dtype=object) always works (returns an object array of scalars)?
That seems like an OK assumption to me, since this already happens if you don't implement __array__, so we can expect this as well if the EA author implements a custom __array__ I think.

pandas/core/arrays/datetimes.py

jorisvandenbossche · 2018-11-09T09:57:44Z

pandas/core/arrays/datetimes.py

+        if is_object_dtype(dtype):
+            return np.array(list(self), dtype=object)
+        elif is_int64_dtype(dtype):
+            return self.asi8


I think we can remove this elif branch. Numpy will afterwards convert the M8[ns] data to int, and in that way ensure the semantics of np.asarray regarding copy/no copy is followed.

@jbrockmendel I opened #23593, a PR doing the __array__ for all datetimelike EAs, not only DatetimeArray (but so there is a bit of overlap with this PR)

Great, I'll take a look at 23593.

…dex_bugs

jreback · 2018-11-11T16:20:28Z

thanks @jbrockmendel

#23593 is the followup to address __array__.

* upstream/master: BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index (pandas-dev#23524) BUG: Delegate more of Excel parsing to CSV (pandas-dev#23544) API: DataFrame.__getitem__ returns Series for sparse column (pandas-dev#23561) CLN: use float64_t consistently instead of double, double_t (pandas-dev#23583) DOC: Fix Order of parameters in docstrings (pandas-dev#23611) TST: Unskip some Categorical Tests (pandas-dev#23613) TST: Fix integer ops comparison test (pandas-dev#23619) DOC: Fixes to docstring to add validation to CI (pandas-dev#23560) DOC: Remove incorrect periods at the end of parameter types (pandas-dev#23600) MAINT: tm.assert_raises_regex --> pytest.raises (pandas-dev#23592) DOC: Updating Series.resample and DataFrame.resample docstrings (pandas-dev#23197)

…fixed * upstream/master: DOC: Enhancing pivot / reshape docs (pandas-dev#21038) TST: Fix xfailing DataFrame arithmetic tests by transposing (pandas-dev#23620) BUILD: Simplifying contributor dependencies (pandas-dev#23522) BUG/REF: TimedeltaIndex.__new__ (pandas-dev#23539) BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index (pandas-dev#23524) BUG: Delegate more of Excel parsing to CSV (pandas-dev#23544) API: DataFrame.__getitem__ returns Series for sparse column (pandas-dev#23561) CLN: use float64_t consistently instead of double, double_t (pandas-dev#23583) DOC: Fix Order of parameters in docstrings (pandas-dev#23611) TST: Unskip some Categorical Tests (pandas-dev#23613) TST: Fix integer ops comparison test (pandas-dev#23619)

…ndas-dev#23524)

jbrockmendel added 3 commits November 5, 2018 17:07

BUG: fix and test offset comparison with non-offsets

61ad510

whatsnew note

937541e

Fix and test casting tz-aware datetimeindex to object-dtype ndarray o…

ca9e8af

…r Index

add GH references

9d0fbd7

jreback reviewed Nov 6, 2018

View reviewed changes

pandas/core/arrays/datetimes.py Show resolved Hide resolved

jreback requested changes Nov 6, 2018

View reviewed changes

pandas/core/indexes/base.py Outdated Show resolved Hide resolved

jreback added Datetime Datetime data dtype Timezones Timezone data dtype labels Nov 6, 2018

Clarify comment

dbf145f

jreback requested changes Nov 6, 2018

View reviewed changes

pandas/core/arrays/datetimes.py Show resolved Hide resolved

jbrockmendel added 2 commits November 6, 2018 08:11

Merge branch 'master' of https://github.com/pandas-dev/pandas into in…

ec696e5

…dex_bugs

test for np.array(arr, dtype=np.int64)

8853b6a

release note for fixing dropping of nanoseconds

9add480

TomAugspurger reviewed Nov 6, 2018

View reviewed changes

pandas/tests/arrays/test_datetimelike.py Show resolved Hide resolved

jbrockmendel added 2 commits November 6, 2018 17:01

test for copy=False being respected

cfd8e71

Merge branch 'master' of https://github.com/pandas-dev/pandas into in…

bc06a61

…dex_bugs

jreback requested changes Nov 7, 2018

View reviewed changes

jbrockmendel added 3 commits November 7, 2018 08:00

Merge branch 'master' of https://github.com/pandas-dev/pandas into in…

c9a2c22

…dex_bugs

fix missing backticks

b4ecfbb

comments

756574f

jreback requested changes Nov 8, 2018

View reviewed changes

doc/source/whatsnew/v0.24.0.txt Outdated Show resolved Hide resolved

pandas/core/arrays/datetimes.py Show resolved Hide resolved

pandas/core/indexes/base.py Outdated Show resolved Hide resolved

jreback added this to the 0.24.0 milestone Nov 8, 2018

jbrockmendel added 2 commits November 8, 2018 06:49

Merge branch 'master' of https://github.com/pandas-dev/pandas into in…

ebada17

…dex_bugs

move whatsnew note, do astype differently

bf1677a

jorisvandenbossche reviewed Nov 8, 2018

View reviewed changes

jbrockmendel mentioned this pull request Nov 8, 2018

Review/Overhaul __array__ and view methods #23569

Closed

jorisvandenbossche reviewed Nov 9, 2018

View reviewed changes

jbrockmendel added 2 commits November 9, 2018 06:01

remove comment

0a5dbed

Merge branch 'master' of https://github.com/pandas-dev/pandas into in…

cc7f5cd

…dex_bugs

jreback approved these changes Nov 11, 2018

View reviewed changes

jreback merged commit 58a59bd into pandas-dev:master Nov 11, 2018

jbrockmendel deleted the index_bugs branch November 11, 2018 17:08

JustinZhengBC pushed a commit to JustinZhengBC/pandas that referenced this pull request Nov 14, 2018

BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index (pa…

c355f26

…ndas-dev#23524)

tm9k1 pushed a commit to tm9k1/pandas that referenced this pull request Nov 19, 2018

BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index (pa…

2e1e644

…ndas-dev#23524)

jorisvandenbossche mentioned this pull request Jan 3, 2019

API: consistent __array__ for datetime-like ExtensionArrays #23593

Merged

Pingviinituutti pushed a commit to Pingviinituutti/pandas that referenced this pull request Feb 28, 2019

BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index (pa…

4821e80

…ndas-dev#23524)

Pingviinituutti pushed a commit to Pingviinituutti/pandas that referenced this pull request Feb 28, 2019

BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index (pa…

3d6ab98

…ndas-dev#23524)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index #23524

BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index #23524

jbrockmendel commented Nov 6, 2018

pep8speaks commented Nov 6, 2018

codecov bot commented Nov 6, 2018 •

edited

Loading

TomAugspurger commented Nov 6, 2018

jbrockmendel commented Nov 6, 2018

jorisvandenbossche left a comment

jorisvandenbossche Nov 8, 2018

jbrockmendel Nov 8, 2018

jorisvandenbossche Nov 8, 2018

jbrockmendel Nov 8, 2018

TomAugspurger Nov 8, 2018

TomAugspurger Nov 8, 2018

jorisvandenbossche Nov 9, 2018

jorisvandenbossche Nov 9, 2018

jorisvandenbossche Nov 9, 2018

jbrockmendel Nov 9, 2018

jreback commented Nov 11, 2018

		@@ -57,6 +57,54 @@ def timedelta_index(request):

		class TestDatetimeArray(object):

		def test_array_object_dtype(self, tz_naive_fixture):

BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index #23524

BUG: Casting tz-aware DatetimeIndex to object-dtype ndarray/Index #23524

Conversation

jbrockmendel commented Nov 6, 2018

pep8speaks commented Nov 6, 2018

codecov bot commented Nov 6, 2018 • edited Loading

Codecov Report

TomAugspurger commented Nov 6, 2018

jbrockmendel commented Nov 6, 2018

jorisvandenbossche left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jreback commented Nov 11, 2018

codecov bot commented Nov 6, 2018 •

edited

Loading