sel with categorical index #3670

fujiisoup · 2020-01-08T10:51:06Z

Closes #3669 (Fail to sel() when index comes from categorical pandas Series) and #3674 (Multi-index with categorical values)
Tests added
Passes black . && mypy . && flake8
Fully documented, including whats-new.rst for all changes and api.rst for new API

It is a bit surprising that nobody seems to have used xarray with a CategoricalIndex so far...
If anything else is missing, please feel free to point it out.

fujiisoup · 2020-01-08T20:20:50Z

I don't think the check failure is related.

xarray/tests/test_dataset.py

pep8speaks · 2020-01-11T02:22:00Z

Hello @fujiisoup! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

In the file xarray/core/indexes.py:

Line 13:4: W291 trailing whitespace
Line 14:85: W291 trailing whitespace

Comment last updated at 2020-01-25 00:59:20 UTC

fujiisoup · 2020-01-11T03:20:27Z

I think this PR is ready for review.

fujiisoup · 2020-01-14T23:40:08Z

I'll merge this tomorrow if there are no more comments.

keewis

I tried to look over the code but didn't find anything except some stylistic issues which I don't feel strongly about. Other than that this looks good to me (I don't know much about this part of xarray, though).

keewis · 2020-01-14T23:43:56Z

xarray/core/indexes.py

+            levels = []
+            for i, level in enumerate(index.levels):
+                if isinstance(level, pd.CategoricalIndex):
+                    level = level[index.codes[i]].remove_unused_categories()
+                levels.append(level)


I feel like this could be cleaner by using a list comprehension:

levels = [
    level[index.codes[i]].remove_unused_categories()
    if isinstance(level, pd.CategoricalIndex)
    else level
    for i, level in enumerate(index.levels)
]

though that might be just me
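
For readers following the suggestion above, here is a minimal sketch of what the snippet under review does and why the two spellings are interchangeable. It assumes only pandas; the example index (`letters`, the level names, and the unused category `"c"`) is hypothetical and not taken from the PR's tests. Each categorical level is expanded to one value per row via `index.codes[i]`, and categories that never occur are then dropped.

```python
import pandas as pd

# Hypothetical data: a categorical level that declares an unused category "c".
letters = pd.CategoricalIndex(["a", "b", "a"], categories=["a", "b", "c"])
index = pd.MultiIndex.from_arrays([letters, [0, 1, 2]], names=["letter", "number"])

# Loop form, as in the diff under review.
levels_loop = []
for i, level in enumerate(index.levels):
    if isinstance(level, pd.CategoricalIndex):
        # Expand the level to per-row values, then drop categories that never occur.
        level = level[index.codes[i]].remove_unused_categories()
    levels_loop.append(level)

# Comprehension form, as suggested in the review comment.
levels_comp = [
    level[index.codes[i]].remove_unused_categories()
    if isinstance(level, pd.CategoricalIndex)
    else level
    for i, level in enumerate(index.levels)
]

# Both forms build the same list of levels.
same = all(a.equals(b) for a, b in zip(levels_loop, levels_comp))
```

Both forms produce the same result, so the choice is purely stylistic, as the comment says.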

xarray/core/indexes.py

dcherian

Some minor comments. Thanks @fujiisoup

xarray/core/dataset.py

xarray/core/indexes.py

xarray/core/indexing.py

doc/whats-new.rst

keewis · 2020-01-25T00:07:59Z

unless I'm missing something this should be really close (run black once and fix the merge conflict in whats-new.rst)

* 'master' of github.com:pydata/xarray: (27 commits)
  bump min deps for 0.15 (pydata#3713)
  setuptools-scm and isort tweaks (pydata#3720)
  Allow binned coordinates on 1D plots y-axis. (pydata#3685)
  apply_ufunc: Add meta kwarg + bump dask to 2.2 (pydata#3660)
  setuptools-scm and one-liner setup.py (pydata#3714)
  Feature/align in dot (pydata#3699)
  ENH: enable `H5NetCDFStore` to work with already open h5netcdf.File a… (pydata#3618)
  One-off isort run (pydata#3705)
  hardcoded xarray.__all__ (pydata#3703)
  Bump mypy to v0.761 (pydata#3704)
  remove DataArray and Dataset constructor deprecations for 0.15 (pydata#3560)
  Tests for variables with units (pydata#3654)
  Add an example notebook using apply_ufunc to vectorize 1D functions (pydata#3629)
  Use encoding['dtype'] over data.dtype when possible within CFMaskCoder.encode (pydata#3652)
  allow passing any iterable to drop when dropping variables (pydata#3693)
  Typo on DataSet/DataArray.to_dict documentation (pydata#3692)
  Fix mypy type checking tests failure in ds.merge (pydata#3690)
  Explicitly convert result of pd.to_datetime to a timezone-naive type (pydata#3688)
  ds.merge(da) bugfix (pydata#3677)
  fix docstring for combine_first: returns a Dataset (pydata#3683)
  ...

keewis · 2020-01-25T12:25:13Z

should be ready for merging now

fujiisoup · 2020-01-25T22:38:10Z

Thanks, @dcherian and @keewis , for keeping this updated.
Merging.

* master:
  Add support for CFTimeIndex in get_clean_interp_index (pydata#3631)
  sel with categorical index (pydata#3670)
  bump min deps for 0.15 (pydata#3713)
  setuptools-scm and isort tweaks (pydata#3720)
  Allow binned coordinates on 1D plots y-axis. (pydata#3685)
  apply_ufunc: Add meta kwarg + bump dask to 2.2 (pydata#3660)
  setuptools-scm and one-liner setup.py (pydata#3714)
  Feature/align in dot (pydata#3699)
  ENH: enable `H5NetCDFStore` to work with already open h5netcdf.File a… (pydata#3618)
  One-off isort run (pydata#3705)
  hardcoded xarray.__all__ (pydata#3703)
  Bump mypy to v0.761 (pydata#3704)
  remove DataArray and Dataset constructor deprecations for 0.15 (pydata#3560)
  Tests for variables with units (pydata#3654)
  Add an example notebook using apply_ufunc to vectorize 1D functions (pydata#3629)
  Use encoding['dtype'] over data.dtype when possible within CFMaskCoder.encode (pydata#3652)

Added a support with categorical index

0f74942

fujiisoup mentioned this pull request Jan 8, 2020

Fail to sel() when index comes from categorical pandas Series #3669

Closed

dcherian reviewed Jan 8, 2020

xarray/tests/test_dataset.py (resolved)

This was referenced Jan 9, 2020

Cannot remove_unused_categories for CategoricalIndex in a MultiIndex pandas-dev/pandas#30846

Open

Multi-index with categorical values #3674

Closed

fujiisoup added 3 commits January 9, 2020 21:08

fix from_dataframe

0e0db2f

update a test

cd6f3c8

Added more tests

98d38b0

fujiisoup added 2 commits January 11, 2020 11:51

black

8d9b6ec

Added a test to make sure raising ValueErrors

262b8d5

fujiisoup added 2 commits January 11, 2020 12:22

remove unnecessary print

b8720a4

added a test for reindex

57ca90e

keewis reviewed Jan 15, 2020

dcherian approved these changes Jan 16, 2020

xarray/core/dataset.py (outdated, resolved)

xarray/core/indexes.py (outdated, resolved)

xarray/core/indexing.py (outdated, resolved)

xarray/core/indexing.py (outdated, resolved)

dcherian reviewed Jan 16, 2020

doc/whats-new.rst (outdated, resolved)

dcherian mentioned this pull request Jan 17, 2020

release 0.15.0? #3702

Closed

11 tasks

Fix according to reviews

d338185

dcherian and others added 3 commits January 24, 2020 17:58

blacken

5cdf82e

delete trailing whitespace

27f3505

fujiisoup merged commit cc142f4 into pydata:master Jan 25, 2020

fujiisoup deleted the sel_categorilical branch January 25, 2020 22:38
