Merge new changes in pydata master #4

phausamann · 2019-03-07T16:04:28Z

No description provided.

Add missing , and article in error message when attribute values have the wrong type.

This makes string indexing usable in a pandas.MultiIndex.

* DOC: remove example using Dataset.T * Update reshaping.rst * Update reshaping.rst

* Fix multidimensional co-ordinate example. * Use open_dataset().load() in other example

* wip: getting started * preliminary support for zarr consolidated metadata * update zarr dev repo * add consolidate to close * doc updates * skip tests based on zarr version * fix doc typos * fix PEP8 issues * fix test skipping * fixed integration test * update version check * rename keyword arg * Update whats-new.rst * instructions for consolidating existing stores

This version should be a little less noisy, since instructions only for authors are put in commented out HTML.

* Fix h5netcdf saving scalars with filters or chunks * Revert adding scalar dataset to central test function * Add fix description to what's new.

* Add keep_attrs to binary_op * added test for binary_ops keep_attrs=True * PEP8 issues + blank lines removed * whitespace removed * keep_attrs in DataArray * enhancement * simpler testing

* Support HighLevelGraphs Fixes #4291 * test __dask_layers__ * Skip dependnecies test with old dask * Reenable dask-dev test on Travis-CI

* concatenates along a single dimension * Wrote function to find correct tile_IDs from nested list of datasets * Wrote function to check that combined_tile_ids structure is valid * Added test of 2d-concatenation * Tests now check that dataset ordering is correct * Test concatentation along a new dimension * Started generalising auto_combine to N-D by integrating the N-D concatentation algorithm * All unit tests now passing * Fixed a failing test which I didn't notice because I don't have pseudoNetCDF * Began updating open_mfdataset to handle N-D input * Refactored to remove duplicate logic in open_mfdataset & auto_combine * Implemented Shoyers suggestion in #2553 to rewrite the recursive nested list traverser as an iterator * --amend * Now raises ValueError if input not ordered correctly before concatenation * Added some more prototype tests defining desired behaviour more clearly * Now raises informative errors on invalid forms of input * Refactoring to alos merge along each dimension * Refactored to literally just apply the old auto_combine along each dimension * Added unit tests for open_mfdatset * Removed TODOs * Removed format strings * test_get_new_tile_ids now doesn't assume dicts are ordered * Fixed failing tests on python3.5 caused by accidentally assuming dict was ordered * Test for getting new tile id * Fixed itertoolz import so that it's compatible with older versions * Increased test coverage * Added toolz as an explicit dependency to pass tests on python2.7 * Updated 'what's new' * No longer attempts to shortcut all concatenation at once if concat_dims=None * Rewrote using itertools.groupby instead of toolz.itertoolz.groupby to remove hidden dependency on toolz * Fixed erroneous removal of utils import * Updated docstrings to include an example of multidimensional concatenation * Clarified auto_combine docstring for N-D behaviour * Added unit test for nested list of Datasets with different variables * Minor spelling and pep8 fixes * Reverted API so that N-D generalisation is hidden * Removed infer_order_from_coords argument

* Fix parsing '_Unsigned' attribute Fixes #2583 * Fix encode step too. * Add tests. * Fix whats-new. * Undo unnecessary change * Yay! fix test failure.

* doc fixes Fixes #2610 * minor doc fixes. * Fix examples path for open statements.

There seems to be some sort of dependency issue on Appveyor, but it's not worth tracking down given how we'll be dropping Python 2.7 in the new year anyways.

* .resample now supports loffset. * Update whats-new.rst * Fix for pandas 0.19.2 * doc update. * Review comments.

* CF: also decode time bounds when available * Fix failing test when cftime not present and what's new * Fix windows * Reviews * Reviews 2

…lex} (#2615) * Don't raise a warning for xarray.ufuncs.angle * Update warning message

* Add test to ensure that 0d slices are views * Get 0d slices of ndarrays directly from indexing * Add 0d slice documentation

…hs (#2589) * added some logic to deal with rasterio objects in addition to filepath strings * added no network test, pep8 compliance, whatsnew.rst * removed subclass, added to base RasterioArrayWrapper * upped rasterio test version to > 1 * specified rasterio version should be greater than 1

* Close files when CachingFileManager is garbage collected Fixes GH2560 This frees users from needing to worry about this. * Minor tweak * Test raising an error in __del__ * restore change * Remove the need for a lock in __del__ * Handle locking ourselves with rasterio * Remove race condition with netCDF4 * refactor optional lock * Fix more possible race conditions * Warn if we can't close in FileManager.__del__ * Fix lock acquisition in CachingFileManager.__del__ * Cleaner fall-back for no dask-distributed * Test tweaks * Test for FileManager.__repr__ * Add reference counting to CachingFileManager * remove unused import * Spelling / reorg

* Fix multiindex selection * Support pandas0.19 * a bugfix * Do remove_unused_levels only once in unstack. * import algos * Remove unused import * Adopt local import

* ENH: resample methods with tolerance * ENH: resample methods bfill, pad, nearest accept tolerance keyword * DOC: documentation is updated with examples Fixes: GH2695 * TST: Upsampling with tolerance keyword Include tests for GH2695 * pep8 * Make resample().nearest(tolerance) test meaningful * DOC: Mention units of tolerance

* added integrate. * Docs * Update via comment * Update via comments * integrate can accept multiple dimensions. * using set instead of list * dim -> coord

* deprecate compat & encoding * stacklevel * whatsnew * imports * merge conflicts * remove deprecations * removal date

* Support dropna() for a Series indexed by a CFTimeIndex * Add a what's new entry * Use == instead of is

* add tests for GH#697 - handling of empty pandas objects in constructors * make pep8 happy

@spencerclark

* First implementation of resampling for CFTimeIndex. * First implementation of resampling for CFTimeIndex, cleaned. * First implementation of resampling for CFTimeIndex, cleaned. * First implementation of resampling for CFTimeIndex, cleaned. * First implementation of resampling for CFTimeIndex. * First implementation of resampling for CFTimeIndex, more bugs fixed, cleaned. * First implementation of resampling for CFTimeIndex, test file written. * First implementation of resampling for CFTimeIndex, test file written, cleaned. * First implementation of resampling for CFTimeIndex, test file written, cleaned. * First implementation of resampling for CFTimeIndex, test file written, cleaned. * First implementation of resampling for CFTimeIndex, test file written, cleaned. * Docstrings for resample_cftime.py written. Upsample still not fixed. * Fixed PEP8 and test parametrization. * PEP8 * Test file fixes and other optimizations (2018-12-16 @spencerclark and 2018-12-05 @max-sixty GitHub reviews for resample-v2-clean pull request). Not cleaned. * Test file fixes and other optimizations (2018-12-16 @spencerclark and 2018-12-05 @max-sixty GitHub reviews for resample-v2-clean pull request). Cleaned. * _get_range_edges logic changed to emulate latest version of pandas. * Simplified resampling logic (errors persist). Pre-cleaning. * Simplified resampling logic (error persists). Cleaned. * Simplified resampling logic (error persists). Fixed first_items.dropna() in groupby.py. Pandas cannot drop indices made up of CFTime objects, so integer indices were swapped in for dropping then swapped back out once NAs are dropped. * Simplified resampling logic (error persists). Logic slightly altered after more tests. 5578 out of 5920 tests passed. Pre-cleaning. * Simplified resampling logic (error persists). Logic slightly altered after more tests. 5578 out of 5920 tests passed. Cleaned. * Precise cftime arithmetic. Reduced overall test time. Added test for _get_range_edges. * Added default values for closed and label args of resample function in common.py. Cleaned up print statements. Modified tests that were written under the assumption that CFTimeIndex cannot be resampled so that the tests now pass. * Added back replace['dayofwk'] = -1 to cftime_offsets.py and cftimeindex.py. Removed unused code from resample_cftime.py. Removed tests that raise error when resampling CFTimeIndex. Removed temp files. * Optimizations as per #2593 * Simple test for non-standard calendars added and documentation updated. * Simple test for non-standard calendars added and documentation updated. * Added loffset support to CFTimeIndex resampling. Better adherence to PEP8 and other coding style conventions. * Added loffset support to CFTimeIndex resampling. Better adherence to PEP8 and other coding style conventions. * Support datetime.timedelta objects for loffset. Improved test coverage. * Removed support for Python 2 compatibility. * Updated pandas minversion to 0.24 as 0.24 is officially out. * Removed Python 2 support from test_cftimeindex_resample.py. * Moved full_index and first_items generation logic to a helper function so that the complexity of GroupBy.__init__ is reduced. * In groupby.py, moved s to _get_index_and_items helper function. * Removed redundant code from test_formatting.py due to bad merge. * Removed redundant test and simplify code now that dropna is implemented. * delete unnecessary test * eliminate some repetition

… build (#2736)

Fixes #2050 I'm not quite sure what was going on, but it passes now.

* Refactor (part of) dataset.py to use explicit indexes * Use copy.copy() * Ensure coordinate order is deterministic

* Fix CRS being WKT instead of PROJ.4 See https://github.com/mapbox/rasterio/blob/master/CHANGES.txt#L7 * Fix rasterio usage for older rasterio without to_proj4 Co-Authored-By: djhoese <david.hoese@ssec.wisc.edu> * Fix indentation on rasterio AttributeError check * Add CRS WKT fix to whats-new

* add h5netcdf+dask tests * pep8 * reactivate pynio/rasterio/iris in py36 test builds * revert changes to test_backends.py -- unrelated to this PR

* BUG: Pass kwargs to the FileManager for pynio engine (#2380) * TST: Added test for pynio kwargs passing (#2380) * Fixed formatting (#2380)

enable internal plotting with cftime datetime

Apparently I wasn't paying attention in my last PR :)

* WIP: fix regression about datetime_to_numeric * Workaround for object array * added a whatsnew * rearrange tests * lint * Added Variable._to_numeric * Fix for cftime * Update via comments * lint * Fix via comment * Fix errors * lint

* fix renaming * formatting * added tests * shoyer's solution * what's new

* add h5netcdf+dask tests * pep8 * pass encoding through to _replace_vars_and_dims in ds.chunk() * lint * _kwargs=None in roundtrip methods

* Update computation.py to use Python 3 function signatures This lets us remove lots of ugly explicit calls to ``kwargs.pop()``. * Lint / py35 fixup

* New test for reduce func which takes no axes * Fixed axis logic * Recorded fix in what's new * Added intermediate variable

* Add use_cftime option to open_dataset * Remove f-strings * Fix test-skipping logic and remove 'dummy' from warning * Note that use_cftime is only relevant for standard calendar dates * Move use_cftime option to CFDatetimeCoder constructor

* Quarter offset implemented (base is now latest pydata-master). * Fixed issues raised in review (#2721 (review)) * Updated whats-new.rst with info on quarter offset support. * Updated whats-new.rst with info on quarter offset support. * Update doc/whats-new.rst Co-Authored-By: jwenfai <jwenfai@gmail.com> * Added support for quarter frequencies when resampling CFTimeIndex. Less redundancy in CFTimeIndex resampling tests. * Removed normalization code (unnecessary for cftime_range) in cftime_offsets.py. Removed redundant lines in whats-new.rst. * Removed invalid option from _get_day_of_month docstring. Added tests back in that raises ValueError when resampling (base=24 when resampling to daily freq, e.g., '8D'). * Minor edits to docstrings/comments * lint

* ENH: Add Dataset.drop_dims() * Drops full dimensions and any corresponding variables in a Dataset * Fixes GH1949 * DOC: Add Dataset.drop_dims() documentation

* Added tests of desired name inferring behaviour * Infers names * updated what's new

It got deprecated in numpy 1.16 and throws a ton of warnings due to that. All the function does is returning .item() anyway, which is why it got deprecated.

gerritholl and others added 30 commits November 16, 2018 08:40

add missing , and article in error message (#2557)

70e9eb8

Add missing , and article in error message when attribute values have the wrong type.

DOC: fix computation.rst (#2567)

57fdcc5

Return slices when possible from CFTimeIndex.get_loc() (#2569)

9d572a5

This makes string indexing usable in a pandas.MultiIndex.

python setup.py test now works by default (#2573)

ecbf91f

DOC: remove example using Dataset.T (#2572)

a2a448d

* DOC: remove example using Dataset.T * Update reshaping.rst * Update reshaping.rst

Concat docstring typo (#2577)

483b8a0

Fix typo (#2578)

22a5763

fix examples (#2581)

0d6056e

* Fix multidimensional co-ordinate example. * Use open_dataset().load() in other example

Minor update to PR template (#2596)

77634d4

This version should be a little less noisy, since instructions only for authors are put in commented out HTML.

Fix h5netcdf saving scalars with filters or chunks (#2591)

53746c9

* Fix h5netcdf saving scalars with filters or chunks * Revert adding scalar dataset to central test function * Add fix description to what's new.

Add dayofyear and dayofweek accessors (#2599)

5d8ef5f

Fix wrong error message in interp() (#2598)

6881503

Temporarily mark dask-dev build as an allowed failure (#2602)

23483ad

use keep_attrs in binary operations II (#2590)

82789bc

* Add keep_attrs to binary_op * added test for binary_ops keep_attrs=True * PEP8 issues + blank lines removed * whitespace removed * keep_attrs in DataArray * enhancement * simpler testing

Bump cftime version in doc environment (#2604)

cbb32e1

Support HighLevelGraphs (#2603)

2223445

* Support HighLevelGraphs Fixes #4291 * test __dask_layers__ * Skip dependnecies test with old dask * Reenable dask-dev test on Travis-CI

fix a few typos in rst files (#2607)

09494eb

Fix parsing '_Unsigned' attribute (#2584)

f8cced7

* Fix parsing '_Unsigned' attribute Fixes #2583 * Fix encode step too. * Add tests. * Fix whats-new. * Undo unnecessary change * Yay! fix test failure.

doc fixes. (#2611)

090564c

* doc fixes Fixes #2610 * minor doc fixes. * Fix examples path for open statements.

Remove meaningless tz argument in cftime_range (#2613)

a4c9ab5

Remove failing Appveyor Python 2.7 32-bit build (#2617)

30288e8

There seems to be some sort of dependency issue on Appveyor, but it's not worth tracking down given how we'll be dropping Python 2.7 in the new year anyways.

.resample now supports loffset. (#2608)

778ffc4

* .resample now supports loffset. * Update whats-new.rst * Fix for pandas 0.19.2 * doc update. * Review comments.

CF: also decode time bounds when available (#2571)

57348ab

* CF: also decode time bounds when available * Fix failing test when cftime not present and what's new * Fix windows * Reviews * Reviews 2

FIX Don't raise a deprecation warning for xarray.ufuncs.{angle,iscomp…

a15587d

…lex} (#2615) * Don't raise a warning for xarray.ufuncs.angle * Update warning message

Get 0d slices of ndarrays directly from indexing (#2625)

ce52341

* Add test to ensure that 0d slices are views * Get 0d slices of ndarrays directly from indexing * Add 0d slice documentation

Fix multiindex selection (#2621)

b5059a5

* Fix multiindex selection * Support pandas0.19 * a bugfix * Do remove_unused_levels only once in unstack. * import algos * Remove unused import * Adopt local import

observingClouds and others added 29 commits January 31, 2019 09:28

Implement integrate (#2653)

4923039

* added integrate. * Docs * Update via comment * Update via comments * integrate can accept multiple dimensions. * using set instead of list * dim -> coord

deprecate compat & encoding (#2703)

d634f64

* deprecate compat & encoding * stacklevel * whatsnew * imports * merge conflicts * remove deprecations * removal date

dropna() for a Series indexed by a CFTimeIndex (#2734)

a1ff90b

* Support dropna() for a Series indexed by a CFTimeIndex * Add a what's new entry * Use == instead of is

add tests for handling of empty pandas objects in constructors (#2735)

0da9d62

* add tests for GH#697 - handling of empty pandas objects in constructors * make pep8 happy

remove bottleneck dev build from travis, this test env was failing to…

2ef3f0b

… build (#2736)

Reenable cross engine read write netCDF test (#2739)

053aed1

Fixes #2050 I'm not quite sure what was going on, but it passes now.

remove xfail from test_cross_engine_read_write_netcdf4 (#2741)

27cf53f

Refactor (part of) dataset.py to use explicit indexes (#2696)

e677b7a

* Refactor (part of) dataset.py to use explicit indexes * Use copy.copy() * Ensure coordinate order is deterministic

reintroduce pynio/rasterio/iris to py36 test env (#2738)

1abc45b

* add h5netcdf+dask tests * pep8 * reactivate pynio/rasterio/iris in py36 test builds * revert changes to test_backends.py -- unrelated to this PR

BUG: Pass kwargs to the FileManager for pynio engine (#2380) (#2732)

0dfc0e6

* BUG: Pass kwargs to the FileManager for pynio engine (#2380) * TST: Added test for pynio kwargs passing (#2380) * Fixed formatting (#2380)

remove references to cyordereddict (#2750)

e097763

enable internal plotting with cftime datetime (#2665)

8a1a8a1

enable internal plotting with cftime datetime

Fix mypy errors (#2753)

6d20766

Apparently I wasn't paying attention in my last PR :)

Fix name loss when masking (#2749)

07cfc5a

* fix renaming * formatting * added tests * shoyer's solution * what's new

add h5netcdf+dask tests (#2737)

fd9b0b0

* add h5netcdf+dask tests * pep8 * pass encoding through to _replace_vars_and_dims in ds.chunk() * lint * _kwargs=None in roundtrip methods

Update computation.py to use Python 3 function signatures (#2756)

2089382

* Update computation.py to use Python 3 function signatures This lets us remove lots of ugly explicit calls to ``kwargs.pop()``. * Lint / py35 fixup

typo in whats_new (#2763)

17fa64f

'standard' now refers to 'gregorian' in cftime_range (#2771)

cd8e370

Bugfix/reduce no axis (#2769)

57cd76d

* New test for reduce func which takes no axes * Fixed axis logic * Recorded fix in what's new * Added intermediate variable

Add use_cftime option to open_dataset (#2759)

612d390

* Add use_cftime option to open_dataset * Remove f-strings * Fix test-skipping logic and remove 'dummy' from warning * Note that use_cftime is only relevant for standard calendar dates * Move use_cftime option to CFDatetimeCoder constructor

Add Dataset.drop_dims (#2767)

e8eb83b

* ENH: Add Dataset.drop_dims() * Drops full dimensions and any corresponding variables in a Dataset * Fixes GH1949 * DOC: Add Dataset.drop_dims() documentation

Improve name concat (#2792)

c33dab2

* Added tests of desired name inferring behaviour * Infers names * updated what's new

Don't use deprecated np.asscalar() (#2800)

0c534b0

It got deprecated in numpy 1.16 and throws a ton of warnings due to that. All the function does is returning .item() anyway, which is why it got deprecated.

Add support for cftime.datetime coordinates with coarsen (#2778)

c770eec

phausamann merged commit b420005 into phausamann:master Mar 7, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge new changes in pydata master #4

Merge new changes in pydata master #4

phausamann commented Mar 7, 2019

Merge new changes in pydata master #4

Merge new changes in pydata master #4

Conversation

phausamann commented Mar 7, 2019