Make constructing slices lazily. #1994

fujiisoup · 2018-03-15T23:15:26Z

Closes DataArray.rolling().mean() is way slower than it should be #1993
Tests passed
Fully documented, including whats-new.rst for all changes.

Quick fix of #1993.

With this fix, the script shown in #1993 runs
Bottleneck: 0.08317923545837402 s
Pandas: 1.3338768482208252 s
Xarray: 1.1349339485168457 s

shoyer

It would be nice to add a benchmark for rolling computations :).

shoyer · 2018-03-15T23:39:19Z

xarray/core/rolling.py


-        self._setup_windows()
+        self.window_labels = self.obj[self.dim]
+        self._stops = np.arange(1, len(self.window_labels) + 1)


can we do this only when iteration is requested? that would be a bit more efficient for the common case where iteration is not done.

My guess is that iteration is slow enough that the overhead of recreating these objects will not be noticeable.

shoyer · 2018-03-15T23:54:05Z

The benchmark does not need to be complete, but it would be good to ensure we don't have a regression here.

…for long arrays.

fujiisoup · 2018-03-16T00:38:33Z

asv_bench/benchmarks/rolling.py

+            se = self.da_long.to_series()
+            getattr(se.rolling(window=window), func)()
+        else:
+            getattr(self.da_long.rolling(x=window), func)()


Also added a benchmark using pandas for long 1d-array, in order to make it easier to find the cause of the regression.
But is it too verbose as pandas might make the similar benchmark tests?

I'm OK with this.

shoyer

Looks good to me, thanks!

shoyer · 2018-03-17T19:07:59Z

asv_bench/benchmarks/rolling.py

+            se = self.da_long.to_series()
+            getattr(se.rolling(window=window), func)()
+        else:
+            getattr(self.da_long.rolling(x=window), func)()


I'm OK with this.

shoyer · 2018-03-17T19:08:33Z

xarray/core/rolling.py


    def __iter__(self):
-        for (label, indices) in zip(self.window_labels, self.window_indices):
-            window = self.obj.isel(**{self.dim: indices})
+        _stops = np.arange(1, len(self.window_labels) + 1)


nit: no need to preface with an underscore for variables that are already limited in scope to this method.

fujiisoup added 2 commits March 16, 2018 08:05

Make constructing slices lazily.

3e0b352

Additional speedup

ef35ae6

shoyer reviewed Mar 15, 2018

View reviewed changes

fujiisoup added 2 commits March 16, 2018 09:12

Move some lines in DataArrayRolling into __iter__. Added a benchmark …

151adfb

…for long arrays.

Bugfix in benchmark

5e290d2

fujiisoup commented Mar 16, 2018

View reviewed changes

shoyer approved these changes Mar 17, 2018

View reviewed changes

remove underscores.

3f8aad3

fujiisoup merged commit 1d0fbe6 into pydata:master Mar 18, 2018

fujiisoup deleted the lazy_rolling_window branch March 18, 2018 08:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make constructing slices lazily. #1994

Make constructing slices lazily. #1994

fujiisoup commented Mar 15, 2018

shoyer left a comment

shoyer Mar 15, 2018

fujiisoup Mar 16, 2018

shoyer commented Mar 15, 2018

fujiisoup Mar 16, 2018 •

edited

Loading

shoyer Mar 17, 2018

shoyer left a comment

shoyer Mar 17, 2018

shoyer Mar 17, 2018

Make constructing slices lazily. #1994

Make constructing slices lazily. #1994

Conversation

fujiisoup commented Mar 15, 2018

shoyer left a comment

Choose a reason for hiding this comment

shoyer Mar 15, 2018

Choose a reason for hiding this comment

fujiisoup Mar 16, 2018

Choose a reason for hiding this comment

shoyer commented Mar 15, 2018

fujiisoup Mar 16, 2018 • edited Loading

Choose a reason for hiding this comment

shoyer Mar 17, 2018

Choose a reason for hiding this comment

shoyer left a comment

Choose a reason for hiding this comment

shoyer Mar 17, 2018

Choose a reason for hiding this comment

shoyer Mar 17, 2018

Choose a reason for hiding this comment

fujiisoup Mar 16, 2018 •

edited

Loading