Feature Request: Efficient rolling with strides #3608

niowniow · 2019-12-10T12:38:59Z

Xarray is facing the same issues in its current rolling implementation (DataArrayRolling and DatasetRolling) as described in this pandas issue. Namely, the construct methods stride parameter is applied after the rolling is computed. Technically, we are computing more than we would need to because we partially throwing it away due to striding.

In PR #3607 the issue is solved for the ...Rolling's __iter__ function but not for the construct, reduce and _bottleneck_reduce methods.
Since the way Xarray's rolling is implemented relies on numpy, we could introduce a sliding window function as described here.

Any opinions?

The text was updated successfully, but these errors were encountered:

niowniow · 2019-12-10T13:05:10Z

Previous enhancement requests asking for a stride argument to rolling: pandas-dev/pandas#15354, pandas-dev/pandas#22976, pandas-dev/pandas#27654 (comment), dask/dask#4659, numpy/numpy#7753

Originally posted by @pilkibun in pandas-dev/pandas#26959 (comment)

GenevieveBuckley · 2021-03-31T00:24:39Z

Is this useful/relevant for you? dask/dask#7234

keewis · 2021-03-31T09:26:03Z

@dcherian, should this have been closed by #4977?

dcherian · 2021-03-31T15:27:47Z

No. but this should be really easy to fix.

construct already supports stride.

reduce can easily support stride by passing it on here:

xarray/xarray/core/rolling.py

Lines 441 to 443 in 57a4479

    
           windows = self._construct( 
        
               obj, rolling_dim, keep_attrs=keep_attrs, fill_value=fillna 
        
           )

We should also add it to _reduce_method

bottleneck does not support stride so we can only use a .isel call at the end of _bottleneck_reduce

xarray/xarray/core/rolling.py

Line 518 in 57a4479

return DataArray(values, self.obj.coords, attrs=attrs, name=self.obj.name)

Note: sliding_window_view does not support stride because it's easy to stride after constructing that view (i saw this on some numpy issue)

niowniow · 2021-05-31T12:50:37Z

Quickly glancing over sliding_window_view I didn't immediately understand how to use it with stride. Would I need to

transform the DataArray to dask array using chunk (which may involve an overhead!?),
then use rolling which itself uses sliding_window_view because its a dask array!?
Then use isel with stride on the new dimension?

reduce can easily support stride by passing it on here:

I think that's what I did in #3607. It's been a while

dcherian · 2021-05-31T19:09:00Z

sliding_window_view is a numpy function (see npcompat.py), so you need not transform to dask.

For reduce I think we just have to pass stride to _construct as in #3608 (comment) and for bottleneck insert the .isel call (copied from the end of _construct) after the DataArray is constructed.

kmsquire · 2021-07-21T17:53:59Z

Question: instead of adding stride to reduce and _reduce_method, why not add it as a member of DataArrayRolling directly? This would allow, e.g., __iter__ to use it as well, and seems like a cleaner interface.

I've been confused why some parameters are available only in construct (stride, fill_value), some are available both in construct and in the DataArrayRolling constructor (keep_attrs), and some are only available in the constructor (min_periods, center, and soon pad).

niowniow · 2021-07-28T11:58:28Z

You are right. It's quite confusing. I've already added a stride parameter in my PR #3607
I didn't follow through with it and at the moment the checks are not successful anymore.
Maybe someone else could give an opinion on the pro/cons of a stride parameter in rolling?

dcherian added the topic-rolling label Feb 18, 2021

headtr1ck mentioned this issue Oct 29, 2024

Sliding window (mix of rolling and coarsen) #9696

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: Efficient rolling with strides #3608

Feature Request: Efficient rolling with strides #3608

niowniow commented Dec 10, 2019 •

edited

Loading

niowniow commented Dec 10, 2019

GenevieveBuckley commented Mar 31, 2021

keewis commented Mar 31, 2021

dcherian commented Mar 31, 2021

niowniow commented May 31, 2021

dcherian commented May 31, 2021

kmsquire commented Jul 21, 2021

niowniow commented Jul 28, 2021

Feature Request: Efficient rolling with strides #3608

Feature Request: Efficient rolling with strides #3608

Comments

niowniow commented Dec 10, 2019 • edited Loading

niowniow commented Dec 10, 2019

GenevieveBuckley commented Mar 31, 2021

keewis commented Mar 31, 2021

dcherian commented Mar 31, 2021

niowniow commented May 31, 2021

dcherian commented May 31, 2021

kmsquire commented Jul 21, 2021

niowniow commented Jul 28, 2021

niowniow commented Dec 10, 2019 •

edited

Loading