`Dataset.broadcast_like(other)` should broadcast against like variables in other #6549

headtr1ck · 2022-04-30T17:51:37Z

Is your feature request related to a problem?

I am a bit puzzled about how xarrays is broadcasting Datasets.
It seems to always add all dimensions to all variables.
Is this what you want in general?

See this example:

import xarray as xr

da = xr.DataArray([[1, 2, 3]], dims=("x", "y"))
# <xarray.DataArray (x: 1, y: 3)>
# array([[1, 2, 3]])
ds = xr.Dataset({"a": ("x", [1]), "b": ("z", [2, 3])})
# <xarray.Dataset>
# Dimensions:  (x: 1, z: 2)
# Dimensions without coordinates: x, z
# Data variables:
#     a        (x) int32 1
#     b        (z) int32 2 3
ds.broadcast_like(da)

# returns:
# <xarray.Dataset>
# Dimensions:  (x: 1, y: 3, z: 2)
# Dimensions without coordinates: x, y, z
# Data variables:
#     a        (x, y, z) int32 1 1 1 1 1 1
#     b        (x, y, z) int32 2 3 2 3 2 3

# I think it should return:
# <xarray.Dataset>
# Dimensions:  (x: 1, y: 3, z: 2)
# Dimensions without coordinates: x, y, z
# Data variables:
#     a        (x, y) int32 1 1 1  # notice here without "z" dim
#     b        (x, y, z) int32 2 3 2 3 2 3

Describe the solution you'd like

I would like broadcasting to behave the same way as e.g. a simple addition.
In the upper example da + ds produces the dimensions that I want.

Describe alternatives you've considered

ds + xr.zeros_like(da) this works, but seems more like a "dirty hack".

Additional context

Maybe one can add an option to broadcasting that controls this behavior?

The text was updated successfully, but these errors were encountered:

keewis · 2022-04-30T18:03:51Z

see also #6304 which covers xr.broadcast

headtr1ck · 2022-04-30T18:26:45Z

see also #6304 which covers xr.broadcast

I tried adding a join input to Dataset.broadcast_like and passing it to align, but that did not work (at least for join="inner"). Still got the same result...

headtr1ck · 2022-05-01T14:37:43Z

related to #6227

dcherian · 2025-02-06T15:21:50Z

I keep misunderstanding this issue so typing this out to make sure I got it right.

Writing out dimension names in square brackets

ds['a': 'x', 'b': 'z'].broadcast_like(da: ['x', 'y']) -> ds['a': ['x', y'], 'b': ['x', 'y', 'z']]

IIUC the request is to avoid broadcasting the variables in ds against each other, and to only broadcast each variable against da separately. Did I get it right?

dcherian · 2025-02-06T15:46:54Z

@mjwillson posted a nice summary in #10031 :

I had to summarize the overall problem, it's that behaviour of xarray.broadcast (and Dataset.broadcast_like etc) is not consistent with how actual arithmetic operations broadcast, in cases where Datasets are involved.

alvarosg · 2025-02-06T15:47:33Z

In the light of #10031, and broadcasting a dataset to itself not behaving as a no-op, should we label this as "bug" rather than "enhancement"?

headtr1ck added the enhancement label Apr 30, 2022

headtr1ck mentioned this issue Apr 30, 2022

polyval: Use Horner's algorithm + support chunked inputs #6548

Merged

3 tasks

mjwillson mentioned this issue Feb 6, 2025

Behaviour from Dataset.broadcast_like is strange and inconsistent with how arithmetic ops on Datasets actually broadcast #10031

Closed

5 tasks

dcherian changed the title ~~Improved Dataset broadcasting~~ Add a broadcasting mode that inserts size-1 labeled dimensions Feb 6, 2025

dcherian changed the title ~~Add a broadcasting mode that inserts size-1 labeled dimensions~~ Add a broadcasting mode that inserts size-1 unlabeled dimension Feb 6, 2025

dcherian changed the title ~~Add a broadcasting mode that inserts size-1 unlabeled dimension~~ Improved Dataset.broadcasting Feb 6, 2025

dcherian changed the title ~~Improved Dataset.broadcasting~~ Dataset.broadcast_like(other) should broadcast against like variables in other Feb 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`Dataset.broadcast_like(other)` should broadcast against like variables in other #6549

`Dataset.broadcast_like(other)` should broadcast against like variables in other #6549

headtr1ck commented Apr 30, 2022

keewis commented Apr 30, 2022

headtr1ck commented Apr 30, 2022

headtr1ck commented May 1, 2022

dcherian commented Feb 6, 2025 •

edited

Loading

dcherian commented Feb 6, 2025

alvarosg commented Feb 6, 2025 •

edited

Loading

Dataset.broadcast_like(other) should broadcast against like variables in other #6549

Dataset.broadcast_like(other) should broadcast against like variables in other #6549

Comments

headtr1ck commented Apr 30, 2022

Is your feature request related to a problem?

Describe the solution you'd like

Describe alternatives you've considered

Additional context

keewis commented Apr 30, 2022

headtr1ck commented Apr 30, 2022

headtr1ck commented May 1, 2022

dcherian commented Feb 6, 2025 • edited Loading

dcherian commented Feb 6, 2025

alvarosg commented Feb 6, 2025 • edited Loading

`Dataset.broadcast_like(other)` should broadcast against like variables in other #6549

`Dataset.broadcast_like(other)` should broadcast against like variables in other #6549

dcherian commented Feb 6, 2025 •

edited

Loading

alvarosg commented Feb 6, 2025 •

edited

Loading