You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
At the moment Series.any and Series.all have inconsistent behavior for pyarrow vs. non-pyarrow dtypes for empty or all null data. The pyarrow behavior is coming from pyarrow.compute.any and pyarrow.compute.all which return null in these cases. I suspect different behavior here within pandas is likely to be problematic and confusing. Is it OK to diverge from pyarrow here in order to be more consistent within pandas?
Expected Behavior
Probably to match the behavior of non-pyarrow types
Installed Versions
.
The text was updated successfully, but these errors were encountered:
I think the difference is in min_count; pyarrow defaults to 1 whereas pandas (where it exists) defaults to 0. I think pandas should use the pandas default when calling pyarrow.
I think the difference is in min_count; pyarrow defaults to 1 whereas pandas (where it exists) defaults to 0. I think pandas should use the pandas default when calling pyarrow.
Pandas version checks
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandas.
I have confirmed this bug exists on the main branch of pandas.
Reproducible Example
Issue Description
At the moment
Series.any
andSeries.all
have inconsistent behavior for pyarrow vs. non-pyarrow dtypes for empty or all null data. The pyarrow behavior is coming frompyarrow.compute.any
andpyarrow.compute.all
which return null in these cases. I suspect different behavior here within pandas is likely to be problematic and confusing. Is it OK to diverge from pyarrow here in order to be more consistent within pandas?Expected Behavior
Probably to match the behavior of non-pyarrow types
Installed Versions
.
The text was updated successfully, but these errors were encountered: