ENH: allow EA to register types for is_scalar #27462

Open
jbrockmendel opened this issue Jul 19, 2019 · 7 comments
Labels
Enhancement ExtensionArray Extending pandas with custom dtypes or arrays.

Comments

@jbrockmendel (Member)

#27461 (comment)

I think we need a way for an EA to hook into this for an EA scalar;
e.g. an IPAddress from cyberpandas could register as a scalar, I think.

Before we move on this, I think we need to clarify in which situations we care about lib.is_scalar(x) vs the simpler np.ndim(x) == 0

@TomAugspurger (Contributor)

One example is for nested data. In this case we need something like scalar_for_dtype(value, dtype), since the ndim of a "scalar" for a nested data type would be > 0.
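To make the nested-data case concrete, here is a small sketch. The function scalar_for_dtype is hypothetical (it does not exist in pandas); it only illustrates why a dtype-aware check is needed where np.ndim(x) == 0 fails:

```python
import numpy as np

# For a list-of-ints dtype, a single "scalar" element is itself a list,
# so an ndim check would wrongly classify it as non-scalar.
value = [1, 2, 3]      # one element of a nested (list) dtype
print(np.ndim(value))  # 1, not 0

def scalar_for_dtype(value, dtype):
    # Hypothetical sketch: the dtype decides what counts as its scalar.
    if dtype == "list":
        return isinstance(value, list)
    return np.ndim(value) == 0

print(scalar_for_dtype(value, "list"))   # True
print(scalar_for_dtype(value, "int64"))  # False
```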

@jbrockmendel jbrockmendel added API Design ExtensionArray Extending pandas with custom dtypes or arrays. labels Jul 23, 2019
@jorisvandenbossche (Member)

An alternative to registering could be a method on the dtype/array that checks whether a value is a valid scalar?
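A minimal sketch of that alternative. Both classes are toy stand-ins (not the real cyberpandas types), and the method name _is_valid_scalar is invented; only the idea, that the dtype itself vouches for its scalars instead of a global registry, comes from the comment above:

```python
class IPAddress:
    """Toy EA scalar (stand-in for cyberpandas' IP address scalar)."""
    def __init__(self, address):
        self.address = address

class IPDtype:
    """Toy stand-in for an ExtensionDtype; _is_valid_scalar is hypothetical."""
    na_value = None

    def _is_valid_scalar(self, value):
        # The dtype decides what counts as one of its scalars,
        # so pandas would not need a global type registry.
        return isinstance(value, IPAddress) or value is self.na_value

dtype = IPDtype()
print(dtype._is_valid_scalar(IPAddress("192.168.0.1")))  # True
print(dtype._is_valid_scalar([1, 2, 3]))                 # False
```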

@sterlinm

Hi! I think I've run into this issue in my own attempt at building an ExtensionArray, and I was curious whether there had been any progress on this, or whether it was something I could potentially contribute to.

I've been working on an extension array where the na_value I want to return for the ExtensionDtype is not recognized as a scalar by is_scalar. That seems to cause issues, which I can't figure out how to fix, with some methods that aren't part of the ExtensionArray interface (e.g. Series.where).

Is there another workaround for this that I haven't found yet? Thanks!

@jbrockmendel (Member, Author)

Is there another workaround for this that I haven't found yet?

The only thought that comes to mind is trying to replace is_scalar checks with not is_list_like checks. Last time I checked (worth double-checking since this was a while ago), is_list_like was faster than is_scalar anyway, and it should be more robust to this problem.
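To show why the inverted check is more robust: is_scalar and is_list_like are real functions in pandas.api.types, while the IPAddress class below is a hypothetical EA scalar that pandas knows nothing about:

```python
from pandas.api.types import is_list_like, is_scalar

class IPAddress:
    """Hypothetical EA scalar type, unknown to pandas."""
    def __init__(self, address):
        self.address = address

ip = IPAddress("192.168.0.1")

# is_scalar only recognizes a fixed set of known types,
# so a custom EA scalar fails the check:
print(is_scalar(ip))         # False

# But the object is not iterable either, so the inverted
# list-like check classifies it correctly:
print(not is_list_like(ip))  # True
```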

@sterlinm

Thanks very much! It looks like that change has already been made in a number of places in the most recent versions of pandas (I was testing on 1.3).

Thanks for your help and sorry to bother you!

@andrewgsavage

andrewgsavage commented Oct 2, 2022

Now that is_list_like interprets scalars correctly (#44626), this is the main issue holding back pint-pandas.

A few different approaches have been suggested in this issue since it was created. What's the recommended way to fix this at the moment?

edit: I was able to get all tests in pint-pandas passing without this, so it may not be needed.

@jbrockmendel (Member, Author)

I looked at this in April, but writing up my conclusions fell through the cracks.

Many of the places where we use is_scalar (and also is_list_like) are either

  1. a preliminary check of whether we can use the value as a scalar in __setitem__, or
  2. a check of whether to treat the value as a single label vs. a sequence of labels for indexing.

In the latter case, is_scalar is behaving like a faster is_hashable (58 ns vs. 506 ns when called on []).

In the former case, we should be able to use an EA-specific method to check whether the item is a scalar that is valid for the specific array at hand. We already have something like this for most of our internal EAs (DatetimeArray, TimedeltaArray, PeriodArray, Categorical, PandasArray, IntervalArray, and MaskedArray all have _validate_setitem_value; ArrowExtensionArray has _maybe_convert_setitem_value).
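A minimal sketch of that pattern outside pandas internals. The class and its values are invented for illustration; only the method name _validate_setitem_value matches the internal convention mentioned above:

```python
class ParityArray:
    """Toy EA-like array that only holds the strings "even" and "odd"."""

    _valid_scalars = frozenset({"even", "odd"})

    def __init__(self, data):
        self._data = list(data)

    def _validate_setitem_value(self, value):
        # Instead of consulting a global is_scalar registry, the array
        # itself decides whether `value` is a scalar it can store.
        if value not in self._valid_scalars:
            raise TypeError(f"invalid scalar for ParityArray: {value!r}")
        return value

    def __setitem__(self, key, value):
        self._data[key] = self._validate_setitem_value(value)

arr = ParityArray(["even", "odd"])
arr[0] = "odd"     # accepted by the array's own validator
try:
    arr[1] = 3     # not a valid scalar for this array
except TypeError as exc:
    print(exc)     # prints the rejection message
```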

6 participants