False positive for "nested renamer is not supported" error #32156

rth · 2020-02-21T15:53:54Z

Code Sample

import pandas as pd

df = pd.DataFrame({'A': [1, 1, 1, 2, 2],  'B': range(5), 'C': range(5)})
df.groupby('A').agg({'B': 'sum', 'G': 'min'})  # aggregate by a non existing column

produces

<ipython-input-5-f5ac34bf856f> in <module>
----> 1 df.groupby('A').agg({'B': 'sum', 'G': 'min'})

~/src/dgr00/.venv3/lib/python3.6/site-packages/pandas/core/groupby/generic.py in aggregate(self, func, *args, **kwargs)
    938         func = _maybe_mangle_lambdas(func)
    939
--> 940         result, how = self._aggregate(func, *args, **kwargs)
    941         if how is None:
    942             return result

~/src/dgr00/.venv3/lib/python3.6/site-packages/pandas/core/base.py in _aggregate(self, arg, *args, **kwargs)
    364                     obj.columns.intersection(keys)
    365                 ) != len(keys):
--> 366                     raise SpecificationError("nested renamer is not supported")
    367
    368             from pandas.core.reshape.concat import concat

SpecificationError: nested renamer is not supported

Problem description

While groupby.agg() with a dictionary when renaming was deprecated in 0.20 (https://pandas.pydata.org/pandas-docs/stable/whatsnew/v0.20.0.html#deprecate-groupby-agg-with-a-dictionary-when-renaming) and removed in 1.0, the corresponding error message can also be obtained when aggregating by a non-existent column, which can lead to confusion.

Expected Output

Error saying that the column G does not exist.
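
As a stopgap on affected versions, the expected behaviour can be approximated by validating the dictionary keys before calling agg(). This is only a minimal sketch: `checked_agg` is a hypothetical helper name, not part of pandas, and the message wording is chosen for illustration.

```python
import pandas as pd


def checked_agg(df, by, spec):
    """Hypothetical helper: check that every key in `spec` is a column, then aggregate."""
    missing = [col for col in spec if col not in df.columns]
    if missing:
        # Raise a message that names the offending column(s) directly.
        raise KeyError(f"Column(s) {missing} do not exist")
    return df.groupby(by).agg(spec)


df = pd.DataFrame({'A': [1, 1, 1, 2, 2], 'B': range(5), 'C': range(5)})

# A valid specification passes straight through to groupby().agg().
ok = checked_agg(df, 'A', {'B': 'sum', 'C': 'min'})

# A specification with a missing column fails with an explicit message.
try:
    checked_agg(df, 'A', {'B': 'sum', 'G': 'min'})
except KeyError as exc:
    message = str(exc)
```

The check runs before pandas sees the specification, so it behaves the same regardless of which message the installed pandas version would raise.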

Output of `pd.show_versions()`

INSTALLED VERSIONS

python : 3.6.4.final.0
OS : Linux
machine : x86_64
pandas : 1.0.1

Juan-132 · 2020-02-29T22:22:41Z

Confirmed. The incorrect error message threw me off as well.

luciofaso · 2020-03-04T09:51:59Z

Same problem

tanglade22 · 2020-03-12T15:21:19Z

Same problem here, bug is due to non-existing column

twang96 · 2020-03-20T20:21:47Z

Same problem here. Does anyone have a workaround? Thanks.

foxx · 2020-03-22T12:23:24Z

Can confirm, got the same exception due to a non-existing column name.

benjamin-ny · 2020-03-24T09:51:04Z

It's also inconsistent:

import pandas as pd
df = pd.DataFrame({'A': [1, 1, 1, 2, 2],  'B': range(5), 'C': range(5)})
df.groupby('A').agg({'B': 'sum', 'G': ['min']})  # <- use ['min'] instead of 'min'

raises the correct error: KeyError: "Column 'G' does not exist!"
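
To see what a given installation does with the two forms, a small comparison like the one below may help. It is a sketch only: `error_name` is a hypothetical helper, and the exception type and message reported will depend on the pandas version in use, so no particular output is assumed here.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 1, 1, 2, 2], 'B': range(5), 'C': range(5)})


def error_name(spec):
    """Hypothetical helper: return the name of the exception agg() raises, or None."""
    try:
        df.groupby('A').agg(spec)
    except Exception as exc:  # deliberately broad: the type varies by version
        return type(exc).__name__
    return None


# Same missing column 'G', once with a scalar function and once with a list.
scalar_form = error_name({'B': 'sum', 'G': 'min'})
list_form = error_name({'B': 'sum', 'G': ['min']})
print(scalar_form, list_form)
```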

KingPegasus · 2020-03-27T08:31:49Z

You wrote G instead of C. 'G' is nothing. 'C' is the column.
import pandas as pd
df = pd.DataFrame({'A': [1, 1, 1, 2, 2], 'B': range(5), 'C': range(5)})
df.groupby('A').agg({'B': 'sum', 'C': 'min'})

Applying multiple aggregations to a column:
df = pd.DataFrame({'A': [1, 1, 1, 2, 2], 'B': range(5), 'C': range(5)})
df.groupby('A').agg({'B': ['sum','min'], 'C': ['sum','min']})

thoo · 2020-03-27T16:35:54Z

I think this one is also related.

I tried to rename the column right after groupby the way it was done in pandas < 1.0. I no longer get the deprecation warning that I got in pandas < 1.0.

Here is the example:

df = pd.DataFrame({'A': [1, 1, 1, 2, 2],'B': range(5)})
df.groupby('A').agg({'B': {'foo': 'sum'}})

The error message is:

---------------------------------------------------------------------------
SpecificationError                        Traceback (most recent call last)
<ipython-input-22-440a616816b6> in <module>
      1 df = pd.DataFrame({'A': [1, 1, 1, 2, 2],'B': range(5)})
----> 2 df.groupby('A').agg({'B': {'foo': 'sum'}})

~/anaconda3/lib/python3.7/site-packages/pandas/core/groupby/generic.py in aggregate(self, func, *args, **kwargs)
    926         func = _maybe_mangle_lambdas(func)
    927 
--> 928         result, how = self._aggregate(func, *args, **kwargs)
    929         if how is None:
    930             return result

~/anaconda3/lib/python3.7/site-packages/pandas/core/base.py in _aggregate(self, arg, *args, **kwargs)
    340                     # {'ra' : { 'A' : 'mean' }}
    341                     if isinstance(v, dict):
--> 342                         raise SpecificationError("nested renamer is not supported")
    343                     elif isinstance(obj, ABCSeries):
    344                         raise SpecificationError("nested renamer is not supported")

SpecificationError: nested renamer is not supported
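
For comparison, the same rename can be written with named aggregation (keyword arguments of the form `new_name=(column, func)`), which pandas has supported since 0.25 as the replacement for the nested dictionary. A minimal sketch using the frame from this comment:

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 1, 1, 2, 2], 'B': range(5)})

# Named aggregation: the output column 'foo' is the sum of 'B' per group.
result = df.groupby('A').agg(foo=('B', 'sum'))
print(result)
```

Here the "nested renamer" error is the accurate one for the dictionary form, since the value for 'B' really is a nested dict.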

thoo · 2020-03-27T16:37:05Z

> You wrote G instead of C. 'G' is nothing. 'C' is the column.
> import pandas as pd
> df = pd.DataFrame({'A': [1, 1, 1, 2, 2], 'B': range(5), 'C': range(5)})
> df.groupby('A').agg({'B': 'sum', 'C': 'min'})
>
> Applying multiple aggregations to a column:
> df = pd.DataFrame({'A': [1, 1, 1, 2, 2], 'B': range(5), 'C': range(5)})
> df.groupby('A').agg({'B': ['sum','min'], 'C': ['sum','min']})

We are expecting an appropriate error message. The current error message does not point in the right direction.

crash5936 · 2020-04-08T22:04:56Z

I also ran into this error and as mentioned, it was caused by trying to aggregate a non-existent column. Version 1.0.3

sibiyes · 2020-05-04T12:08:44Z

I got the same error when there are duplicate columns in the DataFrame.
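
A quick way to check whether duplicate column labels are involved is to inspect `df.columns.duplicated()`. The frame below is made up for illustration, and keeping only the first occurrence of each label is just one possible way to resolve the duplicates.

```python
import pandas as pd

# Illustrative frame with the label 'B' appearing twice.
df = pd.DataFrame([[1, 2, 3], [1, 4, 5]], columns=['A', 'B', 'B'])

# Labels that occur more than once.
dupes = df.columns[df.columns.duplicated()].unique().tolist()

# One option: keep only the first column for each label before aggregating.
deduped = df.loc[:, ~df.columns.duplicated()]
result = deduped.groupby('A').agg({'B': 'sum'})
```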

NikkiShah93 · 2020-06-18T01:05:42Z

I only get this error when I run my code from the command line or Git Bash; when I run it in Jupyter it works fine. What's the best way to solve this while still using agg()?

janithahn · 2020-06-21T13:32:06Z

Instead of using .agg({'B': 'sum', 'G': 'min'}), try passing it as a list of tuples like .agg([('B', 'sum'), ('G', 'min')]).

icm-ai · 2020-06-23T07:38:34Z

Thanks a lot to the commenter above.

timhunderwood · 2020-06-28T09:18:44Z

I think this has been fixed on master: see issue #32755 and PR #32836.

I now get the below error message on master, rather than the "nested renamer" error:

import pandas as pd
df = pd.DataFrame({'A': [1, 1, 1, 2, 2],  'B': range(5), 'C': range(5)})
df.groupby('A').agg({'B': 'sum', 'G': 'min'}) 

Traceback (most recent call last):
  File "C:\Users\timhu\Anaconda3\envs\pandas-dev-2\lib\site-packages\IPython\core\interactiveshell.py", line 3331, in run_code
    exec(code_obj, self.user_global_ns, self.user_ns)
  File "<ipython-input-2-08e47b9415b3>", line 4, in <module>
    df.groupby('A').agg({'B': 'sum', 'G': 'min'})
  File "c:\users\timhu\documents\code\pandas\pandas\core\groupby\generic.py", line 948, in aggregate
    result, how = self._aggregate(func, *args, **kwargs)
  File "c:\users\timhu\documents\code\pandas\pandas\core\base.py", line 354, in _aggregate
    raise SpecificationError(f"Column(s) {cols} do not exist")
pandas.core.base.SpecificationError: Column(s) ['G'] do not exist

I think this issue can be closed.

mroeschke · 2021-12-28T02:13:49Z

Looks to be addressed by #32836

charlesdong1991 added the Error Reporting label Feb 23, 2020

mroeschke added the Bug label Apr 8, 2020

peerchemist mentioned this issue Apr 15, 2020

Resampling causes exception (with pandas 1.0.3) peerchemist/finta#57

Closed

mroeschke added the Apply and Groupby labels Jul 29, 2021

mroeschke closed this as completed Dec 28, 2021
