BUG: DataFrame.merge(suffixes=) does not respect None #24782

simonjayhawkins · 2019-01-15T16:41:25Z

>>> import pandas as pd
>>> print(pd.__version__)
0.24.0rc1+6.gabfe72d7c.dirty
>>> from pandas import DataFrame
>>> df = DataFrame([1])
>>> df
   0
0  1
>>>
>>> df.columns
RangeIndex(start=0, stop=1, step=1)
>>>
>>> result = df.merge(df, left_index=True, right_index=True, suffixes=(None,'_dup'))
>>> result
   0None  0_dup
0      1      1
>>>
>>> result.columns
Index(['0None', '0_dup'], dtype='object')
>>>
>>> expected = result.rename(columns = {'0None':0})
>>> expected
   0  0_dup
0  1      1
>>>
>>> expected.columns
Index([0, '0_dup'], dtype='object')

specifying an empty string changes the dtype of the column label

>>> result = df.merge(df, left_index=True, right_index=True, suffixes=('','_dup'))
>>> result
   0  0_dup
0  1      1
>>> result.columns
Index(['0', '0_dup'], dtype='object')

WillAyd · 2019-01-15T18:43:16Z

Thanks for the report. Makes sense to have a more robust handling of None here - PRs are always welcome

charlesdong1991 · 2019-01-17T20:09:16Z

I am on it!

simonjayhawkins · 2019-02-06T11:04:28Z

Thanks @charlesdong1991

There is a bug in Pandas < 0.25.0 that adds the string "None" as a suffix to column names on merging. That impacts pipelines such as those using `CombSumTransformer`, causing an error on searching columns such as "score" (because it became "scoreNone" instead). Issue describing the error: pandas-dev/pandas#24782 ```python >>> result = df.merge(df, left_index=True, right_index=True, suffixes=(None,'_dup')) >>> result 0None 0_dup 0 1 1 >>> result.columns Index(['0None', '0_dup'], dtype='object') ``` There is also the possibility of other recent versions (current is 1.2.4), I'm using the 1.0.1 and it looks fine for now.

mroeschke added Bug Reshaping Concat, Merge/Join, Stack/Unstack, Explode labels Jan 15, 2019

charlesdong1991 mentioned this issue Jan 17, 2019

BUG: DataFrame.merge(suffixes=) does not respect None #24819

Merged

4 tasks

jreback added this to the 0.25.0 milestone Feb 4, 2019

jreback closed this as completed in #24819 Feb 6, 2019

charlesdong1991 mentioned this issue Feb 9, 2019

ENH: accept None behaviour for suffixes in DataFrame.merge #25242

Closed

albertoueda mentioned this issue May 12, 2021

Update pandas version in requirements.txt terrier-org/pyterrier#159

Merged

simonjayhawkins mentioned this issue May 16, 2022

Merge nonstring columns #46879

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: DataFrame.merge(suffixes=) does not respect None #24782

BUG: DataFrame.merge(suffixes=) does not respect None #24782

simonjayhawkins commented Jan 15, 2019

WillAyd commented Jan 15, 2019

charlesdong1991 commented Jan 17, 2019

simonjayhawkins commented Feb 6, 2019

BUG: DataFrame.merge(suffixes=) does not respect None #24782

BUG: DataFrame.merge(suffixes=) does not respect None #24782

Comments

simonjayhawkins commented Jan 15, 2019

WillAyd commented Jan 15, 2019

charlesdong1991 commented Jan 17, 2019

simonjayhawkins commented Feb 6, 2019