BUG: in _nsorted for frame with duplicated values index #13428

Tux1 · 2016-06-12T04:11:50Z

closes BUG: in _nsorted for frame with duplicated values index #13412
tests added / passed
passes git diff upstream/master | flake8 --diff
whatsnew entry

Tux1 · 2016-06-12T08:09:59Z

What doesn't pass the tests?

sinhrks · 2016-06-12T12:22:27Z

You must fix the test so that it passes the flake8 check.

sinhrks · 2016-06-12T12:24:23Z

doc/source/whatsnew/v0.19.0.txt

@@ -81,3 +81,5 @@ Performance Improvements

 Bug Fixes
 ~~~~~~~~~
+
+- Bug in ``DataFrame._nsorted`` when data-frame has duplicated value index. (:issue:`13412`)


Describe the problem from the user's point of view. Users don't care about _nsorted.

codecov-io · 2016-06-12T12:50:12Z

Current coverage is 84.23%

Merging #13428 into master will increase coverage by <.01%

@@             master     #13428   diff @@
==========================================
  Files           138        138          
  Lines         50805      50810     +5   
  Methods           0          0          
  Messages          0          0          
  Branches          0          0          
==========================================
+ Hits          42796      42801     +5   
  Misses         8009       8009          
  Partials          0          0

Powered by Codecov. Last updated by 62b4327...9e47bbe

TomAugspurger · 2016-06-12T23:54:56Z

git diff upstream/master | flake8 --diff will help you track down the changes you have to make.

jreback · 2016-06-13T11:41:42Z

I think a simple solution will work here:

(Pdb) p self.loc[ser.index].head(n).sort_values(columns, ascending=ascending,kind='mergesort')
   a  b
1  3  2
(Pdb) p self.loc[ser.index]
   a  b
1  3  2
1  4  1
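
For reference, the debugger output above can be reproduced outside the pandas internals. This is a minimal sketch, assuming the frame is the one with index [0, 0, 1, 1] quoted later in this thread and that `ser.index` holds the single label 1. Both are assumptions, since the debugger session does not show them.

```python
import pandas as pd

# Assumed frame: the test case quoted later in this thread.
df = pd.DataFrame({'a': [1, 2, 3, 4], 'b': [4, 3, 2, 1]}, index=[0, 0, 1, 1])

# Label-based lookup returns every row that carries the label,
# so a single label yields two rows when the index has duplicates.
by_label = df.loc[[1]]
print(by_label)

# Taking head(n) before sorting keeps whichever row comes first
# in the frame, which is not necessarily the largest one.
n = 1
head_then_sort = by_label.head(n).sort_values('a', ascending=False,
                                              kind='mergesort')
print(head_then_sort)
```

With this frame the label lookup yields the rows with a=3 and a=4, and `head(1)` keeps a=3 even though a=4 is the larger value.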

Tux1 · 2016-06-13T12:05:45Z

~~@jreback what do you mean ? in _nsorted ? I don't think so~~

Testing. I think you're right

jreback · 2016-06-13T12:21:31Z

maybe the .head() should go at the end. I don't recall the exact guarantees of this.

Tux1 · 2016-06-13T13:01:53Z

I tested it and your solution doesn't work: it fails the test for this case:
df = pd.DataFrame({'a': [1, 2, 3, 4], 'b': [4, 3, 2, 1]}, index=[0, 0, 1, 1])

Any other suggestions?
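
One possible direction, sketched here only as an illustration and not as the change made in this PR, is to select rows by position instead of by label, so duplicated labels cannot pull in extra rows. The helper name `top_n_by_position` is hypothetical, and the sketch assumes a single numeric column.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'a': [1, 2, 3, 4], 'b': [4, 3, 2, 1]}, index=[0, 0, 1, 1])


def top_n_by_position(frame, n, column):
    """Hypothetical helper: return the n rows with the largest values in
    `column`, selected by integer position rather than by index label."""
    values = frame[column].to_numpy()
    # Stable sort on the negated values gives a descending order in which
    # ties keep their original relative order (numeric columns only).
    order = np.argsort(-values, kind='mergesort')
    return frame.iloc[order[:n]]


result = top_n_by_position(df, 1, 'a')
print(result)
```

Because `iloc` addresses rows by position, the result always has exactly `n` rows (or fewer if the frame is shorter), regardless of how many rows share a label.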

jreback · 2016-06-14T13:55:33Z

@Tux1 well then play around with it. What you are doing is WAY too complicated for a simple take.

jreback · 2016-09-09T22:40:48Z

can you rebase / update?

sinhrks added the Bug label Jun 12, 2016

sinhrks added this to the 0.18.2 milestone Jun 12, 2016

sinhrks reviewed Jun 12, 2016

Tux1 force-pushed the master branch from a77f735 to 31d058b Compare June 12, 2016 12:50

Tux1 force-pushed the master branch from 31d058b to b269d1d Compare June 12, 2016 12:52

Tux1 force-pushed the master branch from b269d1d to 9e47bbe Compare June 13, 2016 12:32

jreback removed this from the 0.18.2 milestone Jun 14, 2016

Tux1 closed this Oct 28, 2016

Tux1 force-pushed the master branch from 9e47bbe to 096d886 Compare October 28, 2016 15:59
