Implement sort_values for Index/MultiIndex #1120

itholic · 2019-12-12T03:21:27Z

Implement sort_values for Index/MultiIndex
(https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Index.sort_values.html#pandas.Index.sort_values)

>>> idx = ks.Index([10, 100, 1, 1000])
>>> idx
Int64Index([10, 100, 1, 1000], dtype='int64')

>>> idx.sort_values()
Int64Index([1, 10, 100, 1000], dtype='int64')

>>> idx.sort_values(ascending=False)
Int64Index([1000, 100, 10, 1], dtype='int64')

Support for MultiIndex.

>>> kidx = ks.MultiIndex.from_tuples([('a', 'x', 1), ('c', 'y', 2), ('b', 'z', 3)])
>>> kidx
MultiIndex([('a', 'x', 1),
            ('c', 'y', 2),
            ('b', 'z', 3)],
           )

>>> kidx.sort_values()
MultiIndex([('a', 'x', 1),
            ('b', 'z', 3),
            ('c', 'y', 2)],
           )

>>> kidx.sort_values(ascending=False)
MultiIndex([('c', 'y', 2),
            ('b', 'z', 3),
            ('a', 'x', 1)],
           )

databricks/koalas/indexes.py

codecov-io · 2019-12-12T03:56:39Z

Codecov Report

Merging #1120 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master    #1120      +/-   ##
==========================================
+ Coverage   95.19%   95.19%   +<.01%     
==========================================
  Files          35       35              
  Lines        7071     7075       +4     
==========================================
+ Hits         6731     6735       +4     
  Misses        340      340

Impacted Files	Coverage Δ
databricks/koalas/missing/indexes.py	`100% <ø> (ø)`	⬆️
databricks/koalas/indexes.py	`96.45% <100%> (+0.07%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 468bf3a...a8dda7d. Read the comment docs.

ueshin · 2019-12-12T21:03:29Z

databricks/koalas/indexes.py

+        if isinstance(self, MultiIndex):
+            result.names = self.names
+        else:
+            result.name = self.name


Do we need this?

without this, i think we can't keep names when below case.

>>> kidx = ks.MultiIndex.from_tuples([('a', 'x', 1), ('c', 'y', 2), ('b', 'z', 3)]) >>> kidx.names = ['A', 'B', 'C'] >>> kidx.sort_values() MultiIndex([('a', 'x', 1), ('b', 'z', 3), ('c', 'y', 2)], )

ah, I see. @names.setter seems wrong.

ah, i got it.

in @names.setter, the new index_map is overwritten to self._kdf._internal, not to self._internal.

like below

self._kdf._internal = internal.copy(index_map=list(zip(internal.index_columns, names)))

at this point, i curious why we overwrite self._kdf._internal rather than simply self._internal?

For now, i've fixed it to the current implementation

HyukjinKwon

Looks fine but let me leave it to @ueshin

softagram-bot · 2019-12-13T08:39:03Z

Softagram Impact Report for pull/1120 (head commit: `8ae7722`)

⚠️ Copy paste found

ℹ️ test_indexes.py: Copy paste fragment on line 30 shared with ../test_dataframe.py, ../test_numpy_compat.py:


    @property
    def pdf(self):
        return pd.DataFrame({
            'a': [1, 2, 3, 4, 5, 6, 7, 8, 9],
            'b': [4, 5, 6, 3, 2, 1, ...(truncated 160 chars)

ℹ️ indexes.py: Copy paste fragment inside the same file on lines 720, 1163:

            raise NotImplementedError(
                \"Doesn't support symmetric_difference between Index & MultiIndex for now\")

        sdf_self = self._kdf._s...(truncated 477 chars)

Now that you are on the file, it would be easier to pay back some tech. debt.

⭐ Change Overview

(Open in Softagram Desktop for full details)

📄 Full report

Permalink: Full report for pull/1120

Impact Report explained. Give feedback on this report to [email protected]

HyukjinKwon · 2019-12-19T00:27:00Z

@itholic can you resolve conflicts?

itholic · 2019-12-19T00:31:00Z

@HyukjinKwon resolved :)

databricks/koalas/missing/indexes.py

ueshin

LGTM.

Implement sort_values for Index/MultiIndex

4f649e6

itholic commented Dec 12, 2019

View reviewed changes

databricks/koalas/indexes.py Show resolved Hide resolved

itholic mentioned this pull request Dec 12, 2019

Raise TypeError for Index/MultiIndex.sort() #1115

Merged

ueshin reviewed Dec 12, 2019

View reviewed changes

itholic added 2 commits December 13, 2019 12:53

resolve conflicts

25aca52

fix

9d75621

itholic mentioned this pull request Dec 13, 2019

Implement Index.drop_duplicates #1121

Merged

HyukjinKwon approved these changes Dec 13, 2019

View reviewed changes

resolve conflicts

8ae7722

resolve conflicts

ef14c37

ueshin reviewed Dec 19, 2019

View reviewed changes

databricks/koalas/missing/indexes.py Outdated Show resolved Hide resolved

databricks/koalas/missing/indexes.py Outdated Show resolved Hide resolved

itholic added 2 commits December 19, 2019 10:51

fix

7e717cc

add doc multiindex

a8dda7d

ueshin approved these changes Dec 19, 2019

View reviewed changes

HyukjinKwon merged commit c03b3a6 into databricks:master Dec 19, 2019

itholic deleted the i_sort_values branch December 20, 2019 04:17

ueshin mentioned this pull request Jan 22, 2020

Update index_names of _InternalFrame when Index name changed #1210

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement sort_values for Index/MultiIndex #1120

Implement sort_values for Index/MultiIndex #1120

Uh oh!

itholic commented Dec 12, 2019

Uh oh!

Uh oh!

codecov-io commented Dec 12, 2019 •

edited

Loading

Uh oh!

ueshin Dec 12, 2019

Uh oh!

itholic Dec 13, 2019

Uh oh!

ueshin Dec 13, 2019

Uh oh!

itholic Dec 13, 2019 •

edited

Loading

Uh oh!

HyukjinKwon left a comment

Uh oh!

softagram-bot commented Dec 13, 2019

Uh oh!

HyukjinKwon commented Dec 19, 2019

Uh oh!

itholic commented Dec 19, 2019

Uh oh!

Uh oh!

Uh oh!

ueshin left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Implement sort_values for Index/MultiIndex #1120

Implement sort_values for Index/MultiIndex #1120

Uh oh!

Conversation

itholic commented Dec 12, 2019

Uh oh!

Uh oh!

codecov-io commented Dec 12, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

ueshin Dec 12, 2019

Choose a reason for hiding this comment

Uh oh!

itholic Dec 13, 2019

Choose a reason for hiding this comment

Uh oh!

ueshin Dec 13, 2019

Choose a reason for hiding this comment

Uh oh!

itholic Dec 13, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

HyukjinKwon left a comment

Choose a reason for hiding this comment

Uh oh!

softagram-bot commented Dec 13, 2019

Softagram Impact Report for pull/1120 (head commit: 8ae7722)

⚠️ Copy paste found

⭐ Change Overview

📄 Full report

Uh oh!

HyukjinKwon commented Dec 19, 2019

Uh oh!

itholic commented Dec 19, 2019

Uh oh!

Uh oh!

Uh oh!

ueshin left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

codecov-io commented Dec 12, 2019 •

edited

Loading

itholic Dec 13, 2019 •

edited

Loading

Softagram Impact Report for pull/1120 (head commit: `8ae7722`)