Implement Index.drop_duplicates #1121

RainFung · 2019-12-12T03:22:30Z

Implement Index.drop_duplicates by using spark drop_duplicates API without keep parameter

databricks/koalas/indexes.py

codecov-io · 2019-12-12T03:55:21Z

Codecov Report

Merging #1121 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master    #1121      +/-   ##
==========================================
+ Coverage   95.17%   95.17%   +<.01%     
==========================================
  Files          35       35              
  Lines        7048     7051       +3     
==========================================
+ Hits         6708     6711       +3     
  Misses        340      340

Impacted Files	Coverage Δ
databricks/koalas/missing/indexes.py	`100% <ø> (ø)`	⬆️
databricks/koalas/indexes.py	`96.64% <100%> (+0.06%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update d435072...e87f3a6. Read the comment docs.

databricks/koalas/indexes.py

HyukjinKwon

Looks fine except https://github.com/databricks/koalas/pull/1121/files#r357498274

HyukjinKwon · 2019-12-16T12:40:10Z

docs/source/reference/indexing.rst

   Index.is_interval
   Index.is_numeric
   Index.is_object
+   Index.drop_duplicates


Sorry, can you add drop_duplicates at MultiIndex too?

MultiIndex.drop_duplicates has been deprecated in pandas 0.26 doc.https://dev.pandas.io/docs/reference/indexing.html

softagram-bot · 2019-12-17T08:38:01Z

Softagram Impact Report for pull/1121 (head commit: `e87f3a6`)

⚠️ Copy paste found

ℹ️ indexes.py: Copy paste fragment inside the same file on lines 757, 1136:

            raise NotImplementedError(
                \"Doesn't support symmetric_difference between Index & MultiIndex for now\")

        sdf_self = self._kdf._s...(truncated 477 chars)

ℹ️ test_indexes.py: Copy paste fragment on line 30 shared with ../test_dataframe.py:


    @property
    def pdf(self):
        return pd.DataFrame({
            'a': [1, 2, 3, 4, 5, 6, 7, 8, 9],
            'b': [4, 5, 6, 3, 2, 1, ...(truncated 160 chars)

ℹ️ test_indexes.py: Copy paste fragment on line 32 shared with ../test_dataframe.py, ../test_numpy_compat.py:

    def pdf(self):
        return pd.DataFrame({
            'a': [1, 2, 3, 4, 5, 6, 7, 8, 9],
            'b': [4, 5, 6, 3, 2, 1, 0, 0, 0],
        }, index=[0, 1, 3, 5, 6, 8, 9, 9, 9])...(truncated 105 chars)

Now that you are on the file, it would be easier to pay back some tech. debt.

⭐ Change Overview

(Open in Softagram Desktop for full details)

📄 Full report

Permalink: Full report for pull/1121

Impact Report explained. Give feedback on this report to [email protected]

fengyurun added 2 commits December 12, 2019 11:20

first commit

7a6040d

fix parameter docs

c488d59

itholic reviewed Dec 12, 2019

View reviewed changes

databricks/koalas/indexes.py Outdated Show resolved Hide resolved

itholic reviewed Dec 12, 2019

View reviewed changes

databricks/koalas/indexes.py Outdated Show resolved Hide resolved

fix pd to ks and keep index name

6c66bc6

ueshin reviewed Dec 12, 2019

View reviewed changes

databricks/koalas/indexes.py Outdated Show resolved Hide resolved

databricks/koalas/indexes.py Outdated Show resolved Hide resolved

remove unnecessary code

f47a5b4

itholic reviewed Dec 13, 2019

View reviewed changes

databricks/koalas/indexes.py Outdated Show resolved Hide resolved

HyukjinKwon approved these changes Dec 13, 2019

View reviewed changes

fengyurun and others added 2 commits December 14, 2019 11:41

fix self index map

10fe03a

Merge branch 'master' into Index.drop_duplicates

7339d7b

HyukjinKwon reviewed Dec 16, 2019

View reviewed changes

Merge branch 'master' into Index.drop_duplicates

e87f3a6

HyukjinKwon merged commit a44e734 into databricks:master Dec 17, 2019

HyukjinKwon approved these changes Dec 17, 2019

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement Index.drop_duplicates #1121

Implement Index.drop_duplicates #1121

Uh oh!

RainFung commented Dec 12, 2019

Uh oh!

Uh oh!

Uh oh!

codecov-io commented Dec 12, 2019 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

HyukjinKwon left a comment

Uh oh!

HyukjinKwon Dec 16, 2019

Uh oh!

RainFung Dec 17, 2019 •

edited

Loading

Uh oh!

HyukjinKwon Dec 17, 2019

Uh oh!

softagram-bot commented Dec 17, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Implement Index.drop_duplicates #1121

Implement Index.drop_duplicates #1121

Uh oh!

Conversation

RainFung commented Dec 12, 2019

Uh oh!

Uh oh!

Uh oh!

codecov-io commented Dec 12, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Uh oh!

HyukjinKwon left a comment

Choose a reason for hiding this comment

Uh oh!

HyukjinKwon Dec 16, 2019

Choose a reason for hiding this comment

Uh oh!

RainFung Dec 17, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

HyukjinKwon Dec 17, 2019

Choose a reason for hiding this comment

Uh oh!

softagram-bot commented Dec 17, 2019

Softagram Impact Report for pull/1121 (head commit: e87f3a6)

⚠️ Copy paste found

⭐ Change Overview

📄 Full report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

codecov-io commented Dec 12, 2019 •

edited

Loading

RainFung Dec 17, 2019 •

edited

Loading

Softagram Impact Report for pull/1121 (head commit: `e87f3a6`)