Implement Series.combine_first #1290

itholic · 2020-02-18T02:10:42Z

Basic example

>>> s1 = ks.Series([1, np.nan])
>>> s2 = ks.Series([3, 4])
>>> s1.combine_first(s2)
0    1.0
1    4.0
Name: 0, dtype: float64

MultiIndex

>>> midx1 = pd.MultiIndex([['lama', 'cow', 'falcon', 'koala'],
...                        ['speed', 'weight', 'length', 'power']],
...                       [[0, 3, 1, 1, 1, 2, 2, 2],
...                        [0, 2, 0, 3, 2, 0, 1, 3]])
>>> midx2 = pd.MultiIndex([['lama', 'cow', 'falcon'],
...                        ['speed', 'weight', 'length']],
...                       [[0, 0, 0, 1, 1, 1, 2, 2, 2],
...                        [0, 1, 2, 0, 1, 2, 0, 1, 2]])
>>> kser1 = ks.Series([45, 200, 1.2, 30, 250, 1.5, 320, 1], index=midx1)
>>> kser2 = ks.Series([-45, 200, -1.2, 30, -250, 1.5, 320, 1, -0.3], index=midx2)
>>> kser1
lama    speed      45.0
koala   length    200.0
cow     speed       1.2
        power      30.0
        length    250.0
falcon  speed       1.5
        weight    320.0
        power       1.0
Name: 0, dtype: float64
>>> kser2
lama    speed     -45.0
        weight    200.0
        length     -1.2
cow     speed      30.0
        weight   -250.0
        length      1.5
falcon  speed     320.0
        weight      1.0
        length     -0.3
Name: 0, dtype: float64

>>> kser1.combine_first(kser2)
cow     length    250.0
        power      30.0
        speed       1.2
        weight   -250.0
falcon  length     -0.3
        power       1.0
        speed       1.5
        weight    320.0
koala   length    200.0
lama    length     -1.2
        speed      45.0
        weight    200.0
Name: 0, dtype: float64
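The behaviour in both examples above can be summarized in a small pure-Python sketch. It only illustrates the semantics visible in the outputs (union of the two indexes, values of the calling Series preferred, gaps filled from the other Series, result ordered by index label). It is not the Koalas implementation, and `combine_first_sketch` is a hypothetical helper that models a Series as a dict from index label to value.

```python
import math


def combine_first_sketch(first, other):
    """Model combine_first on dicts that map index label -> value."""

    def is_missing(value):
        return value is None or (isinstance(value, float) and math.isnan(value))

    result = {}
    # Walk the sorted union of both indexes, as in the sorted output above.
    for label in sorted(set(first) | set(other)):
        value = first.get(label)
        if is_missing(value):
            # Fall back to `other` only where `first` has no usable value.
            value = other.get(label, math.nan)
        result[label] = value
    return result


# Flat index, mirroring the basic example.
s1 = {0: 1.0, 1: math.nan}
s2 = {0: 3.0, 1: 4.0}
print(combine_first_sketch(s1, s2))  # {0: 1.0, 1: 4.0}

# MultiIndex labels are modelled as tuples (a subset of the data above).
k1 = {("lama", "speed"): 45.0, ("koala", "length"): 200.0}
k2 = {("lama", "speed"): -45.0, ("lama", "weight"): 200.0}
print(combine_first_sketch(k1, k2))
```

In the tuple case, `("lama", "speed")` keeps 45.0 from the first mapping, and `("lama", "weight")` is taken from the second because the first has no such label.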

itholic · 2020-02-18T02:18:01Z

I considered putting Series.combine and Series.combine_first in a single PR, but their implementations turned out to differ much more than I expected, so I separated them.
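For readers unfamiliar with the other method: in pandas, `Series.combine` applies a caller-supplied function to each aligned pair of values, using `fill_value` where a label is missing from one side. The pure-Python sketch below only illustrates that semantic difference from the fixed "prefer the first value" rule of `combine_first`. It says nothing about how either method is implemented in Koalas, and `combine_sketch` is a hypothetical helper that models a Series as a dict.

```python
import math


def combine_sketch(first, other, func, fill_value=math.nan):
    """Apply `func` to every aligned pair of values (Series.combine style)."""
    labels = sorted(set(first) | set(other))
    return {
        # A label missing from one side is replaced by `fill_value`.
        label: func(first.get(label, fill_value), other.get(label, fill_value))
        for label in labels
    }


s1 = {"falcon": 330.0, "eagle": 160.0}
s2 = {"falcon": 345.0, "eagle": 200.0, "duck": 30.0}

# The caller chooses the rule; here the larger value of each pair wins.
print(combine_sketch(s1, s2, max, fill_value=0.0))
# {'duck': 30.0, 'eagle': 200.0, 'falcon': 345.0}
```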

codecov-io · 2020-03-01T03:57:26Z

Codecov Report

Merging #1290 into master will increase coverage by 0.01%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #1290      +/-   ##
==========================================
+ Coverage   95.25%   95.26%   +0.01%     
==========================================
  Files          34       34              
  Lines        7541     7559      +18     
==========================================
+ Hits         7183     7201      +18     
  Misses        358      358
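As a quick sanity check of the table above, the percentages can be reproduced from the Hits and Lines rows. This is only a sketch of the arithmetic, assuming coverage is hits divided by lines; `coverage_percent` is a hypothetical helper, not part of Codecov.

```python
def coverage_percent(hits, lines):
    """Return line coverage as a percentage."""
    return 100.0 * hits / lines


before = coverage_percent(7183, 7541)  # master
after = coverage_percent(7201, 7559)   # with #1290 merged
print(f"{before:.2f}% -> {after:.2f}% ({after - before:+.2f}%)")
# 95.25% -> 95.26% (+0.01%)
```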

| Impacted Files | Coverage Δ | |
|---|---|---|
| databricks/koalas/missing/series.py | `100.00% <ø> (ø)` | |
| databricks/koalas/series.py | `96.86% <100.00%> (+0.07%)` | ⬆️ |

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 45c325b...c4fb5d0.

databricks/koalas/series.py

HyukjinKwon · 2020-03-16T03:51:39Z

I think you should rebase and sync to the current master, @itholic.

databricks/koalas/series.py

HyukjinKwon · 2020-03-16T04:59:24Z

Looks fine otherwise.

itholic added 2 commits February 18, 2020 11:03

Implement Series.combine_first

b84e2c9

Add failure test

06ed257

Resolve conflicts

b10cdbc

itholic added 2 commits March 1, 2020 16:28

Applying Black

910a8d2

Resolve conflicts

6ed35ae

HyukjinKwon reviewed Mar 5, 2020

databricks/koalas/series.py (resolved)

HyukjinKwon reviewed Mar 5, 2020

databricks/koalas/series.py (outdated, resolved)

itholic added 5 commits March 5, 2020 21:44

Adding case when Series come from same DataFrame

baa87e8

Resolve conflicts

bfaf51b

Move test from ops_on_diff to series

53cb168

Empty commit for build test

f134976

Resolve conflicts

4dd3b83

HyukjinKwon reviewed Mar 9, 2020

databricks/koalas/series.py (resolved)

HyukjinKwon reviewed Mar 9, 2020

databricks/koalas/series.py (outdated, resolved)

HyukjinKwon reviewed Mar 9, 2020

databricks/koalas/series.py (resolved)

itholic added 2 commits March 16, 2020 10:59

Resolve conflicts

338f691

scols -> spark_columns

3551c67

Rebase to Master

d156f36

HyukjinKwon reviewed Mar 16, 2020

databricks/koalas/series.py (resolved)

HyukjinKwon reviewed Mar 16, 2020

databricks/koalas/series.py (outdated, resolved)

fix all comments

c4fb5d0

HyukjinKwon approved these changes Mar 25, 2020

HyukjinKwon merged commit d4012b6 into databricks:master Mar 25, 2020

itholic deleted the s_combine_first branch March 25, 2020 13:57
