Skip to content

Conversation

@charlesdong1991
Copy link
Contributor

@charlesdong1991 charlesdong1991 commented Nov 5, 2019

@HyukjinKwon
Copy link
Member

Nice! Thanks for working on this!

@charlesdong1991 charlesdong1991 marked this pull request as ready for review November 11, 2019 20:46
@charlesdong1991
Copy link
Contributor Author

haven't come up with a nice and robust solution for allowing ingore_index to be True, any suggestions? @ueshin

@codecov-io
Copy link

codecov-io commented Nov 12, 2019

Codecov Report

Merging #1009 into master will decrease coverage by 0.1%.
The diff coverage is 96.15%.

Impacted file tree graph

@@            Coverage Diff            @@
##           master   #1009      +/-   ##
=========================================
- Coverage    95.2%   95.1%   -0.11%     
=========================================
  Files          34      34              
  Lines        6883    6779     -104     
=========================================
- Hits         6553    6447     -106     
- Misses        330     332       +2
Impacted Files Coverage Δ
databricks/koalas/namespace.py 87.79% <96.15%> (+0.44%) ⬆️
databricks/koalas/indexing.py 94.38% <0%> (-0.51%) ⬇️
databricks/koalas/base.py 94.9% <0%> (-0.42%) ⬇️
databricks/koalas/frame.py 96.49% <0%> (-0.26%) ⬇️
databricks/koalas/series.py 96.41% <0%> (-0.13%) ⬇️
databricks/koalas/utils.py 98.12% <0%> (-0.05%) ⬇️
databricks/koalas/missing/frame.py 100% <0%> (ø) ⬆️
databricks/koalas/missing/indexes.py 100% <0%> (ø) ⬆️
databricks/koalas/missing/window.py 100% <0%> (ø) ⬆️
... and 4 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e7170cb...f672805. Read the comment docs.

new_objs.append(obj)
objs = new_objs

if axis == 1:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For ignore_index, I think we can explicitly throw an exception for now when axis is 1.

if axis in [0, 'index']:
axis = 0
elif axis in [1, 'columns']:
axis = 1
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems conflict was by 6ece69d

sdfs = internal._sdf
continue
col = '{}'.format(index_col)
sdfs = sdfs.join(obj._internal._sdf,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@charlesdong1991, can we do this via each column assignment? It can leverage existing logics for compute.ops_on_diff_frames (see config.py)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Very sorry to wait such long to get back to this PR @HyukjinKwon

I am not sure if i fully understand here, if via each column assignment, doesn't it mean the ops become more expensive? I made a slight change, feel free to review and will appreciate your feedback and suggestions! I got through tough time already, so will have time to make code changes much quicker.

@charlesdong1991
Copy link
Contributor Author

Many thanks for your comments @HyukjinKwon

I am very sorry that I have been very busy recently and don't have time for my existing PRs. I will try my best to update the PR following your comments this weekend!

Thanks again for your understanding and patience!

@HyukjinKwon
Copy link
Member

Hey, @charlesdong1991 no worries at all. Thanks for trying this change out.

@softagram-bot
Copy link

Softagram Impact Report for pull/1009 (head commit: f672805)

⭐ Change Overview

Showing the changed files, dependency changes and the impact - click for full size
(Open in Softagram Desktop for full details)

📄 Full report

Impact Report explained. Give feedback on this report to [email protected]

HyukjinKwon added a commit that referenced this pull request Mar 20, 2020
This PR proposes to add `axis` at `ks.concat(...)`.

```python
import databricks.koalas as ks

df1 = ks.DataFrame([['a', 1], ['b', 2]], columns=['letter', 'number'])
df2 = ks.DataFrame([['bird', 'polly'], ['monkey', 'george']], columns=['animal', 'name'])
ks.concat([df1, df2], axis=1)
```

```
  letter  number  animal    name
0      a       1    bird   polly
1      b       2  monkey  george
```

Resolves #625, Closes #1009.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants