@HyukjinKwon
Member

This PR proposes adding DataFrame.info, which prints a concise summary of a DataFrame:

>>> import databricks.koalas as ks
>>> ks.range(100).info()
<class 'databricks.koalas.frame.DataFrame'>
Index: 100 entries, 0 to 99
Data columns (total 1 columns):
id    100 non-null int64

Resolves #872
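
For readers unfamiliar with the layout of the summary above, the following is a minimal pure-Python sketch of how such text can be assembled. It is not the Koalas implementation: the function name `info_summary`, the dict-of-lists input, and the hard-coded `int64` dtype label are all assumptions made for illustration.

```python
from typing import Dict, List, Optional


def info_summary(columns: Dict[str, List[Optional[int]]],
                 class_name: str = "DataFrame") -> str:
    """Build an info()-style summary for equal-length integer columns.

    Hypothetical helper: it only mimics the text layout shown in the PR
    description and assumes every column holds ints (or None for nulls).
    """
    lengths = {len(values) for values in columns.values()}
    if len(lengths) > 1:
        raise ValueError("all columns must have the same length")
    n_rows = lengths.pop() if lengths else 0

    lines = ["<class '{}'>".format(class_name)]
    if n_rows:
        # Assumes a default 0-based integer index.
        lines.append("Index: {} entries, 0 to {}".format(n_rows, n_rows - 1))
    else:
        lines.append("Index: 0 entries")
    lines.append("Data columns (total {} columns):".format(len(columns)))

    # Pad column names to a common width so the counts line up.
    width = max((len(name) for name in columns), default=0)
    for name, values in columns.items():
        non_null = sum(value is not None for value in values)
        lines.append("{}    {} non-null int64".format(name.ljust(width), non_null))
    return "\n".join(lines)


summary = info_summary({"id": list(range(100))},
                       class_name="databricks.koalas.frame.DataFrame")
print(summary)
```

With a single `id` column of 100 non-null integers, the sketch reproduces the four lines shown in the description.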

@HyukjinKwon HyukjinKwon requested a review from ueshin December 12, 2019 10:45
@HyukjinKwon HyukjinKwon force-pushed the df-info branch 3 times, most recently from d2a2eb0 to d7f3821 Compare December 12, 2019 12:06
... _ = f.write(s)
"""
# To avoid pandas' existing config affects Koalas.
# TODO: should we have corresponding Koalas configs?
Collaborator

Yeah, we should in a separate PR.
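
The code comment under review is about keeping a user's existing pandas display settings from changing what Koalas prints. One common way to get that isolation is to override the relevant options only for the duration of a call and restore them afterwards (pandas offers `option_context` for this). The sketch below shows the pattern with the standard library only; the `OPTIONS` dict, `option_override`, and `describe_columns` are hypothetical names, and this is not presented as the code in this PR.

```python
from contextlib import contextmanager
from typing import Any, Dict, Iterator

# Hypothetical global display options, standing in for a library-wide config.
OPTIONS: Dict[str, Any] = {"max_info_columns": 100}


@contextmanager
def option_override(**overrides: Any) -> Iterator[None]:
    """Temporarily set options, restoring the previous values on exit."""
    unknown = set(overrides) - set(OPTIONS)
    if unknown:
        raise KeyError("unknown options: {}".format(sorted(unknown)))
    saved = {key: OPTIONS[key] for key in overrides}
    OPTIONS.update(overrides)
    try:
        yield
    finally:
        # Runs even if the body raises, so the user's settings always come back.
        OPTIONS.update(saved)


def describe_columns(n_columns: int) -> str:
    """Say whether a per-column listing would be shown under current options."""
    if n_columns <= OPTIONS["max_info_columns"]:
        return "verbose"
    return "truncated"


with option_override(max_info_columns=10**9):
    inside = describe_columns(500)   # the user's limit is ignored here
outside = describe_columns(500)      # the user's limit applies again
print(inside, outside)
```

Inside the `with` block the wide frame is listed in full regardless of the global setting; once the block exits, the original limit is back in force.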

Collaborator

@ueshin ueshin left a comment

LGTM, pending tests.

@codecov-io

codecov-io commented Dec 13, 2019

Codecov Report

Merging #1124 into master will decrease coverage by <.01%.
The diff coverage is 93.33%.

@@            Coverage Diff            @@
##           master   #1124      +/-   ##
=========================================
- Coverage   95.11%   95.1%   -0.01%     
=========================================
  Files          35      35              
  Lines        7019    7033      +14     
=========================================
+ Hits         6676    6689      +13     
- Misses        343     344       +1
| Impacted Files | Coverage Δ | |
|---|---|---|
| databricks/koalas/missing/frame.py | 100% <ø> (ø) | ⬆️ |
| databricks/koalas/frame.py | 96.82% <100%> (+0.01%) | ⬆️ |
| databricks/koalas/indexes.py | 96.24% <88.88%> (-0.26%) | ⬇️ |
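
The headline figures in the coverage diff can be re-derived from the hit and line counts it reports. The short check below is a sketch that assumes coverage is simply hits divided by lines; the function name `coverage_percent` is made up for illustration, and how Codecov rounds the displayed values is not covered here.

```python
def coverage_percent(hits: int, lines: int) -> float:
    """Coverage as a percentage, assuming coverage = hits / lines."""
    return 100.0 * hits / lines


# Counts copied from the coverage diff in the report.
master_hits, master_misses, master_lines = 6676, 343, 7019
pr_hits, pr_misses, pr_lines = 6689, 344, 7033

# Sanity check: hits and misses should add up to the line totals.
assert master_hits + master_misses == master_lines
assert pr_hits + pr_misses == pr_lines

master_cov = coverage_percent(master_hits, master_lines)
pr_cov = coverage_percent(pr_hits, pr_lines)
delta = pr_cov - master_cov

print("master: {:.4f}%".format(master_cov))
print("PR:     {:.4f}%".format(pr_cov))
print("delta:  {:+.4f} percentage points".format(delta))
```

The 14 added lines bring 13 new hits and 1 new miss, so the raw change is roughly -0.0045 percentage points, which agrees with the "decrease coverage by <.01%" statement in the report.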

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 0af6c1c...89f5b2d.

@softagram-bot

Softagram Impact Report for pull/1124 (head commit: 89f5b2d)

⚠️ Copy paste found

ℹ️ frame.py: Copy paste fragment on line 5771 shared with ../namespace.py:

              on: Union[str, List[str], Tuple[str, ...], List[Tuple[str, ...]]] = None,
              left_on: Union[str, List[str], Tuple[s...(truncated 273 chars)

ℹ️ frame.py: Copy paste fragment inside the same file on lines 7269, 7352:


        # TODO: there is a similar logic to transpose in, for instance,
        #  DataFrame.any, Series.quantile. Maybe ...(truncated 1065 chars)

ℹ️ frame.py: Copy paste fragment inside the same file on lines 4900, 4921:

            sdf = self._sdf.select(
                self._internal.index_scols +
                [self._internal.scol_for(idx...(truncated 466 chars)

ℹ️ indexes.py: Copy paste fragment inside the same file on lines 720, 1091:

            raise NotImplementedError(
                "Doesn't support symmetric_difference between Index & MultiIndex for now")

        sdf_self = self._kdf._s...(truncated 477 chars)

Now that you are on the file, it would be easier to pay back some tech. debt.

@HyukjinKwon HyukjinKwon merged commit 894f813 into databricks:master Dec 13, 2019
@HyukjinKwon HyukjinKwon deleted the df-info branch September 11, 2020 07:52