BUG: DataFrame.to_html validates formatters has the correct length #28632

guipleite · 2019-09-26T12:18:52Z

closes DataFrame.to_html should validate that formatters has the correct length #28469
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

WillAyd

Looks pretty good - general comments

doc/source/whatsnew/v1.0.0.rst

WillAyd · 2019-09-27T15:50:47Z

pandas/io/formats/format.py

@@ -561,7 +561,19 @@ def __init__(
        self.sparsify = sparsify

        self.float_format = float_format
-        self.formatters = formatters if formatters is not None else {}
+        if formatters is not None and (


Can you use is_list_like from pandas._libs.lib here instead? I think should help simplify the logic

Not sure if it will simplify it because is_list_like returns True to both dictionaries and lists, however we only want it to enter in this statement if formatters is a list.

I think this can be simplified if you just precede this condition with:

if formatters is None: formatters = {}

And then go through the conditions. Right now there is a lot of duplication of conditions which makes it tougher to reason about

Made an adjustment, is it better now?

@WillAyd

"You can use a protocol class with isinstance() ..." "isinstance() also works with the predefined protocols in typing such as Iterable." https://mypy.readthedocs.io/en/latest/protocols.html#using-isinstance-with-protocols

We could therefore use isinstance checks that match the types added to the function signatures.

in this case the logic could be as simple as..

if isinstance(formatters, Sequence) and len(frame.columns) != len(formatters): msg = ( "Formatters length({flen}) should match" " DataFrame number of columns({dlen})" ).format(flen=len(formatters), dlen=len(frame.columns)) raise ValueError(msg) self.formatters = formatters if formatters is not None else {}

thoughts?

Looks good for me and it works!

we don't use this pattern in the codebase, so don't make these changes in this PR. it's a discussion point going forward.

also, although mentioned previously, formatters type is currently Union[List[Callable], Tuple[Callable, ...], Dict[Union[str, int], Callable] since the code uses isinstance with list, tuple and dict for flow control.

going forward, formatters type could be as permissive as Union[Sequence[Callable], Mapping[Union[str, int], Callable]

and then the protocol based isinstance checks would be more applicable.

pandas/tests/io/formats/test_to_html.py

WillAyd · 2019-10-01T03:57:33Z

pandas/io/formats/format.py

@@ -561,7 +561,19 @@ def __init__(
        self.sparsify = sparsify

        self.float_format = float_format
-        self.formatters = formatters if formatters is not None else {}
+        if formatters is not None and (


I think this can be simplified if you just precede this condition with:

if formatters is None: formatters = {}

And then go through the conditions. Right now there is a lot of duplication of conditions which makes it tougher to reason about

pandas/tests/io/formats/test_to_html.py

pandas/io/formats/format.py

pandas/tests/io/formats/test_to_html.py

jreback · 2019-10-05T22:43:53Z

pandas/io/formats/format.py

+                    "Formatters length({flen}) should match"
+                    + " DataFrame number of columns({dlen})"
+                ).format(flen=len(formatters), dlen=len(frame.columns))
+            )


this logic is quite complicated, can you not do

if formaters is not None and not do_len_comparision: raise.... self.formatters = formatters or {}

?

Is it better now?

pandas/io/formats/format.py

Co-Authored-By: Simon Hawkins <[email protected]> general correct

simonjayhawkins

lgtm. @WillAyd @jreback

WillAyd · 2019-10-07T15:01:11Z

Thanks @guipleite !

…andas-dev#28632)

WillAyd added the IO HTML read_html, to_html, Styler.apply, Styler.applymap label Sep 26, 2019

WillAyd requested changes Sep 27, 2019

View reviewed changes

WillAyd requested changes Oct 1, 2019

View reviewed changes

simonjayhawkins reviewed Oct 5, 2019

View reviewed changes

pandas/tests/io/formats/test_to_html.py Outdated Show resolved Hide resolved

simonjayhawkins reviewed Oct 5, 2019

View reviewed changes

pandas/io/formats/format.py Outdated Show resolved Hide resolved

simonjayhawkins reviewed Oct 5, 2019

View reviewed changes

pandas/tests/io/formats/test_to_html.py Outdated Show resolved Hide resolved

jreback requested changes Oct 5, 2019

View reviewed changes

simonjayhawkins reviewed Oct 7, 2019

View reviewed changes

pandas/io/formats/format.py Outdated Show resolved Hide resolved

gabriellm1 added 2 commits October 7, 2019 10:48

Co-Authored-By: William Ayd <[email protected]>

14fb527

Co-Authored-By: Simon Hawkins <[email protected]> general correct

black pandas formatting

eb144eb

TomAugspurger approved these changes Oct 7, 2019

View reviewed changes

simonjayhawkins approved these changes Oct 7, 2019

View reviewed changes

WillAyd added this to the 1.0 milestone Oct 7, 2019

WillAyd approved these changes Oct 7, 2019

View reviewed changes

WillAyd merged commit 9db7f3e into pandas-dev:master Oct 7, 2019

gabriellm1 deleted the issue-28469 branch October 19, 2019 01:22

proost pushed a commit to proost/pandas that referenced this pull request Dec 19, 2019

BUG: DataFrame.to_html validates formatters has the correct length (p…

8fd931f

…andas-dev#28632)

proost pushed a commit to proost/pandas that referenced this pull request Dec 19, 2019

BUG: DataFrame.to_html validates formatters has the correct length (p…

3a36b15

…andas-dev#28632)

bongolegend pushed a commit to bongolegend/pandas that referenced this pull request Jan 1, 2020

BUG: DataFrame.to_html validates formatters has the correct length (p…

9ceecff

…andas-dev#28632)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: DataFrame.to_html validates formatters has the correct length #28632

BUG: DataFrame.to_html validates formatters has the correct length #28632

guipleite commented Sep 26, 2019 •

edited

Loading

WillAyd left a comment

WillAyd Sep 27, 2019

guipleite Sep 29, 2019

WillAyd Oct 1, 2019

gabriellm1 Oct 6, 2019

simonjayhawkins Oct 7, 2019

gabriellm1 Oct 7, 2019

simonjayhawkins Oct 7, 2019

simonjayhawkins Oct 7, 2019

WillAyd Oct 1, 2019

jreback Oct 5, 2019

gabriellm1 Oct 6, 2019

simonjayhawkins left a comment

WillAyd commented Oct 7, 2019

BUG: DataFrame.to_html validates formatters has the correct length #28632

BUG: DataFrame.to_html validates formatters has the correct length #28632

Conversation

guipleite commented Sep 26, 2019 • edited Loading

WillAyd left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

simonjayhawkins left a comment

Choose a reason for hiding this comment

WillAyd commented Oct 7, 2019

guipleite commented Sep 26, 2019 •

edited

Loading