Fix DataFrame.to_string() justification (2) #22505

gshiba · 2018-08-25T14:24:20Z

closes to_string formatters not as expected when header=False #16839,
closes Justification is broken with to_string(index=False) #13032
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

'Competes' with #22437 which attempts to revert % d to %d as suggested here: #13032 (comment) That turned out to affect a lot of tests, which in hindsight is expected; the % d has been around since at least 2012 (106fe99).

Instead, this PR reverts parts of #11942 and embraces the leading space even when index=False. df.to_string(index=False) will print the leading space when the first column is positive only, as well as preserve leading/trailing spaces on first/last lines.

With the following code:

import pandas as pd
def wrap_to_string(df, **kwargs):
    s = df.to_string(**kwargs)
    print(str(kwargs).center(25, '-'))
    for i, line in enumerate(s.split('\n')):
        print(f'^{line}$-{i}')
    print()
df = pd.DataFrame({'w': [1, 2], 'x': [3, -4], 'y': [555, 666],
                   'z': [777, -888], 'a': ['AAA', '   ']})
cols_ = list(map(list, ['wxyza', 'xyzaw', 'yzawx', 'zawxy', 'awxyz']))
for cols in cols_:
    wrap_to_string(df[cols], index=False)

Output with master:

-----{'index': False}----  # last cell (three spaces) disappeared
^w  x    y    z    a$-0
^1  3  555  777  AAA$-1
^2 -4  666 -888$-2

-----{'index': False}----  # misaligned
^x    y    z    a  w$-0
^3  555  777  AAA  1$-1
^-4  666 -888       2$-2

-----{'index': False}----  # misaligned
^y    z    a  w  x$-0
^555  777  AAA  1  3$-1
^666 -888       2 -4$-2

-----{'index': False}----  # misaligned
^z    a  w  x    y$-0
^777  AAA  1  3  555$-1
^-888       2 -4  666$-2

-----{'index': False}----  # misaligned
^a  w  x    y    z$-0
^AAA  1  3  555  777$-1
^     2 -4  666 -888$-2

Output with this PR:

-----{'index': False}----
^ w  x    y    z    a$-0
^ 1  3  555  777  AAA$-1
^ 2 -4  666 -888     $-2

-----{'index': False}----
^ x    y    z    a  w$-0
^ 3  555  777  AAA  1$-1
^-4  666 -888       2$-2

-----{'index': False}----
^   y    z    a  w  x$-0
^ 555  777  AAA  1  3$-1
^ 666 -888       2 -4$-2

-----{'index': False}----
^   z    a  w  x    y$-0
^ 777  AAA  1  3  555$-1
^-888       2 -4  666$-2

-----{'index': False}----
^   a  w  x    y    z$-0
^ AAA  1  3  555  777$-1
^      2 -4  666 -888$-2

Similar effect on Series as well.

gfyoung · 2018-08-25T18:09:23Z

pandas/tests/io/formats/test_format.py

+
+        for df, expected in zip(dfs, exs):
+            df_s = df.to_string(index=False)
+            assert df_s == expected


Definitely should use pytest.mark.parametrize for this.

I updated the code style to match other tests in the same file.

gfyoung · 2018-08-25T18:09:36Z

cc @datapythonista

codecov · 2018-08-25T21:54:12Z

Codecov Report

Merging #22505 into master will decrease coverage by 0.15%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master   #22505      +/-   ##
==========================================
- Coverage   92.18%   92.03%   -0.16%     
==========================================
  Files         169      169              
  Lines       50820    50778      -42     
==========================================
- Hits        46850    46735     -115     
- Misses       3970     4043      +73

Flag	Coverage Δ
#multiple	`90.44% <100%> (-0.16%)`	⬇️
#single	`42.22% <0%> (-0.16%)`	⬇️

Impacted Files	Coverage Δ
pandas/io/formats/format.py	`98.35% <100%> (-0.01%)`	⬇️
pandas/io/formats/console.py	`65.15% <0%> (-10.61%)`	⬇️
pandas/errors/__init__.py	`92.3% <0%> (-7.7%)`	⬇️
pandas/core/dtypes/base.py	`92.68% <0%> (-7.32%)`	⬇️
pandas/core/arrays/base.py	`88% <0%> (-6.25%)`	⬇️
pandas/io/html.py	`89.17% <0%> (-2.08%)`	⬇️
pandas/io/parquet.py	`71.79% <0%> (-1.94%)`	⬇️
pandas/io/formats/html.py	`88.81% <0%> (-1.87%)`	⬇️
pandas/core/apply.py	`96.75% <0%> (-1.86%)`	⬇️
pandas/core/arrays/datetimelike.py	`94.02% <0%> (-1.52%)`	⬇️
... and 51 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 64b88e8...cc86cd7. Read the comment docs.

datapythonista · 2018-09-23T13:28:10Z

lgtm, but I think the changelog needs to be moved to 0.24.0.

@jreback can you take a look and see if you're happy with this?

jreback

change looks ok

jreback · 2018-09-23T13:29:58Z

doc/source/whatsnew/v0.23.5.txt

@@ -49,3 +49,5 @@ Bug Fixes
 **I/O**

 - Bug in :func:`read_csv` that caused it to raise ``OverflowError`` when trying to use 'inf' as ``na_value`` with integer index column (:issue:`17128`)
+- Bug in :func:`to_string(index=False)` that broke column alignment (:issue:`16839`, :issue:`13032`)


move to 0.24.0

can you make this more explicit, e.g. say what cases it is fixing.

jreback · 2018-09-23T13:31:36Z

pandas/tests/io/formats/test_format.py

        assert df_s == expected

    def test_to_string_line_width_no_index(self):
        df = DataFrame({'x': [1, 2, 3], 'y': [4, 5, 6]})

        df_s = df.to_string(line_width=1, index=False)
-        expected = "x  \\\n1   \n2   \n3   \n\ny  \n4  \n5  \n6"


can you add a comment where the issues that are closed

pep8speaks · 2018-09-24T04:32:43Z

Hello @gshiba! Thanks for updating the PR.

There are no PEP8 issues in the file pandas/io/formats/format.py !
There are no PEP8 issues in the file pandas/tests/io/formats/test_format.py !

datapythonista

lgtm, thanks for the fix @gshiba

jreback · 2018-09-25T12:55:08Z

thanks @gshiba

gshiba mentioned this pull request Aug 25, 2018

Fix DataFrame.to_string() justification #22437

Closed

4 tasks

gfyoung added Bug IO Data IO issues that don't fit into a more specific label Output-Formatting __repr__ of pandas objects, to_string labels Aug 25, 2018

gfyoung reviewed Aug 25, 2018

View reviewed changes

jreback added this to the 0.24.0 milestone Sep 23, 2018

jreback requested changes Sep 23, 2018

View reviewed changes

gshiba added 7 commits September 23, 2018 21:33

Fix justifcation; update tests

abee072

Update/add tests

e90320e

Fix Series too

5b4a89c

Match style to other tests in this module

c539f24

Add to What's new

cc1e14c

Fix typo

8e96e64

Complete change requests

cc86cd7

gshiba force-pushed the fix-to-string2 branch from d4ac415 to cc86cd7 Compare September 24, 2018 04:36

datapythonista approved these changes Sep 24, 2018

View reviewed changes

jreback approved these changes Sep 25, 2018

View reviewed changes

jreback merged commit 30b942a into pandas-dev:master Sep 25, 2018

Sup3rGeo pushed a commit to Sup3rGeo/pandas that referenced this pull request Oct 1, 2018

Fix DataFrame.to_string() justification (2) (pandas-dev#22505)

6d3ea39

TomAugspurger mentioned this pull request Jan 29, 2019

BUG: on .to_string(index=False) #25000

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix DataFrame.to_string() justification (2) #22505

Fix DataFrame.to_string() justification (2) #22505

gshiba commented Aug 25, 2018 •

edited by jreback

Loading

gfyoung Aug 25, 2018

gshiba Aug 25, 2018

gfyoung commented Aug 25, 2018

codecov bot commented Aug 25, 2018 •

edited

Loading

datapythonista commented Sep 23, 2018

jreback left a comment

jreback Sep 23, 2018

jreback Sep 23, 2018

pep8speaks commented Sep 24, 2018

datapythonista left a comment

jreback commented Sep 25, 2018

Fix DataFrame.to_string() justification (2) #22505

Fix DataFrame.to_string() justification (2) #22505

Conversation

gshiba commented Aug 25, 2018 • edited by jreback Loading

gfyoung Aug 25, 2018

Choose a reason for hiding this comment

gshiba Aug 25, 2018

Choose a reason for hiding this comment

gfyoung commented Aug 25, 2018

codecov bot commented Aug 25, 2018 • edited Loading

Codecov Report

datapythonista commented Sep 23, 2018

jreback left a comment

Choose a reason for hiding this comment

jreback Sep 23, 2018

Choose a reason for hiding this comment

jreback Sep 23, 2018

Choose a reason for hiding this comment

pep8speaks commented Sep 24, 2018

datapythonista left a comment

Choose a reason for hiding this comment

jreback commented Sep 25, 2018

gshiba commented Aug 25, 2018 •

edited by jreback

Loading

codecov bot commented Aug 25, 2018 •

edited

Loading