REF: test_to_latex #36528

ivanovmg · 2020-09-21T17:19:02Z

Refactor/clean-up test_to_latex.py.

Split big test functions with multiple assertions into multiple functions.
Make expected strings readable: indent them for visual clarity, then dedent by the leading whitespace before the assertion.
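
A minimal sketch of the indent-then-dedent idea, assuming a small helper named `_dedent` (a hypothetical name, not necessarily the one used in this PR) and a hand-written `result` string standing in for the output of `to_latex()`:

```python
from textwrap import dedent


def _dedent(text: str) -> str:
    """Hypothetical helper: drop the common indentation and the leading newline."""
    return dedent(text).lstrip("\n")


# Hand-written stand-in for what DataFrame.to_latex() might return.
result = "\\begin{tabular}{lr}\n\\toprule\n{} &  a \\\\\n\\bottomrule\n\\end{tabular}\n"

# The expected string is indented to line up with the surrounding code,
# then dedented so that it compares equal to the unindented output.
expected = _dedent(
    r"""
    \begin{tabular}{lr}
    \toprule
    {} &  a \\
    \bottomrule
    \end{tabular}
    """
)

assert result == expected
```

The raw string keeps the LaTeX backslashes readable, and the whitespace-only last line is normalized away by `dedent`, so the expected text ends with a single newline.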

Split tests into multiple test functions in test_to_latex.

WillAyd

Cool, I think this helps readability. Just one minor comment; otherwise lgtm.

WillAyd · 2020-09-21T21:55:28Z

pandas/tests/io/formats/test_to_latex.py

        assert result == expected

-        df = DataFrame.from_dict(
+    @pytest.fixture
+    def multiindex_frame(self):


Just as a matter of convention can you move the fixture to the top of the module rather than in between the tests? (though this was nice to keep here for review)

I moved the fixtures to the top of the class. If I take one more step in a separate PR, I would split the large test class into multiple classes and distribute relevant fixtures between them.

jreback · 2020-09-21T23:23:10Z

yeah agree with @WillAyd comment as well. Note that would be also fine to organize these into different classes (if that's useful). same / followon ok with this

ping on green.

ivanovmg · 2020-09-22T08:13:05Z

yeah agree with @WillAyd comment as well. Note that would be also fine to organize these into different classes (if that's useful). same / followon ok with this

ping on green.

Yes, I thought of re-organizing the tests into separate classes. I would prefer to do that in a separate PR, after this one.

What is the policy on naming test functions? If we organize the tests into appropriate classes, do we still need to keep to_latex in every test name? As I understand it, this is useful for filtering with pytest .. -k to_latex, but one can equally run pytest -k ToLatex if all class names contain ToLatex.

Similarly, if we have a separate class for, say, multiindex, then I would rather remove multiindex from the test function names as well, since that information will be available in the class name.
What do you think?

simonjayhawkins

Thanks @ivanovmg for the PR. generally lgtm.

simonjayhawkins · 2020-09-22T14:22:54Z

pandas/tests/io/formats/test_to_latex.py

            with open(path) as f:
                assert float_frame.to_latex() == f.read()

+    def test_to_latex_to_file_utf8_with_encoding(self, float_frame):


float_frame not needed in this test.

Thank you! Removed.

simonjayhawkins · 2020-09-22T14:23:15Z

pandas/tests/io/formats/test_to_latex.py

        # test with utf-8 and encoding option (GH 7061)
        df = DataFrame([["au\xdfgangen"]])
        with tm.ensure_clean("test.tex") as path:
            df.to_latex(path, encoding="utf-8")
            with codecs.open(path, "r", encoding="utf-8") as f:
                assert df.to_latex() == f.read()

+    def test_to_latex_to_file_utf8_without_encoding(self, float_frame):


simonjayhawkins · 2020-09-22T14:28:29Z

pandas/tests/io/formats/test_to_latex.py

 class TestToLatex:
-    def test_to_latex_filename(self, float_frame):
+    @pytest.fixture
+    def df_caption_label(self):


fixtures should have docstrings xref #19159
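
For illustration, a fixture with a one-line docstring might look like the sketch below. The class name, fixture name, and caption text are assumptions made for the example, not taken from the diff:

```python
import pytest


class TestToLatexCaptionLabel:
    @pytest.fixture
    def caption_table(self):
        """Caption for a table/tabular LaTeX environment."""
        return "a table in a \\texttt{table/tabular} environment"

    def test_caption_is_nonempty(self, caption_table):
        # pytest injects the fixture's return value by argument name.
        assert caption_table
```

The docstring is also what `pytest --fixtures` prints next to the fixture name, which is one practical reason to write it.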

simonjayhawkins · 2020-09-22T14:41:11Z

pandas/tests/io/formats/test_to_latex.py

+        )
+
+    @pytest.fixture
+    def df_caption_label_longtable(self):


might make the tests more readable (no tuple unpacking) if frame, caption, and label were composable fixtures from a longtable fixture yielding True/False

because fixtures returning tuples can be undesirable for several reasons, pytest-cases has a helper to overcome this https://smarie.github.io/python-pytest-cases/pytest_goodies/#unpack_fixture-unpack_into.

so should maybe consider avoiding creating fixtures returning tuples.

I tried to make it less painful by using namedtuple.
Anyway, I checked on the fixture unpacking (thank you for mentioning it, I was not aware of that).
So far, for the cases concerned, I do not see why it is better than separate fixtures. I mean, that is certainly beneficial if the fixture contains parametrization, but the ones involved do not have parametrization.

So, I decided to supply separate fixtures for dataframe, captions and labels.
What do you think about it?
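
A rough sketch of the separate-fixtures approach described above. The fixture names and the `render_header` function are hypothetical stand-ins (no pandas involved), meant only to show that each value arrives by name with no tuple unpacking:

```python
import pytest


def render_header(caption: str, label: str) -> str:
    """Toy stand-in for the caption/label lines that to_latex would emit."""
    return f"\\caption{{{caption}}}\n\\label{{{label}}}\n"


@pytest.fixture
def caption():
    """Caption used by the tests below."""
    return "a table in a tabular environment"


@pytest.fixture
def label():
    """Label used by the tests below."""
    return "tab:example"


def test_header_contains_caption_and_label(caption, label):
    # Each fixture is requested separately; nothing is unpacked from a tuple.
    header = render_header(caption, label)
    assert f"\\caption{{{caption}}}" in header
    assert f"\\label{{{label}}}" in header
```

A test that only needs the caption can request just that fixture, which a tuple-returning fixture would not allow without discarding values.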

changes look good.

So far, for the cases concerned, I do not see why it is better than separate fixtures. I mean, that is certainly beneficial if the fixture contains parametrization, but the ones involved do not have parametrization.

In a follow-on, the caption and label fixtures could maybe be parameterised on longtable True/False and the tests combined.

It is sometimes better to parameterise the tests first and then extract common parameterisation into fixtures.
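
One way the follow-on suggestion could look, as a sketch only: a `longtable` fixture parameterised on True/False, with a `caption` fixture composed from it. The helper `environment_name` and the caption wording are assumptions for the example:

```python
import pytest


def environment_name(longtable: bool) -> str:
    """Hypothetical helper mapping the flag to a LaTeX environment name."""
    return "longtable" if longtable else "table"


@pytest.fixture(params=[True, False], ids=["longtable", "table"])
def longtable(request):
    """Whether the longtable environment is requested."""
    return request.param


@pytest.fixture
def caption(longtable):
    """Caption text matching the environment under test."""
    return f"a table in a \\texttt{{{environment_name(longtable)}}} environment"


def test_caption_mentions_environment(longtable, caption):
    # Runs twice, once per value of the longtable fixture.
    assert environment_name(longtable) in caption
```

Because `caption` depends on `longtable`, a single test covers both environments instead of two near-identical tests.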

WillAyd

lgtm @simonjayhawkins

jreback · 2020-09-22T22:34:24Z

thanks @ivanovmg very nice.

so a couple of things for followons.

prefer shorter PRs, though I do get that sometimes it makes sense to change everything in one go
we use classes for organization, i would still leave the to_latex in the test name as with pytest you could search on this as well
using composable fixtures is nice if you can, but again use what makes sense

ivanovmg added 2 commits September 21, 2020 23:22

REF: split tests in test_to_latex

39ba69f

Split tests into multiple test functions in test_to_latex.

CLN: make readable expected strings using dedent

ffc709d

WillAyd requested changes Sep 21, 2020

WillAyd added the Clean label Sep 21, 2020

jreback added the IO LaTeX (to_latex) and Testing (pandas testing functions or related to the test suite) labels Sep 21, 2020

jreback added this to the 1.2 milestone Sep 21, 2020

ivanovmg mentioned this pull request Sep 22, 2020

ENH: Enable short_caption in to_latex #35668

Merged

5 tasks

CLN: move fixtures to top of TextToLatex

885c0f1

ivanovmg requested a review from WillAyd September 22, 2020 08:58

simonjayhawkins reviewed Sep 22, 2020

ivanovmg added 4 commits September 22, 2020 22:20

CLN: remove unused parameters

d285e3a

DOC: add docstrings to fixtures

e54f1a9

REF: split fixtures returning tuples into multiple

f2bf712

LINT: re-format for new black

ef2754b

jreback approved these changes Sep 22, 2020

WillAyd approved these changes Sep 22, 2020

jreback merged commit 254f509 into pandas-dev:master Sep 22, 2020

ivanovmg deleted the refactor/test-latex branch September 28, 2020 18:53

kesmit13 pushed a commit to kesmit13/pandas that referenced this pull request Nov 2, 2020

REF: test_to_latex (pandas-dev#36528)

6343add
