TST: Decoupled more xlrd reading tests from openpyxl #27114

WillAyd · 2019-06-28T22:51:23Z

closes TST: openpyxl tests fail if xlrd is not installed #27111
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

WillAyd · 2019-06-28T22:52:07Z

pandas/tests/io/excel/test_readers.py

        func = partial(pd.ExcelFile, engine=request.param)
        monkeypatch.chdir(datapath("io", "data"))
        monkeypatch.setattr(pd, 'ExcelFile', func)

    def test_excel_passes_na(self, read_ext):

-        excel = ExcelFile('test4' + read_ext)
+        excel = pd.ExcelFile('test4' + read_ext)


We were monkeypatching pd.ExcelFile but used ExcelFile imported directly in tests which was causing this to run exclusive of the specified engine. Figured easiest just to stick with the pd namespace

WillAyd · 2019-06-28T22:53:07Z

pandas/tests/io/excel/test_readers.py


        parsed = pd.read_excel(excel, 'Sheet1', keep_default_na=False,
                               na_values=['apple'])
        expected = DataFrame([['NA'], [1], ['NA'], [np.nan], ['rabbit']],
                             columns=['Test'])
        tm.assert_frame_equal(parsed, expected)

+        excel = pd.ExcelFile('test4' + read_ext)


openpyxl seems to exhaust a pd.ExcelFile after reading whereas xlrd does not. I don't think we really have a preference here and I can't imagine that the xlrd behavior is necessarily required so just created a new pd.ExcelFile after each use

WillAyd · 2019-06-28T22:54:29Z

pandas/tests/io/excel/test_readers.py

+    pytest.param(None, marks=pytest.mark.skipif(
+        not td.safe_import("xlrd"), reason="no xlrd")),
+])
+def test_conflicting_excel_engines(read_ext, excel_engine, datapath):


This didn't really follow the same rules for parametrization as the rest of the items in TestExcelFileReadso figured clearest to remove from class

jreback · 2019-06-28T23:01:28Z

pandas/tests/io/excel/test_readers.py

@@ -778,7 +782,7 @@ def test_excel_passes_na(self, read_ext):
    @pytest.mark.parametrize('arg', ['sheet', 'sheetname', 'parse_cols'])
    def test_unexpected_kwargs_raises(self, read_ext, arg):
        # gh-17964
-        excel = ExcelFile('test1' + read_ext)
+        excel = pd.ExcelFile('test1' + read_ext)


do we normally need to close the file? use in a context manager?

It looks like it depends on what type of object ExcelFile is interacting with but ultimately converted usage to a context manager to be safe

…xl-only

WillAyd · 2019-06-29T14:54:19Z

pandas/tests/io/excel/test_writers.py

@@ -250,6 +250,7 @@ class and any subclasses, on account of the `autouse=True`
        set_option(option_name, prev_engine)  # Roll back option change


+@td.skip_if_no('xlrd')


The writer tests still assume xlrd to be there. Will fix up in a follow up to address proper parametrization here (have focused more on readers given recent PRs)

@simonjayhawkins

jorisvandenbossche

Tests pass now locally without having xlrd installed!

simonjayhawkins · 2019-06-29T19:37:10Z

This PR results in an additional 2 warnings. although 8 of the same were introduced in #25092.

pandas/tests/io/excel/test_readers.py::TestReaders::test_reader_special_dtypes[openpyxl-.xlsx]
pandas/tests/io/excel/test_readers.py::TestReaders::test_reader_special_dtypes[openpyxl-.xlsm]
pandas/tests/io/excel/test_readers.py::TestReaders::test_reading_all_sheets[openpyxl-.xlsx]
pandas/tests/io/excel/test_readers.py::TestReaders::test_reading_all_sheets[openpyxl-.xlsm]
pandas/tests/io/excel/test_readers.py::TestReaders::test_reading_multiple_specific_sheets[openpyxl-.xlsx]
pandas/tests/io/excel/test_readers.py::TestReaders::test_reading_multiple_specific_sheets[openpyxl-.xlsm]
pandas/tests/io/excel/test_readers.py::TestReaders::test_read_excel_squeeze[openpyxl-.xlsx]
pandas/tests/io/excel/test_readers.py::TestReaders::test_read_excel_squeeze[openpyxl-.xlsm]
pandas/tests/io/excel/test_readers.py::TestExcelFileRead::test_excel_passes_na[openpyxl-.xlsx]
pandas/tests/io/excel/test_readers.py::TestExcelFileRead::test_excel_passes_na[openpyxl-.xlsm]
  C:\Users\simon\Anaconda3\envs\pandas-dev\lib\site-packages\openpyxl\worksheet\_reader.py:296: UserWarning: Unknown extension is not supported and will be removed
    warn(msg)

jorisvandenbossche · 2019-06-29T21:12:05Z

@simonjayhawkins that should be solved by #27122

Decoupled xlrd tests from openpyxl

9994e3c

WillAyd added Testing pandas testing functions or related to the test suite IO Excel read_excel, to_excel labels Jun 28, 2019

WillAyd commented Jun 28, 2019

View reviewed changes

WillAyd changed the title ~~Decoupled xlrd tests from openpyxl~~ Decoupled more xlrd reading tests from openpyxl Jun 28, 2019

jreback reviewed Jun 28, 2019

View reviewed changes

jreback added this to the 0.25.0 milestone Jun 28, 2019

WillAyd added 4 commits June 28, 2019 22:10

Merge remote-tracking branch 'upstream/master' into make-tests-openpy…

63a1a9f

…xl-only

Converted all ExcelFile reads to CM

66363b4

Cleaned up non-xlrd failures

559f13c

lint fixup

d32ac00

WillAyd commented Jun 29, 2019

View reviewed changes

WillAyd added 2 commits June 29, 2019 09:56

Context mgr in test_xlrd

cf7eaf0

Simplified test_conflicting_excel_engines

cb52c3a

jorisvandenbossche changed the title ~~Decoupled more xlrd reading tests from openpyxl~~ TST: Decoupled more xlrd reading tests from openpyxl Jun 29, 2019

jorisvandenbossche approved these changes Jun 29, 2019

View reviewed changes

WillAyd mentioned this pull request Jun 29, 2019

Class to read OpenDocument Tables #25427

Merged

simonjayhawkins approved these changes Jun 29, 2019

View reviewed changes

WillAyd merged commit 65ec968 into pandas-dev:master Jun 30, 2019

WillAyd deleted the make-tests-openpyxl-only branch June 30, 2019 03:46

simonjayhawkins mentioned this pull request Jun 30, 2019

TST/CLN: engine fixture for tests/io/excel/test_readers.py #27139

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TST: Decoupled more xlrd reading tests from openpyxl #27114

TST: Decoupled more xlrd reading tests from openpyxl #27114

WillAyd commented Jun 28, 2019

WillAyd Jun 28, 2019

WillAyd Jun 28, 2019

WillAyd Jun 28, 2019

jreback Jun 28, 2019

WillAyd Jun 29, 2019

WillAyd Jun 29, 2019

jorisvandenbossche left a comment

simonjayhawkins commented Jun 29, 2019

jorisvandenbossche commented Jun 29, 2019

		@@ -250,6 +250,7 @@ class and any subclasses, on account of the `autouse=True`
		set_option(option_name, prev_engine) # Roll back option change


		@td.skip_if_no('xlrd')

TST: Decoupled more xlrd reading tests from openpyxl #27114

TST: Decoupled more xlrd reading tests from openpyxl #27114

Conversation

WillAyd commented Jun 28, 2019

WillAyd Jun 28, 2019

Choose a reason for hiding this comment

WillAyd Jun 28, 2019

Choose a reason for hiding this comment

WillAyd Jun 28, 2019

Choose a reason for hiding this comment

jreback Jun 28, 2019

Choose a reason for hiding this comment

WillAyd Jun 29, 2019

Choose a reason for hiding this comment

WillAyd Jun 29, 2019

Choose a reason for hiding this comment

jorisvandenbossche left a comment

Choose a reason for hiding this comment

simonjayhawkins commented Jun 29, 2019

jorisvandenbossche commented Jun 29, 2019