Automated testing of notebooks #147

birdage · 2014-08-14T15:50:01Z

myself and a few others (@Bobfrat,@daf) have been discussing automated testing of notebooks through travis ci. The testing scope would be pretty limited, i.e just running the notebooks to make sure that no errors are found. This would help when merging, as the merger could see straight away that there might be an issue.

ocefpaf · 2014-08-14T16:06:53Z

I like the simplicity of this idea. It will help to evaluate the PRs. Because we already save the .py file for every notebook, it would be a matter of running them and catching the Exceptions. A long term plan would be to add docstrings to the functions in utilities.py and run them as well.

However, we should not rely completely on travis-ci green light for merging. Even with the testing scheme in place we should send the nbviewer url with every PR to compare results and check the outputs (specially when updating an existing notebook).

PS: For the .travis.yml we will need to use conda instead of the python modules available in travis-ci virtual machine.

Here is an example on how to do that:
https://github.com/SciTools/iris/blob/master/.travis.yml

ocefpaf · 2014-08-15T14:59:00Z

I tried to implement something along those lines. (Warning: it is very hackish! We can do better!!)

The testing module:
https://github.com/ocefpaf/secoora/blob/master/notebooks/test_notebooks.py

And the .travis.yml config:
https://github.com/ocefpaf/secoora/blob/master/.travis.yml

By implementing this I realized that travis will return several (if not all) failures by hanging up. That is due to the heavy download that happens is almost all the notebooks in system-test:
https://travis-ci.org/ocefpaf/secoora/builds/32639783

Any ideas? Split the tests by mocking the download and use saved files for the second part?

rsignell-usgs · 2014-08-15T15:21:37Z

It looks like Travis times out because it didn't receive output for 10 minutes. So the inundation notebook takes more than 10 minutes to run? It shouldn't. We should be smarter about eliminating models that don't need to be tested. And other notebooks run faster, right? Can set at least get it testing the ones that are fast?

birdage · 2014-08-18T18:24:37Z

@rsignell-usgs i think we should remove some of the extreme notebooks from that list as most of them have large data access.

rsignell-usgs · 2014-08-18T20:49:02Z

Yes, or maybe @ocefpaf can figure out how to eliminate some of the models that get accessed but never used (perhaps by being more clever in the script)

ocefpaf · 2014-08-19T01:39:21Z

@birdage For now I think we should add all the notebooks to get some idea of which ones will fail. Later on we can start skipping, non-silently though, those notebooks that are hanging-up.

@rsignell-usgs The SECOORA inundation notebook minimizes* the download by using the KDTree for searching the data. The download only happens if data are found within the requested distance.

There are still some download waste when the data are located over land**. The script does not know if it is land or water until it downloads and checks the data. Maybe I could create a land mask and eliminate all point inside that before hand... Not sure what is the best approach here.

Here is the log file with the download times for each model:

No data near* takes just a few seconds.

land** takes > 1 minute.

jkupiec · 2014-08-19T17:16:25Z

Folks, Is this an enhancement that we can reasonably expect to be implemented within the next 6 weeks or so, or is it a larger effort that must wait for a future project?

birdage · 2014-08-20T16:46:16Z

@ocefpaf i think that is a good starting point yes, just wanted to point out that different notebooks may have different response times when it comes to collecting data.

birdage · 2014-08-20T16:48:43Z

@jkupiec good question, i think it would be a "nice to have" from an integration stand point, and may help closer to the deadline as we try to get things squared away so we are not scrambling to fix errors from a merge. But on the flip side if we are correctly testing notebooks before merging this wouldnt be as much as an issue. my 2 cents.

ocefpaf · 2014-08-20T19:40:37Z

@birdage Can you setup a travis account for system-test (if one does not already exist )? I can send a PR based on what I do for SECOORA if you want.

ocefpaf · 2014-08-21T01:15:37Z

I just sent a PR (#152) with 3 (actually 2) tests.

test_system-test_rules.py check if the .py, pip-requirement.txt and conda-requirement.txt files exist for each notebook found.
test_notebooks.py will run all notebooks found using runipy and a common environment installed by the .travis config file. The reasons I chose runipy are:
- We can save the html output for visualization.
- runipy creates a log of the run that can be inspected.
- We can run the ipython magics present in the notebooks that would fail in a regular python call.
test_environments.py is not implemented! It is just a skeleton for a test that would create a virtualenv or a conda virtual env based on the requirement file for the notebook.

I do not know how travis will "react" to this. As soon as someone creates the Travis account we can start playing with this and fix the broken stuff.

birdage · 2014-08-22T15:23:07Z

@ocefpaf i spoke to @daf that there isnt really a need to create a travis account, it attaches to the github account , i dont have admin access to the SIT repo so cant flip the switch to turn travis on, but i have it on for my fork of the SIT repo https://travis-ci.org/birdage/system-test

ocefpaf · 2014-08-22T15:27:08Z

By "creating the travis account" I meant exactly that: attach the ioos/system-test and flip the switch.

@birdage I did the same for my branch. But if the main ioos/system-test has travis on we will see the test status for every PR.

birdage · 2014-08-22T15:37:02Z

@ocefpaf indeed! we need someone with github admin access on that repo to flip the switch, unfortunately i dont have it. ill add the badge as well to the read me 👍
@kknee for info

kwilcox · 2014-09-10T14:53:03Z

Travis seems to be doing something, closing this issue. Further issues with Travis can have their own home.

birdage added the enhancement label Aug 14, 2014

ocefpaf mentioned this issue Aug 15, 2014

Testing SECOORA/skill_score#52

Merged

ocefpaf mentioned this issue Aug 21, 2014

Travis test prototype. #152

Merged

birdage closed this as completed in #152 Aug 22, 2014

birdage reopened this Aug 22, 2014

kwilcox closed this as completed Sep 10, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automated testing of notebooks #147

Automated testing of notebooks #147

birdage commented Aug 14, 2014

ocefpaf commented Aug 14, 2014

ocefpaf commented Aug 15, 2014

rsignell-usgs commented Aug 15, 2014

birdage commented Aug 18, 2014

rsignell-usgs commented Aug 18, 2014

ocefpaf commented Aug 19, 2014

jkupiec commented Aug 19, 2014

birdage commented Aug 20, 2014

birdage commented Aug 20, 2014

ocefpaf commented Aug 20, 2014

ocefpaf commented Aug 21, 2014

birdage commented Aug 22, 2014

ocefpaf commented Aug 22, 2014

birdage commented Aug 22, 2014

kwilcox commented Sep 10, 2014

Automated testing of notebooks #147

Automated testing of notebooks #147

Comments

birdage commented Aug 14, 2014

ocefpaf commented Aug 14, 2014

ocefpaf commented Aug 15, 2014

rsignell-usgs commented Aug 15, 2014

birdage commented Aug 18, 2014

rsignell-usgs commented Aug 18, 2014

ocefpaf commented Aug 19, 2014

jkupiec commented Aug 19, 2014

birdage commented Aug 20, 2014

birdage commented Aug 20, 2014

ocefpaf commented Aug 20, 2014

ocefpaf commented Aug 21, 2014

birdage commented Aug 22, 2014

ocefpaf commented Aug 22, 2014

birdage commented Aug 22, 2014

kwilcox commented Sep 10, 2014