At the moment, some of our unit & integration test suites are not fit to run on the default CI VMs, which have relatively little memory (7 GB). These are usually tests that run our example notebooks end-to-end with large datasets (ranging from 100k to 20 million rows). As a result, we can only run them on self-hosted VMs with more memory. Does the team think there is a need to redesign these tests/notebooks so they are runnable on GitHub-hosted VMs? @miguelgfierro, @gramhagen?
I think there are some solutions we can go after, besides optimizing the content in the notebook:
Run incomplete training on a smaller dataset and relax the assertion checks.
assert results["rmse"] == pytest.approx(0.8621, rel=TOL, abs=ABS_TOL)
for example, instead of checking whether RMSE is within some tolerance of a target value, we can check that our notebook runs successfully with a smaller dataset and that RMSE "improves" or is not None.
Avoid running the full notebook multiple times for different data sizes.
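As a sketch of the first option (the names `check_smoke_metrics`, `results`, and `baseline_rmse` are illustrative, not the repo's actual fixtures), the relaxed check could look like:

```python
def check_smoke_metrics(results, baseline_rmse):
    """Relaxed smoke-test check: instead of pinning RMSE to 0.8621 within a
    tight tolerance, only require that the notebook produced a metric at all
    and that training beat a do-nothing baseline."""
    rmse = results["rmse"]
    assert rmse is not None, "notebook did not report RMSE"
    assert rmse < baseline_rmse, "training did not improve over the baseline"
    return rmse
```

This still catches the two failure modes that matter most to users (the notebook crashing, or the model learning nothing), while being insensitive to the metric shifts that come from shrinking the dataset.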
Expected behavior with the suggested feature
Tests should fit within the resource constraints of a Standard_DS2_v2 SKU so they are runnable on such a machine.
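One way to meet that constraint without duplicating tests (a sketch; the variable name `TEST_MOVIELENS_SIZE` and the size strings are assumptions for illustration) is to pick the dataset size from the environment, so the same test runs with the small dataset on GitHub-hosted VMs and the large one on self-hosted nightly machines:

```python
import os

def dataset_size(default="100k"):
    # GitHub-hosted VMs (7 GB RAM) get the small dataset by default;
    # the self-hosted nightly pipeline would export TEST_MOVIELENS_SIZE=20m.
    return os.environ.get("TEST_MOVIELENS_SIZE", default)
```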
This has the following benefits:
CI is less dependent on self-hosted resources, so we can scale up better when we support more Python versions
Reduce build time significantly
Faster iteration
The devil's advocate:
Do we care how long our nightly build takes to run?
Are we comfortable with relaxing some of the assertions in the tests and risking the notebooks failing for users?
Is it even possible to run some of the notebooks e2e under 7 GB of memory (minus whatever is needed to keep the OS running)?
Other Comments
This method is a more complete way of testing ML pipelines than just checking that the code works: we also want to make sure that the ML models provide reasonable metrics. That's why we wanted to use a relatively big dataset in the tests.
the test configuration is very thorough, but I think we can restrict the smoke and integration tests to running less frequently (nightly, weekly?).
also, the notebook tests are not really unit tests, since they involve multiple components interacting with each other.
I think we should be aiming for unit tests that exercise individual components and don't use external data; this will make them very fast and less likely to fail if a website goes down. all the other types of tests can be moved to smoke/integration and run nightly.
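A minimal sketch of that split (the marker name `integration` is an assumption, not necessarily what the repo registers): tag the heavy notebook tests so the default CI run can deselect them with `pytest -m "not integration"`, leaving only the fast, data-free unit tests.

```python
import pytest

@pytest.mark.integration  # nightly only; deselected with: pytest -m "not integration"
def test_notebook_end_to_end():
    ...

def test_single_component():
    # pure unit test: no external data, runs in milliseconds
    assert 1 + 1 == 2

def has_marker(test_fn, name):
    # helper for this sketch: check which pipeline a test belongs to
    return any(m.name == name for m in getattr(test_fn, "pytestmark", []))
```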
Link to integration test failures on Github-hosted VM
Link to unit test failures on Github-hosted VM