Research: `./wpt test-jobs` and `./wpt affected-tests` stats #13936

foolip · 2018-11-05T23:56:32Z

Travis does not support making decisions in one job that affects another. Instead we start 10 jobs (example) and exit early in each if there's nothing to do. Both Azure Pipelines and Taskcluster supports making decisions, with build-wide variables and decision tasks respectively.

This issue is about how to set up such dependent jobs. Considerations:

Starting a VM and cloning the repo takes time, ~30s on Azure Pipelines.
Avoid waste by starting jobs that end up doing nothing.
Keep overall latency low. (In tension with above, because with enough parallelism speculatively starting jobs lowers latency.)
Keep latency to test results in particular low, since those will feed into @lukebjerring's wpt.fyi check that combines results from CIs.

To inform the decision, I've looked into what ./wpt test-jobs and ./wpt affected-tests has returned in recent history, see script at end.

For ./wpt test-jobs (fast) I compared 1410 merge_pr_* to their preceding tags as an approximation of what would happen on the PRs, and got these numbers of "PRs" triggering each job:

   1410 manifest_upload (100%)
   1410 lint (100%)
   1151 stability (82%)
    317 build_css (22%)
    227 resources_unittest (16%)
    192 wptrunner_infrastructure (14%)
    185 wpt_integration (13%)
    185 tools_unittest (13%)
    181 update_built (13%)
     97 wptrunner_unittest (7%)

For ./wpt test-jobs (fast) I looked at 243 merge_pr_* tags. Of the 190 that would triggered the stability job, I made a spreadsheet, and it's clear (and unsurprising) that most PRs don't change many tests. The 50th percentile is 1, and the 90th percentile is 12. Only 6 "PRs" affected >100 tests.

(Among the 53 that didn't trigger the stability job, 3 still had affected tests. Will file bug.)

Putting this together, conclusions are:

Lint always runs, so whatever job it's in should be unconditional. (No change.)
Prioritize running affected tests, since there will usually (82%) be some, and getting those results fast is very useful.
The rest are rare enough that the best trade-off is probably to spin up parallel job for them from the "decision task".

shell script used

#!/bin/bash

mkdir -p ../tests-affected
prevtag=""
# manifest version bumped in merge_pr_12563
# interfaces/*.idl affected tests logic fixed in merge_pr_13392
git tag --list --contains=merge_pr_13392 --sort=committerdate | while read tag; do
    if [[ -z "$prevtag" ]]; then
        prevtag=$tag
        continue
    fi

    echo "Listing $tag tests-affected"
    git checkout -q $tag
    ./wpt manifest
    ./wpt tests-affected $prevtag > ../tests-affected/$tag.txt
    prevtag=$tag
done

mkdir -p ../test-jobs
prevtag=""
# tools/ci/jobs.py changed in merge_pr_12174
git tag --list --contains=merge_pr_12174 --sort=committerdate | while read tag; do
    if [[ -z "$prevtag" ]]; then
        prevtag=$tag
        continue
    fi

    echo "Listing $tag test-jobs"
    git checkout -q $tag
    ./wpt test-jobs $prevtag > ../test-jobs/$tag.txt
    prevtag=$tag
done

Possibly interested parties: @jugglinmike and @web-platform-tests/admins

The text was updated successfully, but these errors were encountered:

foolip · 2018-12-12T09:10:01Z

The research has been done, and I've used my own recommendation in #14156 and #14459, closing.

foolip mentioned this issue Nov 6, 2018

./wpt tests-affected can list tests when ./wpt test-jobs doesn't list stability #13937

Open

gsnedders added the infra label Nov 6, 2018

foolip added the priority:roadmap label Nov 7, 2018

foolip closed this as completed Dec 12, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Research: `./wpt test-jobs` and `./wpt affected-tests` stats #13936

Research: `./wpt test-jobs` and `./wpt affected-tests` stats #13936

foolip commented Nov 5, 2018

foolip commented Dec 12, 2018

Research: ./wpt test-jobs and ./wpt affected-tests stats #13936

Research: ./wpt test-jobs and ./wpt affected-tests stats #13936

Comments

foolip commented Nov 5, 2018

foolip commented Dec 12, 2018

Research: `./wpt test-jobs` and `./wpt affected-tests` stats #13936

Research: `./wpt test-jobs` and `./wpt affected-tests` stats #13936