Add CI workflow to run android tests on mobile phones #13024

pzread · 2023-04-11T19:57:07Z

Add a new workflow to cross-compile and run tests on Android devices.

It breaks out the Android target from the matrix cross_compile_and_test. I think it makes sense to do so as Android has its own matrix to run tests on multiple devices.

Currently this will only run on postsubmit due to the limited capacity of the mobile devices.

ScottTodd · 2023-04-12T19:01:02Z

.github/workflows/ci.yml

-          - platform: android
-            arch: armv8.2-a
-            abi: arm64-v8a
-            docker_image: "gcr.io/iree-oss/android@sha256:3f641d25786b1e5e430ee4cacb8bfe57540fda5ecaa7ca2802c179c26e77ce09"
-            build_script: "./build_tools/cmake/build_android.sh"
-            # No test_script


It breaks out the Android target from the matrix cross_compile_and_test. I think it makes sense to do so as Android has its own matrix to run tests on multiple devices.

I'd prefer to add to the existing matrix (android_pixel_4, android_pixel_6, etc.). We can add another setup script or more env vars as needed for that. This PR is adding a lot of boilerplate code right now.

AFAIK we are not able to do that because android test workflow needs two different machines: 1. build machine to do cross-compile and 2. mobile phones to run tests. So they must be run in different jobs, but the matrix only supports a single job.

Ideally in the test step we can call a reusable workflow to run tests on different devices, but I don't think it's possible right now (reusable workflow must be called on the job level: https://docs.github.com/en/actions/using-workflows/reusing-workflows#calling-a-reusable-workflow)

Another way is having a matrix job to cross-compile and upload test artifacts, and having another matrix job to download and run tests on different machines. But that will introduce unnecessary deps between cross-compile and test jobs (android tests need to wait for all cross-compilation finished because it can't just wait on the android cross-compile in the matrix due to Github limitation)

because android test workflow needs two different machines: 1. build machine to do cross-compile and 2. mobile phones to run tests

Isn't that the point of cross compiling in general though? Every job in the matrix will want to

run the already built compiler the generate program artifacts

build the runtime (fast, so doesn't need a large build machine)

start an emulator, connect to a device, etc.

run tests on the emulated platform / connected device

Correct, but previously we can run build and test in a single job because we only use emulators to run tests on the same build machine. However it's not the case now because android tests are running on mobile phones and those devices don't connect to the build machine.

So we want something like

jobs: cross_compile_and_test: matrix: - arch: arm_64 test_device: pixel-4-rpi - arch: riscv_64 test_device: riscv-emulator steps: cross_compile: run-on: x86_64_build_machine runs: ./cross_compile.sh ${matrix.arch} test: # This is not possible today because `run-on` can only be at the job level. run-on: ${matrix.test_device} runs: ./run_test.sh

But the problem is we can't run steps on different devices in a single job and the matrix is limited to a single job.

Desktop GPU machines should be plenty powerful enough to build the runtime and compile any artifacts they need though

? We're only talking about Android here. We definitely would like to run Android emulator tests, but we're still going to need physical devices for real GPU tests and benchmarking.

It's annoying that GitHub Actions doesn't give us the right tools here. Looking at the graph, it seems like making the Android tests depend on the full cross-compilation wouldn't be too awful:

If we do, we should definitely leave a note about what's going on though.

An alternative would be to factor cross-compile into a reusable workflow, so that there's less duplication. I'm not sure whether passing the all the arguments ends up being as complicated as just copy-pasting the whole thing though.

If we put it in a reusable workflow, can we get a matrix to run multiple jobs? https://docs.github.com/en/actions/using-workflows/reusing-workflows#using-a-matrix-strategy-with-a-reusable-workflow

The reusable workflow would do both compilation and testing and that would be abstracted away from the caller

If we put it in a reusable workflow, can we get a matrix to run multiple jobs? https://docs.github.com/en/actions/using-workflows/reusing-workflows#using-a-matrix-strategy-with-a-reusable-workflow

The reusable workflow would do both compilation and testing and that would be abstracted away from the caller

I think that is a good solution to reduce boilerplate code. But Github CI decides to not show the dependency graph for the reusable workflow in a matrix... (the two jobs in the screenshot below should be cross-compile -> test). Feel like this makes the readability worse

(from the run https://github.com/openxla/iree/actions/runs/4735027150)

That's pretty slow for pure overhead of something we want to run on presubmit. Will that be a bottleneck if we have multiple PRs and merged commits all queuing to use a single RPi? We might be able to proceed anyways, but the time burnt on overhead and the extra workflow complexity concerns me.

I updated the PR description. This test workflow is intentionally to only run on postsubmit for now (as we are currently doing with the buildkite test pipeline), as we only have two devices for each model.

I also want to mention the big benefit to proceeding is that we can drop the buildkite pipeline.

After trying more with reusable workflows and matrix jobs, I decided to stick with the original separate android workflow. Added the comment to explain why: https://github.com/openxla/iree/pull/13024/files#diff-b803fcb7f17ed9235f1e5cb1fcd2f5d3b2838429d4368ae4c57ce4436577f03fR935-R941

Basically the requirements of running on physical (non-scalable) devices and limited capabilities of matrix jobs make the Android test more complicated.

Eventually other platforms might also have physical devices to run. At that point we can generalize the current Android workflow to reuse it. But that will require passing more parameters and customisable/dynamic logic in the workflow and I don't think it will happen in the near future, so maybe we shouldn't over-generalized the solution right now

Yeah, while I'm not thrilled that we have to do it this way, the other near-term options are not awesome either. I think we should merge this as-is

.github/workflows/ci.yml

This reverts commit d2b6dc4.

This reverts commit 6e5b2d2.

GMNGeoffrey · 2023-04-19T17:21:03Z

.github/workflows/ci.yml

-          - platform: android
-            arch: armv8.2-a
-            abi: arm64-v8a
-            docker_image: "gcr.io/iree-oss/android@sha256:3f641d25786b1e5e430ee4cacb8bfe57540fda5ecaa7ca2802c179c26e77ce09"
-            build_script: "./build_tools/cmake/build_android.sh"
-            # No test_script


Yeah, while I'm not thrilled that we have to do it this way, the other near-term options are not awesome either. I think we should merge this as-is

Add a new workflow to cross-compile and run tests on Android devices. It breaks out the Android target from the matrix `cross_compile_and_test`. I think it makes sense to do so as Android has its own matrix to run tests on multiple devices. Currently this will only run on postsubmit due to the limited capacity of the mobile devices.

pzread marked this pull request as ready for review April 12, 2023 17:24

pzread requested review from GMNGeoffrey and ScottTodd as code owners April 12, 2023 17:24

ScottTodd reviewed Apr 12, 2023

View reviewed changes

pzread force-pushed the ci-android-test branch from 376e1ee to 93fd45c Compare April 12, 2023 19:45

pzread requested a review from ScottTodd April 13, 2023 05:32

ScottTodd added infrastructure Relating to build systems, CI, or testing platform/android 🤖 Android-specific build, execution, benchmarking, and deployment labels Apr 14, 2023

Che-Yu Wu added 8 commits April 18, 2023 16:46

Add android test workflow

e21da5b

Fix write-caches type

92fd782

Use gcloud cli

e8de8d5

Fix is-pr type

a69687f

Use matrix

0c6d22b

Fix variable name

845e7a8

Don't pack *.o and *.a

3661aac

Add job to summary

c795166

pzread force-pushed the ci-android-test branch 9 times, most recently from b06ec61 to 6e5b2d2 Compare April 19, 2023 01:28

Che-Yu Wu added 4 commits April 19, 2023 07:56

Cross-compile all then test on real devices

4576004

Test pr check

1d199a2

Revert "Test pr check"

33f0865

This reverts commit d2b6dc4.

Revert "Cross-compile all then test on real devices"

41b2fc7

This reverts commit 6e5b2d2.

pzread force-pushed the ci-android-test branch from d2b6dc4 to 41b2fc7 Compare April 19, 2023 07:56

Better matrix job name

eb27ee3

pzread force-pushed the ci-android-test branch 2 times, most recently from b2f5fb9 to 7f5ea4b Compare April 19, 2023 08:46

Add comment to explain the separate test workflow

759d30e

pzread force-pushed the ci-android-test branch from 7f5ea4b to 759d30e Compare April 19, 2023 08:57

GMNGeoffrey approved these changes Apr 19, 2023

View reviewed changes

pzread force-pushed the ci-android-test branch 2 times, most recently from 01c981a to 764401e Compare April 19, 2023 17:47

Enable postsubmit guard

9787e96

pzread force-pushed the ci-android-test branch from 764401e to 9787e96 Compare April 19, 2023 17:47

pzread enabled auto-merge (squash) April 19, 2023 18:11

pzread merged commit e4e2398 into iree-org:main Apr 19, 2023

pzread mentioned this pull request Apr 20, 2023

Migrate to GitHub actions #9855

Closed

42 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CI workflow to run android tests on mobile phones #13024

Add CI workflow to run android tests on mobile phones #13024

pzread commented Apr 11, 2023 •

edited

Loading

ScottTodd Apr 12, 2023

pzread Apr 12, 2023

pzread Apr 12, 2023 •

edited

Loading

ScottTodd Apr 12, 2023

pzread Apr 12, 2023 •

edited

Loading

GMNGeoffrey Apr 17, 2023

GMNGeoffrey Apr 17, 2023

pzread Apr 18, 2023 •

edited

Loading

pzread Apr 19, 2023 •

edited

Loading

GMNGeoffrey Apr 19, 2023

GMNGeoffrey Apr 19, 2023

Add CI workflow to run android tests on mobile phones #13024

Add CI workflow to run android tests on mobile phones #13024

Conversation

pzread commented Apr 11, 2023 • edited Loading

ScottTodd Apr 12, 2023

Choose a reason for hiding this comment

pzread Apr 12, 2023

Choose a reason for hiding this comment

pzread Apr 12, 2023 • edited Loading

Choose a reason for hiding this comment

ScottTodd Apr 12, 2023

Choose a reason for hiding this comment

pzread Apr 12, 2023 • edited Loading

Choose a reason for hiding this comment

GMNGeoffrey Apr 17, 2023

Choose a reason for hiding this comment

GMNGeoffrey Apr 17, 2023

Choose a reason for hiding this comment

pzread Apr 18, 2023 • edited Loading

Choose a reason for hiding this comment

pzread Apr 19, 2023 • edited Loading

Choose a reason for hiding this comment

GMNGeoffrey Apr 19, 2023

Choose a reason for hiding this comment

GMNGeoffrey Apr 19, 2023

Choose a reason for hiding this comment

pzread commented Apr 11, 2023 •

edited

Loading

pzread Apr 12, 2023 •

edited

Loading

pzread Apr 12, 2023 •

edited

Loading

pzread Apr 18, 2023 •

edited

Loading

pzread Apr 19, 2023 •

edited

Loading