Maintenance: run e2e tests in parallel #1512

Closed
1 of 2 tasks
am29d opened this issue Jun 19, 2023 · 4 comments · Fixed by #1747
Comments

am29d (Contributor) commented Jun 19, 2023

Summary

While we run e2e tests in a matrix:

    strategy:
      matrix:
        package: [logger, metrics, tracer, parameters]
        version: [14, 16, 18]
      fail-fast: false

it would be great to also be able to execute these tests in parallel during development.
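For reference, each cell of that matrix is one package/runtime combination; based on commands discussed later in this thread, a single cell can roughly be reproduced locally like this (the package and runtime shown are illustrative):

RUNTIME=nodejs16x npm run test:e2e -w packages/logger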

Why is this needed?

To reduce feedback cycle time for e2e tests during development.

Which area does this relate to?

Automation

Solution

@dreamorosi already opened a discussion with pointers to the relevant RFCs. I don't have a solution yet, so any recommendation or contribution is appreciated.

Acknowledgment

Future readers

Please react with 👍 and your use case to help us understand customer demand.

am29d added the triage and internal labels on Jun 19, 2023
dreamorosi (Contributor) commented

If you run them with npm run test:e2e -ws it will run them sequentially for one runtime (Node.js 18).

If we want to run them in parallel, we could write a bash script that does something like npm run test:e2e -w packages/tracer & npm run test:e2e -w packages/logger & ....

This, however, will mix all the stdout & stderr, making it hard to read what's happening in case of multiple failures. The same happens with lerna, which is able to run things in parallel and is already available in the repo (but used only for version and release).
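A minimal sketch of such a script, with each package's output redirected to its own log file to work around the mixed stdout/stderr (the package list and file names are illustrative):

#!/usr/bin/env bash
# Run each package's e2e suite in the background, writing output to a separate
# log file per package so failures are easier to read afterwards.
for pkg in tracer logger metrics parameters; do
  npm run test:e2e -w "packages/$pkg" > "e2e-$pkg.log" 2>&1 &
done
# Wait for all background runs to finish before exiting.
wait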

This is something we should keep an eye on; however, in most cases I run only specific groups/utilities locally and run the full suite only on the repo. This also makes it easier to collaborate on results and troubleshooting when needed.

dreamorosi added the automation and blocked labels and removed the triage label on Jun 19, 2023
dreamorosi (Contributor) commented

I've been looking into this while working on other issues, and running the tests in parallel with a fairly crude method caused me to hit rate limits on deployment.

Specifically, I ran three terminal sessions like this:

  • session 1: RUNTIME=nodejs14x npm run test:e2e -w packages/tracer
  • session 2: RUNTIME=nodejs16x npm run test:e2e -w packages/tracer
  • session 3: npm run test:e2e -w packages/tracer

I wonder if this will be an issue if we try to run multiple package/runtime combos concurrently.
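If we do go down this route, one hypothetical way to soften the rate limits would be to stagger the start of each run so the CloudFormation calls don't all happen at once (the 60-second delay is arbitrary; RUNTIME and the test:e2e script are the same as in the sessions above):

# Stagger one e2e run per runtime for the same package, writing each run's
# output to its own log file.
for runtime in nodejs14x nodejs16x nodejs18x; do
  RUNTIME="$runtime" npm run test:e2e -w packages/tracer > "e2e-tracer-$runtime.log" 2>&1 &
  sleep 60
done
wait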

dreamorosi linked a pull request on Oct 12, 2023 that will close this issue
dreamorosi removed a link to a pull request on Oct 12, 2023
dreamorosi self-assigned this on Oct 16, 2023
dreamorosi added the confirmed label and removed the blocked label on Oct 16, 2023
dreamorosi (Contributor) commented Oct 16, 2023

I have been spending some time investigating this and wanted to share some updates, as well as a low-hanging/low-effort way of mitigating this in the short term.

The main way of running the integration tests in all the workspaces is via npm workspaces, which can be done with this command:

npm run test:e2e -ws
# ... many logs
# duration 10m 55s

This runs the tests one workspace at a time, in the order they appear in the package.json file under workspaces. Test files (aka cases) within a workspace are already parallelized by Jest by default.
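For context, that order is simply the order of the workspaces array in the root package.json; a hypothetical excerpt (the actual list in the repo may differ):

{
  "workspaces": [
    "packages/commons",
    "packages/logger",
    "packages/metrics",
    "packages/tracer",
    "packages/parameters",
    "packages/idempotency"
  ]
}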

As I mentioned in one of the comments above, another option that is enabled (but not documented) in our repo would be to use Lerna:

npx lerna exec --no-bail --no-sort --stream --concurrency 8 -- npm run test:e2e
# alias npm run test:e2e:parallel
# ... many logs
lerna success exec Executed command in 11 packages: "npm run test:e2e"
# duration 3m 35s

The concurrency can be adjusted via the respective flag; setting it to 8 covers all the utilities that have tests to date, which amounts to maximum concurrency.
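If the deployment rate limits mentioned earlier become a problem, the same command can be run with a lower concurrency, for example:

npx lerna exec --no-bail --no-sort --stream --concurrency 4 -- npm run test:e2e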

Running the tests this way cuts the time by almost 66%; however, the experience is not great imo because the logs are all mixed together (see below) and, since they are streamed, things like progress bars don't work and instead generate one line for each update:

@aws-lambda-powertools/metrics: arn:aws:cloudformation:eu-west-1:12345678901:stack/Metrics-18-x86-77c72-BasicFeatures-Decorators/0c06b3f0-6c0b-11ee-8eaf-02b78ef81927
@aws-lambda-powertools/idempotency:  ✅  Idempotency-18-x86-12cfb-makeHandlerIdempoten
@aws-lambda-powertools/tracer: arn:aws:cloudformation:eu-west-1: 12345678901:stack/Tracer-18-x86-a6e80-AllFeatures-AsyncDecorato/262b5c40-6c0b-11ee-9da3-0acf4e5b4307

Additionally, since we must use the --no-bail flag to ensure that the test lifecycle completes and the CloudFormation stacks are torn down, in case of failing tests the error messages can easily get lost in the stream and make the output useless (see below):

lerna ERR! Received non-zero exit code 1 during execution
lerna success exec Executed command in 11 packages: "npm run test:e2e"
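One possible mitigation (just a sketch, not something we do today) is to capture the whole stream to a file and search it afterwards for Jest's failure markers:

npm run test:e2e:parallel 2>&1 | tee e2e-parallel.log
# Jest prints a FAIL line for each failing test file
grep -n "FAIL" e2e-parallel.log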

However, if you want a quick way of running all the tests and are reasonably confident that they will pass (i.e. you don't have to consume the logs), then this is the way to do it.

Both methods are now documented in the updated maintainers playbook that is being added in the linked PR.
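For completeness, the test:e2e:parallel alias mentioned above amounts to a scripts entry along these lines in the root package.json (reconstructed from the command shown earlier):

{
  "scripts": {
    "test:e2e:parallel": "lerna exec --no-bail --no-sort --stream --concurrency 8 -- npm run test:e2e"
  }
}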

Note

Below are some considerations on an alternative tool that I have been investigating but that I'm not yet set on. If you're not interested in reading about it, you can stop here.

Investigating other tools

I have been looking at several tools used in monorepos and trying some of them. One of them, called wireit, stands out for now, but there are others that I want to try before settling on one.

I have tried wireit in a personal project that uses a monorepo with 3 workspaces and it looks promising, especially when it comes to caching and parallelizing scripts; however, it requires a significant lift in terms of configuring and maintaining the wiring config.

For example, imagine you want to have some shared npm scripts that live in the main package.json file at the project's root, and then you have some scripts that are specific to some workspaces (e.g. frontend/package.json). Wireit allows you to specify dependencies between scripts (e.g. deploy requires build, etc.) and automatically handles parallelism whenever it's safe to do so.

flowchart LR
    npm-run-frontend:deploy-->deploy
    build-->exportCDKoutputs
    subgraph frontend
    deploy-->build
    end
    subgraph root
    exportCDKoutputs
    end

Below is an excerpt of the root package.json from the project I mentioned that expresses the tree shown above:

{
  "scripts": {
    "exportCDKoutputs": "wireit"
  },
  "wireit": {
    "exportCDKoutputs": {
      "command": "ts-node ./scripts/exportCDKoutputs.ts",
      "files": [
        "./infrastructure/cdk.out/params.json",
        ".env"
      ],
      "output": [
        ".env"
      ],
      "clean": false
    }
  }
}

and this is another from the frontend/package.json file:

{
  "scripts": {
    "build": "rimraf dist && mkdir dist && sh build.sh",
    "deploy": "wireit"
  },
  "wireit": {
    "deploy": {
      "command": "export $(cat ../.env | xargs) && aws s3 sync dist s3://$WEB_STATIC_ASSETS_BUCKET_NAME --delete",
      "dependencies": [
        "../:exportCDKoutputs",
        "build"
      ],
      "WEB_STATIC_ASSETS_BUCKET_NAME": {
        "external": true
      }
    }
  }
}

Another interesting aspect of wireit is the caching: as you can see, it allows you to specify files & output fields. These are used by the tool to determine whether the script should be run or whether it's safe to skip it (i.e. return the output from cache).

The reason why I'm still not sure about it, however, is the complexity of setting this up. The above excerpts express one relationship between 3 scripts across two workspaces, and as you can see it's very verbose and somewhat hard to reason about. Additionally, the caching features are enabled by default, and it's very easy to shoot yourself in the foot if you don't know exactly which artefacts are being generated and how they are used by all the scripts in the dependency tree.

Nevertheless, I think it's something we could potentially consider in the future, but at this time I don't think we have enough bandwidth to focus on it, especially because the potential time savings are still unclear.

github-actions bot commented

⚠️ COMMENT VISIBILITY WARNING ⚠️

Comments on closed issues are hard for our team to see.
If you need more assistance, please either tag a team member or open a new issue that references this one.
If you wish to keep having a conversation with other community members under this issue feel free to do so.

dreamorosi added the completed label and removed the confirmed label on Oct 18, 2023