OIDC: Threat model for Warehouse #10644

woodruffw · 2022-01-25T16:43:02Z

This is a summary of my own notes from conversations with @di concerning Warehouse's threat model around OIDC (and GitHub, in particular, as an OIDC provider).

JWT considerations

JWT reuse

Warehouse needs well-defined and well-documented behavior for handling a JWT that it sees more than once. For example, a GitHub action might do the following (pseudocode):

mint a JWT

for each dist in dist/* {
    use the JWT to mint an access token
    upload dist with the access token
}

In this case, the JWT gets used N times: once per minted access token. This is probably not what we want; instead, we want this:

mint a JWT
mint an access token using the JWT

for each dist in dist/* { ... }

In this usage pattern, Warehouse invalidates the JWT in its backend after the first use. That is probably as simple as checking the jti claim for uniqueness after JWT verification and then adding (jti, exp) to some table, where exp is the JWT's expiration claim. Storing exp lets us automatically clear out old entries once their JWTs expire.
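The jti-as-nonce idea above can be sketched roughly as follows. This is only an illustration, not Warehouse's implementation: the `JTIStore` class and its in-memory dict are hypothetical stand-ins for the (jti, exp) table, and the sketch assumes the JWT's signature and claims have already been verified.

```python
import time


class JTIStore:
    """Hypothetical in-memory stand-in for a (jti, exp) table."""

    def __init__(self):
        self._seen = {}  # maps jti -> exp (Unix timestamp)

    def purge_expired(self, now):
        # Entries for expired JWTs can be dropped: an expired JWT is
        # rejected anyway, so its jti no longer needs to be remembered.
        self._seen = {jti: exp for jti, exp in self._seen.items() if exp > now}

    def consume(self, jti, exp, now=None):
        """Return True only on the first use of an unexpired JWT."""
        now = time.time() if now is None else now
        self.purge_expired(now)
        if exp <= now or jti in self._seen:
            return False
        self._seen[jti] = exp
        return True
```

With this shape, the second call to `consume` for the same jti returns False, so the "mint one access token per dist" loop would fail on its second iteration.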

Non-JWT considerations

Access token ephemerality

We should determine exactly how long-lived our ephemeral access tokens/API keys will be: how long is a reasonable period to allow uploads for? Should we cap the number of individual requests at some high number (e.g., is there any legitimate packaging workflow that creates more than 100 separate distribution files and uploads them)?
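As a rough sketch of the two limits discussed above (a lifetime and a request cap), something like the following could work. Every name here is hypothetical, and the 15-minute lifetime is an arbitrary placeholder rather than a decision; the cap of 100 simply echoes the number floated above.

```python
import secrets
import time
from dataclasses import dataclass, field


@dataclass
class EphemeralToken:
    """Hypothetical access token with an expiry and a request budget."""

    expires_at: float
    remaining_requests: int
    value: str = field(default_factory=lambda: secrets.token_urlsafe(32))

    def authorize(self, now=None):
        """Spend one request if the token is unexpired and has budget left."""
        now = time.time() if now is None else now
        if now >= self.expires_at or self.remaining_requests <= 0:
            return False
        self.remaining_requests -= 1
        return True


def mint_token(ttl_seconds=900, max_requests=100, now=None):
    # Both defaults are placeholders for the open questions above.
    now = time.time() if now is None else now
    return EphemeralToken(expires_at=now + ttl_seconds,
                          remaining_requests=max_requests)
```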

Provider-specific considerations

GitHub: account resurrection/reuse

GitHub allows a deleted user's username to be reused. When this happens, JWTs minted by the new user are indistinguishable from JWTs minted by the old one. This is a potential problem if PyPI is configured to accept JWTs from user/repo @ workflow, where user changes hands on GitHub -- the new (malicious) user would still be able to authenticate with PyPI as if they were the old user.

We probably need a mitigation for this before we fully enable support for GitHub as an OIDC provider. Two potential solutions:

Verify the repository_owner claim in the JWT with some sidecar information: during trusted setup, retrieve the user's GitHub user ID (which is hopefully unique), and on later authentications cross-check it against the owner's current ID. This is not the preferred solution.
Get GitHub to include a repository_owner_id claim in the JWT, or something similar (like a repository_owner_epoch, if they don't want to guarantee the ID's uniqueness).

Neither of these is a great solution, because both still require some amount of sidecar state: we still have to either initially store the GitHub user's ID during trusted setup or during TOFU, so that we can check for changes during subsequent authentications. That complicates our DB schema, which we'd like to keep generic.
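Either way, the check itself would look something like the sketch below. It assumes the ID recorded during trusted setup is available as `trusted_owner_id`, and that the current ID arrives in a `repository_owner_id` claim, which is the claim proposed above and not something the JWT is known to carry. With the first solution, the current ID would come from a GitHub API lookup instead. The function name is hypothetical.

```python
def owner_matches(claims, trusted_owner, trusted_owner_id):
    """Return True if the JWT's owner name and ID both match the stored ones.

    `claims` is the already-verified JWT payload as a dict.
    """
    owner_id = claims.get("repository_owner_id")
    if owner_id is None:
        # Without an ID we cannot tell a resurrected username apart
        # from the original account, so fail closed.
        return False
    return (
        claims.get("repository_owner") == trusted_owner
        and str(owner_id) == str(trusted_owner_id)
    )
```

A resurrected account would present the same repository_owner but a different ID, so the comparison fails.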

GitHub: JWTs minted for other consumers on the same workflow

GitHub allows any action to mint a JWT. Warehouse's consumer will only accept JWTs with acceptable claims (e.g., a matching ref), but that might not be sufficient.

In particular, it's conceivable for a publish workflow to have two jobs: publish-pypi and publish-rubygems, both of which mint JWTs for their respective services. Currently, there is no way for Warehouse to distinguish between these two JWTs: both originate from the same workflow. We could distinguish them with the aud claim, which Warehouse would then filter on, but that claim does not provide authenticity: an attacker who manages to take over the publish-rubygems job could mint a JWT with aud=pypi. We should determine whether this is a situation we need to handle on Warehouse's side.
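A minimal sketch of the aud filter follows, assuming the expected audience string is "pypi" as in the example above. As noted, this only filters out JWTs minted for another audience; it does not prove which job minted the JWT. The function name is hypothetical.

```python
def audience_accepted(claims, expected="pypi"):
    """Return True if the verified JWT payload names the expected audience.

    Per RFC 7519, `aud` may be a single string or a list of strings.
    """
    aud = claims.get("aud")
    if isinstance(aud, str):
        return aud == expected
    if isinstance(aud, (list, tuple)):
        return expected in aud
    return False
```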

GitHub: access token leakage

We should make sure that any (official) actions that mint access tokens via an OIDC JWT add the access token as a "mask value" so that it does not appear in CI logs: https://docs.github.com/en/actions/using-workflows/workflow-commands-for-github-actions#masking-a-value-in-log
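For an action written in Python, masking could be sketched as below. The `::add-mask::` workflow command is the one described at the link above; the helper names are hypothetical. The command has to be emitted before anything else has a chance to print the token.

```python
def mask_command(value):
    """Build the GitHub Actions workflow command that masks `value` in logs."""
    return f"::add-mask::{value}"


def mask_in_logs(value):
    # The runner reads workflow commands from the step's stdout.
    print(mask_command(value), flush=True)
```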

GitHub: trusted workflows

Warehouse has no visibility into workflow access: a user could misconfigure their GitHub Actions such that anybody can run a publish/release action and thereby publish malicious distributions. We can't prevent this, but we should probably provide documentation/guidance to steer users in the right direction.

This is also true for trusted branches/tags: anybody who can push a new tag to the repo could conceivably authenticate with PyPI if the action is, e.g., configured to match against tags like v*.
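To illustrate how permissive a pattern like v* is, here is a small sketch using Python's `fnmatch` module. This only approximates glob matching and is not GitHub's own filter-pattern implementation; the `ref_allowed` helper and the full `refs/tags/...` ref format used here are assumptions for the example.

```python
from fnmatch import fnmatchcase


def ref_allowed(ref, patterns):
    """Return True if `ref` matches any of the trusted glob patterns."""
    return any(fnmatchcase(ref, pattern) for pattern in patterns)


trusted = ["refs/tags/v*"]
# A release tag matches, but so does any other tag starting with "v".
release_ok = ref_allowed("refs/tags/v1.0.0", trusted)
lookalike_ok = ref_allowed("refs/tags/vandalized", trusted)
branch_ok = ref_allowed("refs/heads/main", trusted)
```

Here both tags match and the branch does not, so anyone able to push a tag starting with "v" would satisfy the pattern.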

woodruffw · 2022-01-25T16:59:03Z

Summarizing:

Warehouse will attempt to prevent JWT reuse by enforcing jti as a nonce.
Warehouse will produce only ephemeral access tokens from OIDC JWTs, and never long-lived tokens.
Subject to feasibility, Warehouse will attempt to defend against account reuse/squatting on the GitHub OIDC provider.
Warehouse will not distinguish JWTs by "intent", i.e. a valid workflow can produce arbitrarily many JWTs, some of which might be intended for use on other consumers.
Warehouse will not protect the user from misconfiguring their own CI.

di · 2022-02-18T18:09:38Z

> Neither of these is a great solution, because both still require some amount of sidecar state: we still have to either initially store the GitHub user's ID during trusted setup or during TOFU, so that we can check for changes during subsequent authentications. That complicates our DB schema, which we'd like to keep generic.

I think we can move forward under the assumption that we will eventually have a unique user ID available in the claim.

woodruffw · 2022-02-18T18:11:13Z

> I think we can move forward under the assumption that we will eventually have a unique user ID available in the claim.

Yep! I'm working under that assumption in #10753.

woodruffw · 2022-03-17T18:51:26Z

I think we've fully fleshed out the threat/security considerations here, so I'm inclined to close this for now. The only thing that's currently missing is a way to verify a GitHub user's unique ID, which is blocked on GitHub rather than us (and is more of a development action item than a threat model design item).

Is that good with you @di?

di · 2022-03-17T18:54:57Z

Yep. Let's open an issue to capture the eventual migration of the user ID check from the API call to an OIDC claim, which is blocked on the token having this claim.

woodruffw · 2022-03-17T19:05:58Z

> Yep. Let's open an issue to capture the eventual migration of the user ID check from the API call to an OIDC claim, which is blocked on the token having this claim.

We're still going to need the API call for the trusted setup, unfortunately -- we still need to establish the initial ID to trust when verifying actual JWTs. But yeah, I can make a sub-issue for tracking GitHub's progress on this + adding it to our verification impl.

di · 2022-03-17T19:22:55Z

Ah, yep, I meant for verification.

woodruffw · 2022-04-22T22:36:27Z

#11239 will more or less round out the intended threat model here, by preventing "account resurrection" attacks.

To summarize:

Warehouse will not attempt to prevent JWT reuse, since JWTs are short-lived.
Warehouse will produce only ephemeral access tokens from OIDC JWTs, and never long-lived tokens.
Warehouse will attempt to defend against account reuse/squatting on the GitHub OIDC provider.
Warehouse will distinguish JWTs via the aud claim.
Warehouse will not protect the user from misconfiguring their own CI.

woodruffw · 2022-04-26T22:08:21Z

@di assuming that #10644 (comment) matches your mental model of the threat model, I think this is safe to close.

woodruffw mentioned this issue Jan 25, 2022

Samples of OIDC JWTs from GitHub #10645

Closed

di added the APIs/feeds label Feb 18, 2022

di closed this as completed Apr 26, 2022

woodruffw mentioned this issue Apr 27, 2022

GitHub OIDC: validate job_workflow_ref #11263

Merged

woodruffw mentioned this issue Aug 8, 2022

Routes and endpoints for JWT consumption #10970

Closed

This was referenced Feb 15, 2023

Add container image signing to build workflow PrefectHQ/prefect#8531

Open

Add Python package signing to build workflow PrefectHQ/prefect#8532

Open
