feat: parse JUJU_* env in one place #1313

IronCore864 · 2024-08-13T02:57:25Z

Parse all JUJU_* env vars in one place.

Closes #1075.

Note: This draft makes the minimum changes by creating a dataclass; no other logic in main.py is changed. Variables are not grouped into smaller dataclasses, and all types are str, just as they were when read with os.environ.get().

There will be another alternative draft with variable grouping and typing.

More Explanations

1 Grouping

I think it's fine without it but I'm not against it either. If there is a solid reason for grouping, I'm OK with it, but here are my thoughts:

Value added: It seems only JUJU_NOTICE_* and JUJU_SECRET_* can be grouped into a smaller dataclass. Maybe we can argue that JUJU_RELATION and JUJU_RELATION_ID can be grouped too, but then the first needs to be renamed to JUJU_RELATION_NAME, which deviates from the actual env variable name. So most env vars can't really be grouped, which means grouping can't shorten the _JujuContext dataclass or make it more readable. And at the call site there is little difference between juju_context_obj.juju_notice_id and juju_context_obj.juju_notice.id. So I think the value added by grouping is insignificant.
Logic: Without grouping, all variables sit on the same level, parallel and similar to each other. If we group them, as in _JujuContext.notice.id and _JujuContext.juju_charm_dir, it suggests that _JujuContext owns notice and juju_charm_dir on the same level, and that notice is a separate object that owns id. In fact that's not the case: they are all just env vars.
Last but not least, the source comes from here where it's just a list of key/value pairs with no grouping, so I think maybe it's better to stay the same.
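To make the comparison concrete, here is a minimal sketch of the two layouts discussed above. The class names and the choice of fields are hypothetical and for illustration only; they are not the actual _JujuContext definition.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass(frozen=True)
class FlatContext:
    """Flat layout: one attribute per environment variable (illustrative fields)."""

    juju_charm_dir: Optional[str] = None
    juju_notice_id: Optional[str] = None


@dataclass(frozen=True)
class Notice:
    """Hypothetical sub-dataclass holding the JUJU_NOTICE_* values."""

    id: Optional[str] = None


@dataclass(frozen=True)
class NestedContext:
    """Nested layout: notice values live one level down."""

    juju_charm_dir: Optional[str] = None
    notice: Notice = field(default_factory=Notice)


flat = FlatContext(juju_charm_dir='/charm', juju_notice_id='42')
nested = NestedContext(juju_charm_dir='/charm', notice=Notice(id='42'))

# Same information either way; only the access path differs.
print(flat.juju_notice_id, nested.notice.id)
```

The access paths differ by one dot, which is roughly the point made above: the nested form adds a level without adding information.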

2 Type

I put all fields as str in this draft, because:

I wanted to make some fields int at first, but after investigating, only very few fields are actually non-str, like JUJU_SECRET_REVISION, and some values are derived from the env var rather than used directly, like the Juju relation ID. So maybe it's easier to keep them as they are and handle type conversion wherever it's needed (mostly it isn't). One extra plus: we don't need special processing when the key is JUJU_SECRET_REVISION, which keeps the from_env method simple, without key-name-specific logic.
Since the whole _JujuContext dataclass comes from env vars, it's intuitive to think all fields are strings.
The source is all strings.

Some explanation of which fields are Optional and of the default values, because it may seem strange at first glance. This logic does not come from Juju but from ops.main and how we currently use the variables:

Optional: Some fields are not Optional because in main.py the current usage of them is treating them like they surely exist (os.environ[key]), so I kept them as type str instead of Optional[str].
Some fields must be Optional with None value because in the current logic there are tests like if key in os.environ.
Some fields are passed as arguments to other things and can be None, so I left all of these as Optional.
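A rough sketch of the all-str draft described above, showing how the three access patterns could map onto field types. Which variable lands in which bucket is an assumption made purely for illustration, and DraftContext is a hypothetical name, not the class in this PR.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class DraftContext:
    # Previously read as os.environ[key]: treated as always present, so plain str.
    juju_unit_name: str
    # Previously probed with "key in os.environ": Optional, None when absent.
    juju_relation: Optional[str] = None
    # Kept as a string in this draft even though the value is numeric.
    juju_secret_revision: Optional[str] = None

    @classmethod
    def from_env(cls, env: Mapping[str, str]) -> 'DraftContext':
        return cls(
            juju_unit_name=env['JUJU_UNIT_NAME'],  # KeyError if missing
            juju_relation=env.get('JUJU_RELATION'),
            juju_secret_revision=env.get('JUJU_SECRET_REVISION'),
        )


ctx = DraftContext.from_env({'JUJU_UNIT_NAME': 'app/0', 'JUJU_SECRET_REVISION': '3'})
print(ctx)
```

Note that from_env needs no key-specific logic here, which is the simplicity argument made for keeping every field a string.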

chore: poc parsing juju env in one place chore: refactor _JuJuContext to a dataclass

tonyandrewmeyer · 2024-08-13T04:30:55Z

For what it's worth, I agree that it's not worth grouping (flat is better than nested). However, I do think it's worth doing the type conversion in the class, as well as any split type logic.

I'd also suggest that the juju_ prefix for the attribute names isn't needed, since it's implied by the fact that it's in _JujuContext.

dimaqq · 2024-08-13T05:30:21Z

Consider

	// relationId identifies the relation for which a relation hook is
	// executing. If it is -1, the context is not running a relation hook;
	// otherwise, its value must be a valid key into the relations map.
	relationId int

If we're going to cast or interpret the values, we gotta handle these special cases too. In this case, I think -1 ==> None.
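A minimal sketch of what that special case could look like if the conversion lived in one helper. The helper name is hypothetical, and whether the environment variable ever actually carries -1 is an assumption here; the split on ':' mirrors the existing handling of JUJU_RELATION_ID in model.py.

```python
from typing import Optional


def parse_relation_id(raw: Optional[str]) -> Optional[int]:
    """Convert a JUJU_RELATION_ID-style value to an int, or None.

    Accepts either a bare number or a 'name:number' form, and maps the
    sentinel -1 (assumed to mean "not a relation hook") to None.
    """
    if not raw:
        return None
    value = int(raw.split(':')[-1])
    return None if value == -1 else value


print(parse_relation_id('db:7'))
print(parse_relation_id('-1'))
```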

benhoyt

Let's finalise the decision of what approach to use in the daily sync today.

Note that there are also a bunch of uses of os.environ and os.getenv in other files:

framework.py
655:        debug_at = os.environ.get('JUJU_DEBUG_AT')

jujuversion.py
104:        v = os.environ.get('JUJU_VERSION')

model.py
3234:        if 'JUJU_RELATION_ID' in os.environ and 'JUJU_REMOTE_APP' in os.environ:
3235:            event_relation_id = int(os.environ['JUJU_RELATION_ID'].split(':')[-1])

model.py
3161:        unit_name_ = unit_name or os.getenv('JUJU_UNIT_NAME')
3167:        self.model_name: str = model_name or typing.cast(str, os.getenv('JUJU_MODEL_NAME'))
3168:        self.model_uuid: str = model_uuid or typing.cast(str, os.getenv('JUJU_MODEL_UUID'))
3238:                return os.getenv('JUJU_REMOTE_APP') or None

ops/main.py

IronCore864 · 2024-08-14T06:52:58Z

After discussion, we will use a flat structure without nested objects, and we will put all fields as Optional[str] or Optional[int].

IronCore864 · 2024-08-15T09:18:43Z

Notes:

_JujuContext.from_environ(dict(os.environ)) looks weird, because strictly speaking, it's not from_environ but from_dict. The name from_environ is used to be consistent with a few other usages in the code base; a Dict is used instead of os._Environ per previous code review suggestions to make it easy to be mocked in tests.
Where to put _JujuContext: I think it makes sense to put it in main.py because it's first used there. But no matter where I put it, I can't bypass circular imports in jujuversion.py, so I left it where it's most logical and used TYPE_CHECKING to fix the import for typing checks. However, this breaks sphinx doc build. How can we solve this?

Changes:

_JujuContext:

Every field is of type Optional[str] or Optional[int] and defaults to None, according to the code review and our daily discussion.
The only two int-type fields are relation_id and secret_revision. I did not handle the case where the env var is set but its value is not a number. Should we handle this case, or is it OK to let it raise a ValueError?
A few more env vars used in model.py and JujuVersion are added, except JUJU_DEBUG_AT (used in framework.py) and OPERATOR_DISPATCH (used in main.py), because they are not in the source here and here.
_JujuContext is instantiated only once in _Manager, then passed to _Dispatcher and _ModelBackend, the latter also passes it to _Container. These classes will pass it to JujuVersion too.
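The shape described in this list might look roughly like the sketch below. It is not the code in this PR: ContextSketch and its field selection are hypothetical, and only the general approach (flat, Optional fields defaulting to None, built from a plain mapping) follows the description above.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class ContextSketch:
    relation_id: Optional[int] = None
    secret_revision: Optional[int] = None
    unit_name: Optional[str] = None

    @classmethod
    def from_environ(cls, env: Mapping[str, str]) -> 'ContextSketch':
        relation_id = env.get('JUJU_RELATION_ID')
        secret_revision = env.get('JUJU_SECRET_REVISION')
        return cls(
            # 'name:number' form, as in the existing model.py handling.
            relation_id=int(relation_id.split(':')[-1]) if relation_id else None,
            # A non-numeric value is not handled: int() raises ValueError.
            secret_revision=int(secret_revision) if secret_revision else None,
            unit_name=env.get('JUJU_UNIT_NAME') or None,
        )


ctx = ContextSketch.from_environ({'JUJU_RELATION_ID': 'db:5', 'JUJU_UNIT_NAME': 'app/0'})
print(ctx)
```

Because from_environ takes a mapping rather than reading os.environ itself, a test can pass a small dict, while production code would pass dict(os.environ).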

JujuVersion:

The original from_environ method is renamed to from_context because it's not appropriate any more. from_juju_context was also considered, but JujuVersion.from_juju_context(juju_context) seemed too verbose and duplicated, hence JujuVersion.from_context(juju_context).
The original if v is None: v = '0.0.0' logic is removed; the default value is now handled in _JujuContext.

model.py:

There were a few uses of os.environ / os.getenv in it; all are refactored to use _JujuContext.

test/*:

Add tests for JujuVersion.from_context.
Refactor tests to make them pass.
Add tests for _JujuContext.from_environ.

benhoyt

This looks like a great start, thanks. A few initial comments.

ops/jujuversion.py

ops/model.py

ops/main.py

test/test_model.py

test/test_testing.py

ops/main.py

ops/model.py

IronCore864 · 2024-08-16T08:38:36Z

db-charm-tests failed because the charm imports JujuVersion from ops.model instead of ops.jujuversion.

Issue created here, and a fixing PR submitted.

tonyandrewmeyer

Looking good! A few small comments.

I tried this out on Juju 3.6, LXD, but only with a few events, not exhaustively. Everything looked good for the ones I tried.

ops/main.py

ops/model.py

test/test_model.py

ops/jujuversion.py

benhoyt

This looks really good. Just a couple more requests for file moving (sorry for the back and forth!).

ops/jujuversion.py

ops/jujucontext.py

ops/model.py

test/test_main.py

benhoyt · 2024-08-21T01:57:13Z

Decision was to put JUJU_DEBUG_AT in _JujuContext.debug_at.

IronCore864 · 2024-08-21T15:56:45Z

All comments resolved. Major changes:

JUJU_DEBUG_AT added to ops/jujucontext.py as _JujuContext.debug_at, with a test for it; ops/framework.py and its tests are refactored accordingly. Now, when instantiating a Framework object, _JujuContext.debug_at should be passed in as a keyword argument (it is Optional for backward compatibility, because some charms' tests import ops.Framework and instantiate it directly).
ops/jujuversion.py restored.
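As an illustration of the backward-compatible keyword argument, here is a small sketch. FrameworkSketch and parse_debug_at are hypothetical stand-ins, and treating JUJU_DEBUG_AT as a comma-separated list of names is an assumption of this sketch rather than something stated in this thread.

```python
from typing import Mapping, Optional, Set


def parse_debug_at(env: Mapping[str, str]) -> Set[str]:
    """Split a comma-separated JUJU_DEBUG_AT value into a set of names (assumed format)."""
    raw = env.get('JUJU_DEBUG_AT', '')
    return {name.strip() for name in raw.split(',') if name.strip()}


class FrameworkSketch:
    """Stand-in for a framework class that no longer reads os.environ itself."""

    def __init__(self, storage: object = None, *, juju_debug_at: Optional[Set[str]] = None):
        self.storage = storage
        # Optional keyword: callers that never passed it keep working.
        self.juju_debug_at: Set[str] = juju_debug_at or set()


old_style = FrameworkSketch()
new_style = FrameworkSketch(juju_debug_at=parse_debug_at({'JUJU_DEBUG_AT': 'all, hook'}))
print(sorted(new_style.juju_debug_at))
```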

sed-i

So happy to see this PR!

sed-i · 2024-08-21T19:19:39Z

ops/jujucontext.py

+    """
+
+    @classmethod
+    def from_dict(cls, env: Mapping[str, Any]) -> '_JujuContext':


Design question:

The half-baked idea I imagined was that from_dict returns a Union of types, and the caller always uses a match expression on the result. It would imply py3.10 (ubuntu 22.04) which isn't great, but I wonder (a) if it's a feasible design goal in the first place, and (b) if we should plan the design today with match in mind.

I.e. it would be great if this new class brought significant new added value over poking into os.environ directly.

Example: https://stackoverflow.com/a/71519690

If I understand correctly, there is already some similar logic in main.py, specifically, in _get_event_args, where it tries to match the type of the event then pull related environment variables.

I think it would be great if we could somehow integrate it with JujuContext, maybe in the future we can build on top of this PR and implement that.

Regarding the scope of this PR, we had a short discussion today in the daily and we think this PR serves two purposes: 1, parse all ENV vars from a single place (previously it was scattered across main.py and framework.py); and 2, provide some unified object so that others can build on top of it if they want to implement some experiments.

So, for now, I'm sorry that there is no implementation of the event type matching thingy and we are going to merge this _JujuContext. Thanks!

The half-baked idea I imagined was that from_dict returns a Union of types, and the caller always uses a match expression on the result.

If the intention is to find out which event Juju has emitted, then it's much simpler to just look at that directly rather than trying to match against which context variables have been set. If the intention is to get back a collection of types (e.g. the same fields are set for different events: start and stop would set the same ones), then I'm not sure I really see the use case.

I don't think it's good to have a hierarchy of context objects when there's already a 1:1 mapping of what that would be in the hierarchy of event classes. A SecretChangedJujuContext would only be useful for SecretChangedEvent, for example.

As @IronCore864 mentioned, _get_event_args provides the "given a context, what are the arguments needed to create an event object". It would be simple enough for someone to subclass _JujuContext to add a to_event method (or whatever), if they wanted to build an "alt ops" package.

What would be most useful in that sort of situation, I believe, is being able to create SecretChangedEvent (and so on) objects without opting in to the whole framework/handle system (and registering, in particular). My feeling is that's what we could provide (not this cycle) to continue making it easier for people to explore alternative ways to do event handling without re-implementing everything.
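For anyone exploring the subclassing idea mentioned above, a very rough sketch follows. Every name in it (ContextSketch, SecretChangedSketch, AltContext, to_event) is hypothetical, the event object is a plain dataclass rather than a real ops event, and the event name is passed in explicitly instead of being read from Juju.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ContextSketch:
    """Stand-in for a parsed context; the field choice is illustrative."""

    secret_id: Optional[str] = None
    secret_label: Optional[str] = None


@dataclass(frozen=True)
class SecretChangedSketch:
    """Plain event object, created without any framework or handle."""

    id: str
    label: Optional[str]


class AltContext(ContextSketch):
    def to_event(self, event_name: str) -> SecretChangedSketch:
        """Build an event object from the context for a known event name."""
        if event_name == 'secret_changed':
            if self.secret_id is None:
                raise ValueError('secret_changed needs a secret ID in the context')
            return SecretChangedSketch(id=self.secret_id, label=self.secret_label)
        raise NotImplementedError(f'no mapping for event {event_name!r}')


event = AltContext(secret_id='secret:abc').to_event('secret_changed')
print(event)
```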

I.e. it would be great if this new class brought significant new added value over poking into os.environ directly.

"significant" is subjective, of course, but I think there is value in not just collecting all the uses together as @IronCore864 mentioned, but also doing all the conversion so that you have a bunch of Python objects rather than a bunch of strings. With this, plus the similar hook tool wrapping in the model/model backend (the border of those could be a bit cleaner) that's the main interface with Juju (other than Pebble).

tonyandrewmeyer

Looks great!

I'm fine with this as-is, but left a couple of small suggestions - feel free to discard those if you prefer.

test/test_framework.py

ops/framework.py

test/test_helpers.py

The next release of ops has some backwards-incompatible changes to private methods. This PR does the minimum to keep Scenario working with the current version of ops and the next release. I'll open a ticket for doing a nicer version of this where the new `_JujuContext` class is used (which would presumably mean requiring the new version of ops). But this will let people continue upgrading their ops as long as they're using the latest 6.x of Scenario. The relevant ops PR is: canonical/operator#1313

As of canonical/operator#1313 , the test harness inserts JUJU_VERSION=0.0.0 into os.environ, so we must account for this in the unit tests.

samuelallan72 · 2024-09-03T23:16:29Z

test/test_framework.py

+    if 'JUJU_VERSION' not in os.environ:
+        os.environ['JUJU_VERSION'] = '0.0.0'


@IronCore864 why does the test framework need to manipulate the environment now? The new context appears to already pick a default if JUJU_VERSION is not present.

cc @gabrielcocenza

@IronCore864 why does the test framework need to manipulate the environment now? The new context appears to already pick a default if JUJU_VERSION is not present.

Note that these are the internal tests for ops itself, not ops.testing (Harness). It seems like no one else should really care what the internal ops tests are doing. Does this actually impact you, or is it just something you noticed while running the ops tests or similar?

It's being set here because _JujuContext expects the environment variable to be set, because all supported Juju versions do so.

Does this actually impact you, or is it just something you noticed while running the ops tests or similar?

Yes, it's affecting our unit tests that are checking the arguments that were called in subprocess. See an example at canonical/charm-simple-streams#22

Ah, it's not the code linked above (your tests will never run that), it's this similar code in ops.testing.

We failed to consider this 😞, and we must not have run this change against the full set of charms we know about 😞. It would indeed be better to have Harness not change the environment. It's not immediately clear to me what we should do now: we could change the behaviour back so that it doesn't, but if everyone that this impacts has then already changed their code by the time that fix gets out, then it's just doubling up the impact (although it is cleaner in general).

I'll talk with the team about this later today.
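One way to supply a default version without touching the process environment is to apply it to a copy of the mapping before parsing. This is only a sketch of the general idea, with a hypothetical helper name; it is not what ops.testing does or did.

```python
from typing import Dict, Mapping


def env_with_default_version(env: Mapping[str, str], default: str = '0.0.0') -> Dict[str, str]:
    """Return a copy of env with JUJU_VERSION filled in; the input is left untouched."""
    merged = dict(env)
    merged.setdefault('JUJU_VERSION', default)
    return merged


original = {'PATH': '/usr/bin'}
merged = env_with_default_version(original)
print(merged['JUJU_VERSION'], 'JUJU_VERSION' in original)
```

With this approach, code that inspects os.environ (for example, tests asserting on the env passed to subprocess calls) would see no extra variable.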

FYI, we're going to do an ops 2.16.1 change that reverts this behaviour.

I confirm that now it's fixed 👍

chore: poc parsing juju env in one place

8b7eb0e

IronCore864 mentioned this pull request Aug 13, 2024

feat: poc parsing juju env in one place - alternative draft #1314

Closed

benhoyt reviewed Aug 13, 2024

ops/main.py: five review threads (outdated, resolved)

IronCore864 changed the title ~~feat: poc parsing juju env in one place~~ feat: parse JUJU_* env in one place Aug 14, 2024

IronCore864 added 9 commits August 15, 2024 08:47

feat: parse all juju env vars in one place

384fa9c

chore: add some more env vars that are used in models

4f146be

chore: update juju version and model

2f4694a

chore: update juju version to use juju context and update model

a21a204

test: fix ut

239a595

chore: refactor after self review

a296905

chore: add ut and lint

c7f7104

chore: add ut and lint

d9981e5

chore: add jujuversion from environ back

aad4788

benhoyt requested changes Aug 15, 2024

chore: remove JujuVersion.from_context

982a2f7

dimaqq reviewed Aug 16, 2024

test/test_model.py (outdated, resolved)

dimaqq reviewed Aug 16, 2024

test/test_testing.py (outdated, resolved)

tonyandrewmeyer reviewed Aug 16, 2024

chore: refactor according to discussion and code review

f477b51

chore: some final refactor after self review

aced15d

IronCore864 marked this pull request as ready for review August 19, 2024 06:20

IronCore864 requested review from benhoyt, tonyandrewmeyer and dimaqq August 19, 2024 06:20

tonyandrewmeyer approved these changes Aug 20, 2024

IronCore864 added 2 commits August 20, 2024 16:18

chore: refactor according to code review comments

e8b44d2

chore: some refactor after self review and testing

0d8fffc

benhoyt requested changes Aug 20, 2024

ops/jujuversion.py (outdated, resolved)

ops/jujucontext.py (resolved)

ops/jujucontext.py (outdated, resolved)

ops/model.py (resolved)

test/test_main.py (outdated, resolved)

tonyandrewmeyer reviewed Aug 20, 2024

test/test_main.py (outdated, resolved)

IronCore864 added 7 commits August 21, 2024 21:24

chore: moving JujuVersion back to ops/jujuversion.py

fef2b9d

chore: add juju_debug_at to _JujuContext and some refactor

4020b6f

chore: move tests for _JujuContext to a new file

1d3d07d

chore: fix tests after refactoring

a440143

chore: fix tests and backward compatibility

b9cfbdc

chore: fix tests and backward compatibility

922dffc

Merge branch 'parse-juju-env-vars' of github.com:IronCore864/operator…

9f702d1

… into parse-juju-env-vars

IronCore864 added 2 commits August 22, 2024 00:19

chore: fix tests and backward compatibility

deef8c8

Merge branch 'parse-juju-env-vars' of github.com:IronCore864/operator…

4733687

… into parse-juju-env-vars

IronCore864 requested a review from benhoyt August 21, 2024 17:11

sed-i reviewed Aug 21, 2024

benhoyt approved these changes Aug 21, 2024

benhoyt requested a review from tonyandrewmeyer August 21, 2024 21:29

tonyandrewmeyer approved these changes Aug 21, 2024

test/test_framework.py (outdated, resolved)

ops/framework.py (outdated, resolved)

test/test_helpers.py (resolved)

chore: minor refactor

ccd3acb

IronCore864 merged commit 5d74b82 into canonical:main Aug 23, 2024
32 checks passed

IronCore864 deleted the parse-juju-env-vars branch August 23, 2024 02:33

dimaqq mentioned this pull request Aug 23, 2024

test: minor tests fix to run tests without tox #1327

Closed

This was referenced Aug 26, 2024

chore: add compatibility with the next ops release canonical/ops-scenario#178

Merged

Use ops.jujucontext._JujuContext canonical/ops-scenario#179

Closed

samuelallan72 mentioned this pull request Sep 3, 2024

Fix unit tests after recent ops update canonical/charm-simple-streams#22

Closed

samuelallan72 reviewed Sep 3, 2024

sed-i Aug 21, 2024

sed-i Aug 22, 2024

IronCore864 Aug 22, 2024

tonyandrewmeyer Aug 25, 2024

tonyandrewmeyer left a comment

samuelallan72 Sep 3, 2024

tonyandrewmeyer Sep 4, 2024

gabrielcocenza Sep 4, 2024

tonyandrewmeyer Sep 4, 2024

tonyandrewmeyer Sep 5, 2024

gabrielcocenza Sep 5, 2024
