docs: updating runtime guard policy #10983

alyssawilk · 2020-04-28T19:05:33Z

Codifying that most L7 changes should be runtime guarded, updating deprecation timeline as we changed it a while back.

Risk Level: n/a
Testing: n/a
Docs Changes: yes
Release Notes: n/a

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

alyssawilk · 2020-04-28T19:07:36Z

cc @envoyproxy/maintainers

I'll say we aren't super strict about this today - for example #10957 changes connect behavior fixing bugs for HTTP/1.0 (which barely anyone is using) and where someone sends "Connection: close, foo" which is both unlikely and was buggy before.

At some point we may need to guard even the weird corner cases and bugfixes but I think we're not there yet. cc @AndresGuedez who may disagree :-P

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

mattklein123

At some point we may need to guard even the weird corner cases and bugfixes but I think we're not there yet. cc @AndresGuedez who may disagree :-P

TBH I was surprised you didn't guard those changes. Honestly, I would be fine just saying that we have to do it for all such changes at this point given the widespread usage and chance of breakage. (I mentioned to @alyssawilk offline that removing transfer-encoding: identity from HTTP/1 broke a team at Lyft.)

mattklein123 · 2020-04-28T19:22:19Z

CONTRIBUTING.md

-change).
+change). Generally as a community we try to guard both high risk changes (major
+refactors such as replacing Envoy's buffer implementation) and most user-visible
+non-config-guarded changes to HTTP processing (for example additions or changes to HTTP headers or


I would probably s/HTTP/protocol and make the thing in parenthesis an example, but would be curious to hear what others think. At a high level I think we should be config guarding any user visible non-config-guarded change for any protocol extension that is not alpha. WDYT?

Hm, I was trying to (possibly unsuccessfully) differentiate between changes like adding and removing headers, to changes like chunk coalescing or fixing infinite buffering. do you think the memory fixes for chunk encoding and the gRPC infinite buffering fix should be guarded as well? They're user-visible insofar as the user could be running a VM and consuming those chunks, and those logs. I don't object to us getting that zealous but I wasn't arguing for it yet.

If you think it's time to guard all L7 changes I can retroactively guard the chunk PR - it's a mild bummer to keep the literally buggy cruft around for 6 months but such is life for an increasingly widely used proxy. :-P

do you think the memory fixes for chunk encoding and the gRPC infinite buffering fix should be guarded as well?

No (unless we view them as high enough regression risk). I was mainly saying that we should apply the same rigor for non-HTTP protocols that are not alpha, such as Redis, etc.

If you think it's time to guard all L7 changes I can retroactively guard the chunk PR - it's a mild bummer to keep the literally buggy cruft around for 6 months but such is life for an increasingly widely used proxy. :-P

IMO we can still apply maintainer discretion, but we should probably at least have the conversation for any such change?

I'm happy to make the change you suggested - I'm trying to get clarity around "any user visible"
I tend towards "user visible changes likely to cause problems" where for HTTP header changes are often going to, things like changing hashing is definitely going to cause rollout pain, but chunking the body differently is very unlikely to be problematic.
Not sure how much we want to spell out and how much we want to say "maintainers believe will be problematic"
I am also honestly curious about the gRPC thing. I think if we limit log failure to flush size + buffer size it might be OK, but realistically we're going from never dropping to sometimes dropping and it's dicey :-P

Also changing to protocol because I agree with your overall sentiment. Might be good to have a !http example but I don't know enough about redis/kafka etc to suggest a likely to happen and user visible change. Ideas?

Not sure how much we want to spell out and how much we want to say "maintainers believe will be problematic"

Yeah agreed. I think you can soften the language however you feel is appropriate.

I am also honestly curious about the gRPC thing. I think if we limit log failure to flush size + buffer size it might be OK, but realistically we're going from never dropping to sometimes dropping and it's dicey :-P

Sorry I wasn't clear which fix you were talking about above. Yeah I agree it's dicey and could potentially benefit from a flag, though this is one of the cases that I don't feel strongly about.

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

mattklein123

LGTM modulo remaining thoughts. I can't think of a specific non-HTTP example right now but will add one if I think of it.

mattklein123 · 2020-04-28T21:13:58Z

CONTRIBUTING.md

+change). Generally as a community we try to guard both high risk changes (major
+refactors such as replacing Envoy's buffer implementation) and most user-visible
+non-config-guarded changes to protocol processing (for example additions or changes to HTTP headers or
+how HTTP is serialized out) for non-alpha features.


A couple of other thoughts:

Should we have some blurb about maintainer discretion / consulting maintainers as needed?

Should we update the PR template with a "feature flag / runtime" field to get people thinking about this? It wouldn't be filled out that often but it would require people to add "N/A" explicitly? WDYT?

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

alyssawilk · 2020-04-29T12:58:29Z

How about having it optional for now to have folks thinking about it but not requiring n/a on the bulk of PRs?

mattklein123 · 2020-04-29T17:44:37Z

How about having it optional for now to have folks thinking about it but not requiring n/a on the bulk of PRs?

Sure happy to start there for sure.

mattklein123

LGTM with small comment, thank you!

/wait

mattklein123 · 2020-04-29T21:15:11Z

PULL_REQUEST_TEMPLATE.md

@@ -7,5 +7,6 @@ Risk Level:
 Testing:
 Docs Changes:
 Release Notes:
+[Optional Runtime guard:]


Document more fully in https://github.com/envoyproxy/envoy/blob/master/PULL_REQUESTS.md with links to the contributing guide, etc.?

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

mattklein123

Awesome, thanks.

docs: updating runtime guard policy

4d0bc49

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

not all changes are created equal

924f546

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

mattklein123 reviewed Apr 28, 2020

View reviewed changes

reviewer comments

2f5a5da

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

mattklein123 reviewed Apr 28, 2020

View reviewed changes

mattklein123 self-assigned this Apr 28, 2020

mattklein123 added the waiting:any label Apr 28, 2020

mattklein123 mentioned this pull request Apr 28, 2020

http codecs: considering forking #10988

Closed

reviewer comments

d28eacf

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

repokitteh-read-only bot removed the waiting:any label Apr 29, 2020

mattklein123 requested changes Apr 29, 2020

View reviewed changes

repokitteh-read-only bot added the waiting label Apr 29, 2020

reviewer comments

b81cdf8

Signed-off-by: Alyssa Wilk <alyssar@chromium.org>

repokitteh-read-only bot removed the waiting label Apr 30, 2020

mattklein123 approved these changes Apr 30, 2020

View reviewed changes

mattklein123 merged commit fd9325d into envoyproxy:master Apr 30, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: updating runtime guard policy #10983

docs: updating runtime guard policy #10983

alyssawilk commented Apr 28, 2020

alyssawilk commented Apr 28, 2020

mattklein123 left a comment

mattklein123 Apr 28, 2020

alyssawilk Apr 28, 2020

mattklein123 Apr 28, 2020

alyssawilk Apr 28, 2020

alyssawilk Apr 28, 2020

mattklein123 Apr 28, 2020

mattklein123 left a comment

mattklein123 Apr 28, 2020

alyssawilk commented Apr 29, 2020

mattklein123 commented Apr 29, 2020

mattklein123 left a comment

mattklein123 Apr 29, 2020

mattklein123 left a comment

docs: updating runtime guard policy #10983

docs: updating runtime guard policy #10983

Conversation

alyssawilk commented Apr 28, 2020

alyssawilk commented Apr 28, 2020

mattklein123 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mattklein123 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alyssawilk commented Apr 29, 2020

mattklein123 commented Apr 29, 2020

mattklein123 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mattklein123 left a comment

Choose a reason for hiding this comment