beresp 304 header merging makes impossible to distinguish between headers set by varnish and backend #3102

nigoroll · 2019-10-21T09:08:28Z

Reported by @martin-uplex in https://varnish-cache.org/lists/pipermail/varnish-dev/2019-October/009469.html

Basically, the way we merge headers for 304 responses makes it impossible to distinguish headers originating from the backend from ones which we set in VCL (for the previous response).

bsdphk · 2019-10-21T11:08:42Z

Just to make sure I understand the scenario: The Cache-Control: nocache is created in VCL ?

bsdphk · 2019-10-21T11:10:53Z

We talked at one point about 304 responses getting their own dedicated VCL method, where both beresp.* (RW) and obj.* (RO) were available.

I wonder how much of the current C-lang 304 logic we could implement in builtin_vcl if we did that ?

martin-uplex · 2019-10-21T12:01:16Z

Just to make sure I understand the scenario: The Cache-Control: nocache is created in VCL ?

Yes, correct.

nigoroll · 2019-10-21T12:05:43Z

@bsdphk yes to the first question: The scenario is to modify the headers of the cache-object with respect to the downstream behavior, but have a differing ttl/cacheable status.
And yes, I think that an additional VCL method would help and IIUC this was my preferred option when we initially discussed the 304 implementation.

bsdphk · 2019-10-21T16:18:29Z

But didnt we find out back then that we would still need C-magic to make cond-fetch work ?

nigoroll · 2019-10-21T16:54:45Z

Discussed on IRC: I think this model should work:

before vcl_backend_response {} , for a 304, call vcl_backend_refresh {} with beresp.* (r/w) and obj.* (r/o)
builtin.vcl would just contain sub vcl_backend_refresh { return(deliver); }
the header merge, as exists now, would run after vcl_backend_refresh {}:
- beresp. headers always replace obj. headers
304-aware code would basically do almost all of the work (except deletion, see below) in vcl_backend_refresh {} and skip most of the processing in vcl_backend_response {} based on beresp.was_304
The only special case (which I see) would be header deletion, which requires cooperation between vcl_backend_refresh {} and vcl_backend_response {}: _refresh would set some marker (e.g. in a header or variable) and _response would do the actual deletion.

Unless I miss something, this solution should be 100% compatible with existing VCL and allow for advanced handling of 304s only where needed.

If there was a need, we could facilitate header deletion also by marking headers not to be merged (as a vmod function in tandem with some core code addition).

@martin-uplex can you please review if this suggestion would work for you?

martin-uplex · 2019-10-22T10:40:31Z

discussed with @nigoroll, he will follow up on this

suggestion in short: no vcl_backend_refresh, but make obj.* (r/o) available in vcl_backend_response, filled with same values as beresp.* if not 304

dridi · 2019-10-22T14:07:15Z

sub vcl_backend_revalidate {
    if (beresp.backend.name ~ "legacy") {
        return (replace);
    }
}

# built-in

sub vcl_backend_revalidate {
    beresp.merge(obj);
    return (reuse);
}

I have been confronted to misbehaving backends in the past, being able to ignore them would be a plus. You can already ignore them in v_b_response by removing condition headers. Nit-picking on the "v_b_refresh" name, and especially the return (deliver) transition if it's meant to lead to v_b_response.

nigoroll · 2019-10-22T14:13:12Z

@dridi how would you handle a misbehaving backend when you already have a 304 response (with no body)? I do not see how return (replace) vs. return (reuse) would work, if that is referring to the body.

regarding the header merge, making it explicit would be another option, which would also facilitate the header deletion, yes.

dridi · 2019-10-22T15:24:33Z

Right, forget that part of my suggestion...

slimhazard · 2019-10-22T16:34:59Z

Qs about beresp.merge(obj):

Can it be called with any other object besides obj? If so, then what other object could make sense as the argument? If not, then wouldn't it be better if there is no argument? Say beresp.merge() or beresp.merge_obj()?
What happens if it doesn't get called? Then varnishd does the default merging? Or does that mean you don't want the merge? In the latter case, the documentation should probably point out that the HTTP standard has some requirements about merging headers of the object to be validated with headers from the validating response, so it's a "use at your own risk" option. I would assume then that the default in builtin.vcl would be to call the merge.

dridi · 2019-10-22T18:03:36Z

Can it be called with any other object besides obj?

Yes, it can be called in a different context with different arguments.

sub vcl_init {
    my_replicator.request(...).merge(req);
    my_replicator.send();
    # where the request() function returns an HTTP
}

What happens if it doesn't get called?

My suggestion to move (and expose) this to VCL introduces a risk. Much like I can break a bereq or a resp today by messing with it in pure VCL code. I'm definitely leaning towards the "use at your own risk" side.

hermunn · 2019-10-23T08:27:28Z

* What happens if it doesn't get called?

We have to choose if the default behavior is to remain in the core or be moved to the builtin VCL. In my humble opinion, if we introduce a .merge(), then we should not do any merging in the core, but do it in VCL.

On a return (reuse) we have the option of stealing the body and headers, RFC be damned.

... the documentation should probably point out that the HTTP standard has some requirements about merging headers of the object to be validated with headers from the validating response, so it's a "use at your own risk" option.

Yeah, this is true, and for many app writers, not well understood. Having the option to amend that in VCL is a natural request.

dridi · 2019-10-23T08:34:27Z

One use case we might also consider is not merging surrogate keys for example:

sub vcl_backend_revalidate {
    if (beresp.http.xkey) {
        # don't keep track of stale surrogate keys
        set beresp.http.new_xkey = beresp.http.xkey;
        beresp.merge(obj);
        set beresp.http.xkey = beresp.http.new_xkey;
        unset beresp.http.new_xkey;
        return (TBD); # my reuse vs replace transition didn't make sense
    }
}

hermunn · 2019-10-23T08:42:33Z

        set beresp.http.new_xkey = beresp.http.xkey;

Well, a std.collect is in order here, and this illustrates a maybe sore point in all of this - Varnish does not work great with repeated headers.

dridi · 2019-10-23T08:46:43Z

set beresp.http.new_xkey = beresp.http.xkey.collect();

# or

beresp.http.new_xkey.clone(beresp.http.xkey);

Now that we have type properties and methods, we can make vmod_header redundant by enhancing the VCL_HEADER and VCL_HTTP types.

nigoroll · 2019-12-23T14:09:47Z

bugwash: @dridi and myself agree that sub vcl_backend_refresh preferred

nigoroll · 2020-02-07T15:28:26Z

Discussed with @dridi and @bsdphk

We agree that a new sub vcl_backend_refresh is the best option
- return values: ok/retry/abandon/fail
The 304 handling should move to vcl, by default calling merge_304;

I will prepare some mock up vcl illustrating how the relevant use cases will look like

@martin-uplex es case
Etag mangling for (un)gzip-ing
Handle changing Content-Encoding for inconsistent backends? #3169

Also:

read https://tools.ietf.org/html/draft-ietf-httpbis-semantics-06

nigoroll · 2020-03-22T12:20:26Z

FYI, working on the promised VCL mockup proposal made me realize that not even the current master version of the cache rfc can possibly work:

For each stored response identified for update, the cache MUST use the header fields provided in the 304 (Not Modified) response to replace all instances of the corresponding header fields in the stored response.

As noticed here and in #3169, if origin servers wrongly update some headers (like Content-Length, Content-Encoding or maybe Content-Type), bad things might happen.

The corresponding discussion seems to take place in httpwg/http-core#165 and the latest sensible proposal seems to be httpwg/http-core#337

I will follow this proposal and maybe participate in the discussion if need to.

nigoroll · 2020-04-17T12:14:03Z

TODO also: consider https://cache-tests.fyi/#conditional when writing the VIP

dridi · 2023-09-27T12:08:22Z

Bugwash suggestion:

# built-in
sub vcl_backend_refresh {
        return (merge);
}

In this subroutine we have access to beresp (read-write) and obj_stale (read-only) and the available transitions are:

merge (merge obj_stale with beresp)
replace (take beresp as-is)
fail
abandon
error?

nigoroll · 2023-09-27T13:01:06Z

VDD CONSENSUS:

1.Do not add new functionality unless an implementor cannot complete
a real application without it.

vcl_backend_refresh
beresp r/w access
obj_stale r/o access
return (merge, obj_stale, beresp, fail, abandon, error, retry);

merge: existing logic

change status to 200 if beresp.status == 304
header merge logic as is

obj_stale: undo the result of the sub, just copy everything from obj_stale

beresp: use beresp as it is (including status)

beresp.was_304 is going to stay true

nigoroll added the a=need bugwash label Oct 21, 2019

bsdphk added the a=feedback please label Oct 21, 2019

nigoroll assigned martin-uplex Oct 21, 2019

dridi mentioned this issue Oct 23, 2019

RFC: handle non-symbolic HTTP header names in pure VCL #3103

Closed

nigoroll mentioned this issue Dec 23, 2019

Handle changing Content-Encoding for inconsistent backends? #3169

Closed

nigoroll assigned nigoroll and unassigned martin-uplex Dec 23, 2019

bsdphk added a=NextVCL things which need a VCL bump r=7.0 labels Feb 24, 2020

This was referenced Mar 23, 2020

RFC: sub vcl_lookup #3259

Closed

RFC: log implicitly filtered headers #3260

Closed

nigoroll removed the a=need bugwash label Mar 30, 2020

dridi mentioned this issue Sep 28, 2021

8.0 Compliance improvements #3246

Open

dridi assigned walid-git Sep 27, 2023

walid-git mentioned this issue Oct 6, 2023

New vcl_backend_refresh method #3994

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

beresp 304 header merging makes impossible to distinguish between headers set by varnish and backend #3102

beresp 304 header merging makes impossible to distinguish between headers set by varnish and backend #3102

nigoroll commented Oct 21, 2019 •

edited

Loading

bsdphk commented Oct 21, 2019

bsdphk commented Oct 21, 2019

martin-uplex commented Oct 21, 2019

nigoroll commented Oct 21, 2019 •

edited

Loading

bsdphk commented Oct 21, 2019

nigoroll commented Oct 21, 2019 •

edited

Loading

martin-uplex commented Oct 22, 2019

dridi commented Oct 22, 2019

nigoroll commented Oct 22, 2019

dridi commented Oct 22, 2019

slimhazard commented Oct 22, 2019

dridi commented Oct 22, 2019

hermunn commented Oct 23, 2019

dridi commented Oct 23, 2019

hermunn commented Oct 23, 2019

dridi commented Oct 23, 2019

nigoroll commented Dec 23, 2019

nigoroll commented Feb 7, 2020

nigoroll commented Mar 22, 2020

nigoroll commented Apr 17, 2020

dridi commented Sep 27, 2023 •

edited

Loading

nigoroll commented Sep 27, 2023 •

edited

Loading

beresp 304 header merging makes impossible to distinguish between headers set by varnish and backend #3102

beresp 304 header merging makes impossible to distinguish between headers set by varnish and backend #3102

Comments

nigoroll commented Oct 21, 2019 • edited Loading

bsdphk commented Oct 21, 2019

bsdphk commented Oct 21, 2019

martin-uplex commented Oct 21, 2019

nigoroll commented Oct 21, 2019 • edited Loading

bsdphk commented Oct 21, 2019

nigoroll commented Oct 21, 2019 • edited Loading

martin-uplex commented Oct 22, 2019

dridi commented Oct 22, 2019

nigoroll commented Oct 22, 2019

dridi commented Oct 22, 2019

slimhazard commented Oct 22, 2019

dridi commented Oct 22, 2019

hermunn commented Oct 23, 2019

dridi commented Oct 23, 2019

hermunn commented Oct 23, 2019

dridi commented Oct 23, 2019

nigoroll commented Dec 23, 2019

nigoroll commented Feb 7, 2020

nigoroll commented Mar 22, 2020

nigoroll commented Apr 17, 2020

dridi commented Sep 27, 2023 • edited Loading

nigoroll commented Sep 27, 2023 • edited Loading

nigoroll commented Oct 21, 2019 •

edited

Loading

nigoroll commented Oct 21, 2019 •

edited

Loading

nigoroll commented Oct 21, 2019 •

edited

Loading

dridi commented Sep 27, 2023 •

edited

Loading

nigoroll commented Sep 27, 2023 •

edited

Loading