Add connect retries #1196

florimondmanca · 2020-08-19T08:03:24Z

Closes #1141

Adds off-by-default "retry for connect errors" behavior, used as httpx.Client(..., retries=<int>).

Makes sure that only ConnectError/ConnectTimeout are retried on, and that we only retry on idempotent HTTP verbs (i.e. exclude those that by definition should not be retried on because it could duplicate state changes on the server side).
httpx.request(<method>, connect_retries=<int>) is not included, since I'm not sure we want this knob in the high-level API.
The httpx.Client(retries=<int>) API is a good start to any future enhancements to retry functionality, such as an httpx.Retry class for encapsulating retries configuration, or providing a custom retries API, while preserving "retries=<int> retries on connect errors" for the most basic use cases.

Rendered docs preview:

tomchristie · 2020-08-19T10:32:24Z

Fab. 👍✨

Thoughts...

I guess we should probably use the inverse form instead of if request.method in ("POST", "PATCH"), eg. if request.method in IDEMPOTENT_METHODS with IDEMPOTENT_METHODS = ["GET", "HEAD", "PUT", "DELETE", "OPTIONS", "TRACE"], so that requests with nonstandard methods are not treated as idempotent.
Wondering if we ought to be keeping the API as narrow as possible, and initially only supporting retries: int = 0? Rather than exposing httpx.Retries() and backoff_factor. Pushing back on the API surface area as long as possible is generally a good idea.
It'd be good to review requests/urllib3 behaviour here, and just double check that we're matching up with requests' behaviour when installing an adapter with retries=<int>. Any working through of that would be great.

tomchristie · 2020-08-19T10:39:59Z

The requests docs give...

max_retries – The maximum number of retries each connection should attempt. Note, this applies only to failed DNS lookups, socket connections and connection timeouts, never to requests where data has made it to the server. By default, Requests does not retry failed connections. If you need granular control over the conditions under which we retry a request, import urllib3’s Retry class and pass that instead.

However, it looks like if retries != 0 then Retry.from_int(...) is used...

https://github.com/psf/requests/blob/48237afd9d064c639b9c2bcea7e75ab7b717b181/requests/adapters.py#L116-L119

Which I think means you'll end up with read=None on the class (Not the same as read=False), and that read retries will be used... https://github.com/urllib3/urllib3/blob/f0c43419e45db17a3c5f287be80aa0d14e2f58c6/src/urllib3/util/retry.py#L243

So I think the actual behaviour that requests uses isn't quite as documented, and is actually...

ConnectError and ConnectTimeout always retry if retries=<positive int>.
ReadError retries if the method is idempotent and retries=<positive int>.

?

florimondmanca · 2020-08-29T19:09:51Z

Based on #1141 (comment), I updated this PR to solely focus on adding support for httpx.Client(connect_retries=<int>).

@tomchristie Should be ready for a new round of reviews…

httpx/_client.py

tomchristie

This is really nicely done.

I think this probably strikes the right balance for us at the moment.
One comment for us to think about around the parameter naming.

Plus I guess we'd want the docs in before we merge it?

But yeah I think this is pretty fab. 👍👍👍

florimondmanca · 2020-09-01T21:16:11Z

@tomchristie Changed the connect_retries naming, and added some docs. :-) Leaving this open for a bit in case you or anyone else has any comments on the docs, or remaining comments on the implementation.

httpx/_utils.py

httpx/_client.py

florimondmanca · 2020-09-04T21:05:52Z

@tomchristie Addressed latest feedback — we should be good to go on this now? 😄

tests/client/test_retries.py

tomchristie · 2020-09-10T10:45:13Z

Okay, so I was leaving this on pause for a bit to try to figure out if connection retries really make sense on the client on at the transport layer, and I think we really do want this on the transport layer, wrapping as close to the connection opening as possible.

The reason here is that if we pull connection retries out to the client layer, then we can end up in a gnarly situation when there's high contention on the connection pool. With connection retries out at the client layer an incoming request with a retry will:

Wait to acquire the max connections semaphore.
Attempt a connection and fail.
Release the max connections semaphore.
Wait to acquire the max connections semaphore.
Attempt a connection and fail.
Release the max connections semaphore.

We'd really prefer a behaviour where once the semaphore is acquired, the connection will perform any connect+retry within that scope, so...

Wait to acquire the max connections semaphore.
Attempt a connection and fail.
Attempt a connection and fail.
Release the max connections semaphore.

This means much less thrashing when there's high contention.
Also supposing we have a timeout configuration like... httpx.Timeout(5.0, pool=60.0), and retries=2, then the maximum possible timeout is a more expected 60+5+5+5, rather than a possibly surprising 60+60+60+5+5+5.

Anyways, upshot of this is that I think? I'm keen on us pushing the implementation details of this into the transport layer. Possibly by passing retries as part of the __init__ configuration, and handling the retry behaviour within the Connection._open_socket method?... https://github.com/encode/httpcore/blob/92d3e9bbd3c442c6035860eea9e15a2249e0cad8/httpcore/_async/connection.py#L99

florimondmanca · 2020-09-10T10:50:12Z

@tomchristie The motivations wrt timeouts and pooling are strong, indeed. Thoughts on my reservations related to testing here? #1141 (comment) On HTTPCore we can't just "swap out a transport", so I'm not sure how we'd test this w/o monkeypatch since it would be right into the socket opening logic…

Hold

tomchristie · 2020-09-10T11:39:44Z

Thoughts on my reservations related to testing here? #1141 (comment) On HTTPCore we can't just "swap out a transport", so I'm not sure how we'd test this w/o monkeypatch since it would be right into the socket opening logic…

Ah really great point yup.

This is kinda the point at which we need to be digging into exposing backend=<BackendClass>, and having tests that run against mock backends, so that we can do stuff like testing connection retries, and all sorts of other in-detail bits.

florimondmanca · 2020-10-10T06:56:10Z

Opened encode/httpcore#221 against HTTPCore, closing this one for now!

florimondmanca added 4 commits August 17, 2020 12:50

Add connection retries

7c9e8fc

Merge branch 'master' into fm/retries

0d2146c

Update tests

66986fa

Lint

612c64f

florimondmanca added 2 commits August 29, 2020 20:57

Merge branch 'master' into fm/retries

0e51223

Rework towards Client(connect_retries=<int>)

61b148a

florimondmanca force-pushed the fm/retries branch from 423b738 to 61b148a Compare August 29, 2020 19:09

florimondmanca requested a review from a team August 29, 2020 19:10

florimondmanca added 2 commits August 29, 2020 21:12

Prefer "method not in IDEMPOTENT_METHODS"

6dc659b

Prefer httpx.Client(), ensure test coverage

8ccdac3

florimondmanca changed the title ~~Add retries~~ Add connect retries Aug 29, 2020

Tweak url destructuring

8c56ad2

tomchristie added the enhancement New feature or request label Sep 1, 2020

tomchristie reviewed Sep 1, 2020

View reviewed changes

httpx/_client.py Outdated Show resolved Hide resolved

tomchristie previously approved these changes Sep 1, 2020

View reviewed changes

tomchristie and others added 4 commits September 1, 2020 16:11

Merge branch 'master' into fm/retries

53c758d

Rename connect_retries as retries

e7fe547

Add docs

24e974d

Merge branch 'master' into fm/retries

824643a

tomchristie mentioned this pull request Sep 2, 2020

Version 0.14.3 #1247

Merged

Merge branch 'master' into fm/retries

92bcfdd

tomchristie reviewed Sep 2, 2020

View reviewed changes

httpx/_utils.py Show resolved Hide resolved

tomchristie reviewed Sep 2, 2020

View reviewed changes

httpx/_client.py Outdated Show resolved Hide resolved

florimondmanca added 3 commits September 4, 2020 22:56

Merge branch 'master' into fm/retries

c432233

No need to care about idempotent methods on connect

7c93bd7

Anticipate curio support in HTTPCore

a1413df

tomchristie reviewed Sep 4, 2020

View reviewed changes

tests/client/test_retries.py Outdated Show resolved Hide resolved

florimondmanca added 3 commits September 4, 2020 23:23

Remove unrelated newline change

a25aae3

Merge branch 'master' into fm/retries

f507163

Move exponential_backoff test to test_utils.py

d6c236b

florimondmanca requested a review from tomchristie September 4, 2020 21:59

florimondmanca mentioned this pull request Oct 10, 2020

Add connect retries encode/httpcore#221

Merged

florimondmanca closed this Oct 10, 2020

florimondmanca deleted the fm/retries branch October 10, 2020 06:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add connect retries #1196

Add connect retries #1196

florimondmanca commented Aug 19, 2020 •

edited

Loading

tomchristie commented Aug 19, 2020

tomchristie commented Aug 19, 2020

florimondmanca commented Aug 29, 2020 •

edited

Loading

tomchristie left a comment

florimondmanca commented Sep 1, 2020

florimondmanca commented Sep 4, 2020

tomchristie commented Sep 10, 2020

florimondmanca commented Sep 10, 2020

tomchristie commented Sep 10, 2020

florimondmanca commented Oct 10, 2020

Add connect retries #1196

Add connect retries #1196

Conversation

florimondmanca commented Aug 19, 2020 • edited Loading

tomchristie commented Aug 19, 2020

tomchristie commented Aug 19, 2020

florimondmanca commented Aug 29, 2020 • edited Loading

tomchristie left a comment

Choose a reason for hiding this comment

florimondmanca commented Sep 1, 2020

florimondmanca commented Sep 4, 2020

tomchristie commented Sep 10, 2020

florimondmanca commented Sep 10, 2020

tomchristie commented Sep 10, 2020

florimondmanca commented Oct 10, 2020

florimondmanca commented Aug 19, 2020 •

edited

Loading

florimondmanca commented Aug 29, 2020 •

edited

Loading