Fixes #1165 by having threads wait for any outstanding connect to finish. #1170
Conversation
Also noticed that we didn't bump the ref count in the race condition case, so those might have behaved strangely in some rare cases. Pushed a fix for that.
Force-pushed from 49a3022 to 5524b95.
Force-pushed from 5524b95 to 8bca3eb.
That last change makes it so only one thread attempts the connect and everybody else goes based on that. I'll let that bake for a little while (it's more tricky, imho, but not too bad).
}()

c, err := p.getNewConn(dc, addr, version)
if err != nil {
Super minor, but can we move the code in the defer to after getNewConn, and delete the limiter entry and add the pool entry inside the same critical section? Just to save a defer and another round of locking.
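A minimal sketch of what that restructuring might look like (assuming the defer's cleanup was deleting the limiter entry and signalling the wait channel; p.pool, the embedded lock on p, and close(wait) are assumptions here, not the actual patch):

```go
// Sketch only: drop the defer and do the limiter cleanup plus the pool
// insert in a single critical section after getNewConn returns.
c, err := p.getNewConn(dc, addr, version)

p.Lock()
delete(p.limiter, addr.String()) // cleanup assumed to have lived in the defer
close(wait)                      // wake the goroutines waiting on this address
if err != nil {
	p.Unlock()
	return nil, err
}
p.pool[addr.String()] = c // publish the new connection in the same section
p.Unlock()
return c, nil
```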
var wait chan struct{}
var ok bool
if wait, ok = p.limiter[addr.String()]; !ok {
	wait = make(chan struct{}, 1)
We probably don't need to buffer it, since we just close it.
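For reference, a small standalone illustration (not part of the patch) of why the buffer isn't needed when the channel is only ever closed: close wakes every blocked receiver, so make(chan struct{}) behaves the same as make(chan struct{}, 1) here.

```go
package main

import (
	"fmt"
	"sync"
)

func main() {
	// Unbuffered is enough when the channel is only ever closed: close()
	// releases every goroutine blocked on a receive at once.
	wait := make(chan struct{})

	var wg sync.WaitGroup
	for i := 0; i < 3; i++ {
		wg.Add(1)
		go func(id int) {
			defer wg.Done()
			<-wait // blocks until close(wait)
			fmt.Printf("waiter %d released\n", id)
		}(i)
	}

	close(wait) // broadcast to all waiters
	wg.Wait()
}
```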
LGTM!
Fixes #1165 by having threads wait for any outstanding connect to finish.
This creates a per-address map of channels that throttles connection requests so that only one connect attempt is in flight per address at a time. The other threads then quickly grab the newly-created connection one at a time and use it.
If the connection they were waiting for fails, they will all try to make a new connection, and the race condition logic for that case was preserved. This seems like the best thing to do in order to move forward, since they all could have been hanging around waiting for that one connection to go. They essentially revert to the behavior we have today, but only in the case where there's network trouble.
I was trying to find a simpler way to handle the case where the connection they are waiting on fails - we could fail all the others immediately but I think the code would be a little more complicated. Thundering a herd at a dead guy seems pretty low-impact.
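To make the approach concrete, here is a self-contained toy version of the pattern described above; the Pool and dial names and the string "connections" are stand-ins for illustration, not the actual pool code in this PR.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// Pool is a simplified stand-in for the real connection pool. It keeps one
// channel per address in limiter so only one connect attempt runs at a time
// per address; everyone else waits on that channel.
type Pool struct {
	sync.Mutex
	pool    map[string]string        // addr -> established "connection"
	limiter map[string]chan struct{} // addr -> signal for the in-flight connect
}

func (p *Pool) get(addr string) (string, error) {
	p.Lock()
	if c, ok := p.pool[addr]; ok {
		p.Unlock()
		return c, nil
	}
	wait, inFlight := p.limiter[addr]
	if !inFlight {
		wait = make(chan struct{})
		p.limiter[addr] = wait
	}
	p.Unlock()

	if !inFlight {
		// Lead thread: do the slow connect, publish the result, and wake
		// everyone queued up behind us, all in one critical section.
		c, err := dial(addr)
		p.Lock()
		delete(p.limiter, addr)
		if err == nil {
			p.pool[addr] = c
		}
		close(wait)
		p.Unlock()
		return c, err
	}

	// Everyone else waits for the lead thread, then re-checks the pool. If
	// the connect failed, fall back to dialing directly (the "thundering
	// herd at a dead host" case from the description).
	<-wait
	p.Lock()
	c, ok := p.pool[addr]
	p.Unlock()
	if ok {
		return c, nil
	}
	return dial(addr)
}

func dial(addr string) (string, error) {
	time.Sleep(50 * time.Millisecond) // simulate a slow connect
	return "conn-to-" + addr, nil
}

func main() {
	p := &Pool{pool: map[string]string{}, limiter: map[string]chan struct{}{}}
	var wg sync.WaitGroup
	for i := 0; i < 5; i++ {
		wg.Add(1)
		go func(id int) {
			defer wg.Done()
			c, _ := p.get("10.0.0.1:8300")
			fmt.Printf("goroutine %d got %s\n", id, c)
		}(i)
	}
	wg.Wait()
}
```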