flaky examples/relay tests #1158

marten-seemann · 2021-08-16T17:17:15Z

 === RUN   TestMain
      logharness.go:101: saw: Okay, no connection from h1 to h3: no addresses
      logharness.go:106: did not see expected prefix "Meow! It worked!"
  --- FAIL: TestMain (4.54s)

The text was updated successfully, but these errors were encountered:

marten-seemann · 2021-10-04T09:05:57Z

The problem here is the following:
h1 and h3 connect to h2 (the relay). As soon as both of them have established a connection via to the relay, they try to establish a connection between each other via the relay. This is racy, as both h1 and h3 might dial using multiple transports, (temporarily) resulting in the establishment of multiple connection. Spurious connections will then be closed by the respective hosts. Now it might happen that the connection attempt between h1 and h3 happens before the relay learns which connection is closed.

Stebalien · 2021-10-04T22:45:55Z

Really? IIRC, we don't close these connections anymore.

Stebalien · 2021-10-04T22:51:07Z

Or is this because we're canceling outbound dials once one dial succeeds? I believe we can fix this in

go-libp2p/p2p/host/basic/basic_host.go

Line 602 in e70d7cf

case <-h.ids.IdentifyWait(s.Conn()):

by trying to create a new stream if identify fails... maybe?

marten-seemann · 2021-10-05T14:16:22Z

Or is this because we're canceling outbound dials once one dial succeeds?

Yes: https://github.com/libp2p/go-libp2p-swarm/blob/88ef86a16cf0bbc451c3799f63278d345aaf56ea/limiter.go#L227-L234

by trying to create a new stream if identify fails... maybe?

Not sure if I understand your proposal. h2 is creating a stream to h3 (for relaying the connection from h1 to h3). Where does identify come into play here?

Stebalien · 2021-10-05T21:10:30Z

Well, it's kind of a hack, but it's not too terrible. Basically, if peer A successfully "identifies" peer B, peer B must have yielded the connection to the host and therefore won't cancel it.

marten-seemann added the kind/bug A bug in existing code (including security flaws) label Aug 16, 2021

Stebalien changed the title ~~flaky TestMain~~ flaky examples/relay tests Aug 17, 2021

marten-seemann mentioned this issue Oct 4, 2021

disable flaky relay example test on CI #1219

Merged

marten-seemann closed this as completed in #1219 Feb 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

flaky examples/relay tests #1158

flaky examples/relay tests #1158

marten-seemann commented Aug 16, 2021

marten-seemann commented Oct 4, 2021

Stebalien commented Oct 4, 2021

Stebalien commented Oct 4, 2021

marten-seemann commented Oct 5, 2021 •

edited

Loading

Stebalien commented Oct 5, 2021

flaky examples/relay tests #1158

flaky examples/relay tests #1158

Comments

marten-seemann commented Aug 16, 2021

marten-seemann commented Oct 4, 2021

Stebalien commented Oct 4, 2021

Stebalien commented Oct 4, 2021

marten-seemann commented Oct 5, 2021 • edited Loading

Stebalien commented Oct 5, 2021

marten-seemann commented Oct 5, 2021 •

edited

Loading