Support timeouts on Request/Response protocol level #1345

xgreenx · 2023-09-05T19:31:56Z

Problem description

Any error (including a timeout) that occurs during the request-response protocol is ignored. When such an error occurs, the parts of the blockchain that await the response are paralyzed. In the case of fuel-core-sync, this can stall synchronization. In other cases, it can consume resources forever.

Solution

Instead of silently removing the channel associated with a request_id, we need to send either an error or None. This lets other parts of the project handle the failure and retry the request (possibly with another peer).
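A minimal sketch of the idea, using only the standard library. The names `PendingRequests`, `RequestError`, `register` and `complete` are hypothetical and chosen for illustration; they are not taken from fuel-core, whose real implementation is asynchronous and built on libp2p.

```rust
use std::collections::HashMap;
use std::sync::mpsc::{channel, Receiver, Sender};

/// Hypothetical error type; the real one in fuel-core may differ.
#[derive(Debug, Clone, PartialEq, Eq)]
pub enum RequestError {
    Timeout,
    ConnectionClosed,
}

pub type RequestId = u64;
/// What the waiting side receives: the raw response bytes or an error.
pub type Outcome = Result<Vec<u8>, RequestError>;

/// Tracks the channel that belongs to each in-flight request.
#[derive(Default)]
pub struct PendingRequests {
    next_id: RequestId,
    channels: HashMap<RequestId, Sender<Outcome>>,
}

impl PendingRequests {
    /// Registers a new request and returns its id plus the receiving end.
    pub fn register(&mut self) -> (RequestId, Receiver<Outcome>) {
        let (tx, rx) = channel();
        let id = self.next_id;
        self.next_id += 1;
        self.channels.insert(id, tx);
        (id, rx)
    }

    /// Finishes a request. The channel is removed, but the outcome
    /// (success or error) is always forwarded to the waiter first, so a
    /// failure is never dropped silently. Returns `false` if the id is
    /// unknown or the waiter has already gone away.
    pub fn complete(&mut self, id: RequestId, outcome: Outcome) -> bool {
        match self.channels.remove(&id) {
            Some(tx) => tx.send(outcome).is_ok(),
            None => false,
        }
    }
}
```

With this shape, a failure is reported through the same path as a success: calling `complete(id, Err(RequestError::Timeout))` wakes the waiter with an error it can react to.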

Implementation details

Sending an error requires modifying the response type, which triggers cascading changes everywhere a request is started. That is useful: it lets us check every affected call site and make sure that it handles an error response correctly and at least has retry logic (or any other logic that ensures the node is in a "fine" state after the request).
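As an illustration of the retry logic a call site might need once responses can carry an error, here is a small synchronous sketch. `request_with_retry` and `RequestError` are hypothetical names, and the `send` closure stands in for whatever actually performs the request.

```rust
/// Hypothetical error type carried by a failed response.
#[derive(Debug, Clone, PartialEq, Eq)]
pub enum RequestError {
    Timeout,
    InvalidResponse,
}

/// Tries each peer in order and returns the first successful response.
/// If every peer fails, the per-peer errors are returned so the caller
/// can decide what to do next (for example, penalize those peers).
pub fn request_with_retry<P: Copy, T>(
    peers: &[P],
    mut send: impl FnMut(P) -> Result<T, RequestError>,
) -> Result<T, Vec<(P, RequestError)>> {
    let mut failures = Vec::new();
    for &peer in peers {
        match send(peer) {
            Ok(response) => return Ok(response),
            Err(err) => failures.push((peer, err)),
        }
    }
    Err(failures)
}
```

Because the error is part of the return type, the compiler forces each call site to decide what happens on failure, which is the cascade effect described above.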

The main goal of this issue is to have timeouts for requests. We need to add an integration test that verifies this. We may also need to make the timeout threshold configurable (the default value could be 20 seconds).
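A configurable timeout with a 20-second default could look roughly like the sketch below. It is a blocking, standard-library stand-in for the asynchronous code in the node; `RequestResponseConfig`, `request_timeout`, `await_response` and `RequestError` are assumed names, not existing fuel-core items.

```rust
use std::sync::mpsc::{Receiver, RecvTimeoutError};
use std::time::Duration;

/// Hypothetical configuration holding the request timeout.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct RequestResponseConfig {
    pub request_timeout: Duration,
}

impl Default for RequestResponseConfig {
    fn default() -> Self {
        // Default suggested in the issue text: 20 seconds.
        Self { request_timeout: Duration::from_secs(20) }
    }
}

/// Hypothetical error type returned to the code awaiting a response.
#[derive(Debug, Clone, PartialEq, Eq)]
pub enum RequestError {
    Timeout,
    ChannelClosed,
}

/// Waits for a response for at most `config.request_timeout` and turns
/// both "no reply in time" and "sender dropped" into explicit errors.
pub fn await_response<T>(
    rx: &Receiver<T>,
    config: &RequestResponseConfig,
) -> Result<T, RequestError> {
    rx.recv_timeout(config.request_timeout).map_err(|err| match err {
        RecvTimeoutError::Timeout => RequestError::Timeout,
        RecvTimeoutError::Disconnected => RequestError::ChannelClosed,
    })
}
```

An integration test for the real implementation would follow the same pattern: start a request against a peer that never answers, use a short timeout, and assert that the caller receives a timeout error instead of waiting forever.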


For some reason we had two layers of serialization for request/response messages. This doesn't seem useful at all, and it complicates e.g. error handling. This PR removes the extra layer, substantially simplifying that logic. One major upside of this is that #1345 and #1350 can now be solved in a single follow-up PR. ~~Hopefully this doesn't conflict too much with the ongoing libp2p update PR #1379.~~

Co-authored-by: Green Baneling <XgreenX9999@gmail.com>

Closes #1345. Closes #1346. Closes #1350.

This PR stops discarding request errors from libp2p and instead returns them to the sender of the request. It also penalizes peers for sending invalid responses or for not replying at all. Making the penalty configurable should be a follow-up PR, as there are other penalties that should be configurable as well.

TODO:
- [x] Make timeout configurable: already seems to be the case on the master branch
- [x] Add tests
- [x] Fix current tests that for some reason don't terminate

Co-authored-by: xgreenx <xgreenx9999@gmail.com>
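The penalty behaviour described for this PR can be pictured with a rough sketch like the one below. All names (`PeerScores`, `Offence`, `penalize`) and the integer peer id are hypothetical, and the penalty values are left to the caller because the PR text does not state them.

```rust
use std::collections::HashMap;

/// Stand-in for a real peer identifier (assumption for this sketch).
pub type PeerId = u32;

/// The two kinds of misbehaviour named in the PR description.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Offence {
    NoResponse,
    InvalidResponse,
}

/// Keeps a reputation score per peer; every peer starts at 0.
pub struct PeerScores {
    scores: HashMap<PeerId, i32>,
    no_response_penalty: i32,
    invalid_response_penalty: i32,
}

impl PeerScores {
    pub fn new(no_response_penalty: i32, invalid_response_penalty: i32) -> Self {
        Self {
            scores: HashMap::new(),
            no_response_penalty,
            invalid_response_penalty,
        }
    }

    /// Subtracts the penalty for `offence` and returns the new score.
    pub fn penalize(&mut self, peer: PeerId, offence: Offence) -> i32 {
        let penalty = match offence {
            Offence::NoResponse => self.no_response_penalty,
            Offence::InvalidResponse => self.invalid_response_penalty,
        };
        let score = self.scores.entry(peer).or_insert(0);
        *score = score.saturating_sub(penalty);
        *score
    }
}
```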

xgreenx self-assigned this Sep 5, 2023

xgreenx mentioned this issue Sep 5, 2023

Peer Reputation for syncing #943

Open

15 tasks

MitchTurner mentioned this issue Sep 5, 2023

Subtract Peer Reputation for Request Timeout #1346

Closed

xgreenx removed their assignment Sep 26, 2023

xgreenx mentioned this issue Sep 26, 2023

Report Peers that give incorrect (not deserializable) data #1350

Closed

xgreenx assigned Dentosal Nov 28, 2023

Dentosal mentioned this issue Dec 22, 2023

Simplify p2p request/response message serialization #1573

Merged

Dentosal mentioned this issue Jan 9, 2024

Decrease peer reputation on request timeouts and decode errors #1574

Merged

3 tasks

Dentosal closed this as completed in #1574 Feb 5, 2024
