Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

refactor: document and expose Amino DHT defaults #990

Merged
merged 2 commits into from
Nov 13, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
57 changes: 57 additions & 0 deletions amino/defaults.go
Original file line number Diff line number Diff line change
@@ -0,0 +1,57 @@
// Package amino provides protocol parameters and suggested default values for the [Amino DHT].
//
// [Amino DHT] is an implementation of the Kademlia distributed hash table (DHT) algorithm,
// originally designed for use in IPFS (InterPlanetary File System) network.
// This package defines key constants and protocol identifiers used in the Amino DHT implementation.
//
// [Amino DHT]: https://probelab.io/ipfs/amino/
package amino

import (
"time"

"github.com/libp2p/go-libp2p/core/protocol"
)

const (
// ProtocolPrefix is the base prefix for Amono DHT protocols.
ProtocolPrefix protocol.ID = "/ipfs"

// ProtocolID is the latest protocol identifier for the Amino DHT.
ProtocolID protocol.ID = "/ipfs/kad/1.0.0"

// DefaultBucketSize is the Amino DHT bucket size (k in the Kademlia paper).
// It represents the maximum number of peers stored in each
// k-bucket of the routing table.
DefaultBucketSize = 20

// DefaultConcurrency is the suggested number of concurrent requests (alpha
// in the Kademlia paper) for a given query path in Amino DHT. It
// determines how many parallel lookups are performed during network
// traversal.
DefaultConcurrency = 10

// DefaultResiliency is the suggested number of peers closest to a target
// that must have responded in order for a given query path to complete in
// Amino DHT. This helps ensure reliable results by requiring multiple
// confirmations.
DefaultResiliency = 3

// DefaultProvideValidity is the default time that a Provider Record should
// last on Amino DHT before it needs to be refreshed or removed. This value
// is also known as Provider Record Expiration Interval.
DefaultProvideValidity = 48 * time.Hour

// DefaultProviderAddrTTL is the TTL to keep the multi addresses of
// provider peers around. Those addresses are returned alongside provider.
// After it expires, the returned records will require an extra lookup, to
// find the multiaddress associated with the returned peer id.
DefaultProviderAddrTTL = 24 * time.Hour
)

var (
// Protocols is a slice containing all supported protocol IDs for Amino DHT.
// Currently, it only includes the main ProtocolID, but it's defined as a slice
// to allow for potential future protocol versions or variants.
Protocols = []protocol.ID{ProtocolID}
)
3 changes: 2 additions & 1 deletion crawler/options.go
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,7 @@
import (
"time"

"github.com/libp2p/go-libp2p-kad-dht/amino"
"github.com/libp2p/go-libp2p/core/protocol"
)

Expand All @@ -20,7 +21,7 @@
// defaults are the default crawler options. This option will be automatically
// prepended to any options you pass to the crawler constructor.
var defaults = func(o *options) error {
o.protocols = []protocol.ID{"/ipfs/kad/1.0.0"}
o.protocols = amino.Protocols

Check warning on line 24 in crawler/options.go

View check run for this annotation

Codecov / codecov/patch

crawler/options.go#L24

Added line #L24 was not covered by tests
o.parallelism = 1000
o.connectTimeout = time.Second * 5
o.perMsgTimeout = time.Second * 5
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should these be defined in defaults.go as well, even if not exported?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These seem to be specific to crawler and really part of "Amino DHT contract", so could
I'm going to merge thus PR as-is, we can figure out what to do remaining magic numbers in follow-up PR.

Expand Down
11 changes: 6 additions & 5 deletions dht_options.go
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,7 @@ import (
"testing"
"time"

"github.com/libp2p/go-libp2p-kad-dht/amino"
dhtcfg "github.com/libp2p/go-libp2p-kad-dht/internal/config"
pb "github.com/libp2p/go-libp2p-kad-dht/pb"
"github.com/libp2p/go-libp2p-kad-dht/providers"
Expand Down Expand Up @@ -34,7 +35,7 @@ const (
)

// DefaultPrefix is the application specific prefix attached to all DHT protocols by default.
const DefaultPrefix protocol.ID = "/ipfs"
const DefaultPrefix protocol.ID = amino.ProtocolPrefix

type Option = dhtcfg.Option

Expand Down Expand Up @@ -136,7 +137,7 @@ func NamespacedValidator(ns string, v record.Validator) Option {
// ProtocolPrefix sets an application specific prefix to be attached to all DHT protocols. For example,
// /myapp/kad/1.0.0 instead of /ipfs/kad/1.0.0. Prefix should be of the form /myapp.
//
// Defaults to dht.DefaultPrefix
// Defaults to amino.ProtocolPrefix
func ProtocolPrefix(prefix protocol.ID) Option {
return func(c *dhtcfg.Config) error {
c.ProtocolPrefix = prefix
Expand Down Expand Up @@ -167,7 +168,7 @@ func V1ProtocolOverride(proto protocol.ID) Option {

// BucketSize configures the bucket size (k in the Kademlia paper) of the routing table.
//
// The default value is 20.
// The default value is amino.DefaultBucketSize
func BucketSize(bucketSize int) Option {
return func(c *dhtcfg.Config) error {
c.BucketSize = bucketSize
Expand All @@ -177,7 +178,7 @@ func BucketSize(bucketSize int) Option {

// Concurrency configures the number of concurrent requests (alpha in the Kademlia paper) for a given query path.
//
// The default value is 10.
// The default value is amino.DefaultConcurrency
func Concurrency(alpha int) Option {
return func(c *dhtcfg.Config) error {
c.Concurrency = alpha
Expand All @@ -188,7 +189,7 @@ func Concurrency(alpha int) Option {
// Resiliency configures the number of peers closest to a target that must have responded in order for a given query
// path to complete.
//
// The default value is 3.
// The default value is amino.DefaultResiliency
func Resiliency(beta int) Option {
return func(c *dhtcfg.Config) error {
c.Resiliency = beta
Expand Down
16 changes: 8 additions & 8 deletions internal/config/config.go
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,7 @@
"github.com/ipfs/boxo/ipns"
ds "github.com/ipfs/go-datastore"
dssync "github.com/ipfs/go-datastore/sync"
"github.com/libp2p/go-libp2p-kad-dht/amino"
"github.com/libp2p/go-libp2p-kad-dht/internal/net"
pb "github.com/libp2p/go-libp2p-kad-dht/pb"
"github.com/libp2p/go-libp2p-kad-dht/providers"
Expand All @@ -19,9 +20,7 @@
)

// DefaultPrefix is the application specific prefix attached to all DHT protocols by default.
const DefaultPrefix protocol.ID = "/ipfs"

const defaultBucketSize = 20
const DefaultPrefix protocol.ID = amino.ProtocolPrefix

// ModeOpt describes what mode the dht should operate in
type ModeOpt int
Expand Down Expand Up @@ -127,9 +126,9 @@

o.MaxRecordAge = providers.ProvideValidity

o.BucketSize = defaultBucketSize
o.Concurrency = 10
o.Resiliency = 3
o.BucketSize = amino.DefaultBucketSize
o.Concurrency = amino.DefaultConcurrency
o.Resiliency = amino.DefaultResiliency
o.LookupCheckConcurrency = 256
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Another candidate for defaults? Also, wish there was some explanation why 256 is the chosen value for concurrency.


// MAGIC: It makes sense to set it to a multiple of OptProvReturnRatio * BucketSize. We chose a multiple of 4.
Expand All @@ -139,11 +138,12 @@
}

func (c *Config) Validate() error {
// Configuration is validated and enforced only if prefix matches Amino DHT
if c.ProtocolPrefix != DefaultPrefix {
return nil
}
if c.BucketSize != defaultBucketSize {
return fmt.Errorf("protocol prefix %s must use bucket size %d", DefaultPrefix, defaultBucketSize)
if c.BucketSize != amino.DefaultBucketSize {
return fmt.Errorf("protocol prefix %s must use bucket size %d", DefaultPrefix, amino.DefaultBucketSize)

Check warning on line 146 in internal/config/config.go

View check run for this annotation

Codecov / codecov/patch

internal/config/config.go#L145-L146

Added lines #L145 - L146 were not covered by tests
}
if !c.EnableProviders {
return fmt.Errorf("protocol prefix %s must have providers enabled", DefaultPrefix)
Expand Down
5 changes: 3 additions & 2 deletions protocol.go
Original file line number Diff line number Diff line change
@@ -1,12 +1,13 @@
package dht

import (
"github.com/libp2p/go-libp2p-kad-dht/amino"
"github.com/libp2p/go-libp2p/core/protocol"
)

var (
// ProtocolDHT is the default DHT protocol.
ProtocolDHT protocol.ID = "/ipfs/kad/1.0.0"
ProtocolDHT protocol.ID = amino.ProtocolID
// DefaultProtocols spoken by the DHT.
DefaultProtocols = []protocol.ID{ProtocolDHT}
DefaultProtocols = amino.Protocols
)
5 changes: 3 additions & 2 deletions providers/providers_manager.go
Original file line number Diff line number Diff line change
Expand Up @@ -14,6 +14,7 @@ import (
"github.com/ipfs/go-datastore/autobatch"
dsq "github.com/ipfs/go-datastore/query"
logging "github.com/ipfs/go-log/v2"
"github.com/libp2p/go-libp2p-kad-dht/amino"
"github.com/libp2p/go-libp2p-kad-dht/internal"
"github.com/libp2p/go-libp2p/core/peer"
"github.com/libp2p/go-libp2p/core/peerstore"
Expand All @@ -30,12 +31,12 @@ const (
// peers around. Those addresses are returned alongside provider. After
// it expires, the returned records will require an extra lookup, to
// find the multiaddress associated with the returned peer id.
ProviderAddrTTL = 24 * time.Hour
ProviderAddrTTL = amino.DefaultProviderAddrTTL
)

// ProvideValidity is the default time that a Provider Record should last on DHT
// This value is also known as Provider Record Expiration Interval.
var ProvideValidity = time.Hour * 48
var ProvideValidity = amino.DefaultProvideValidity
var defaultCleanupInterval = time.Hour
var lruCacheSize = 256
var batchBufferSize = 256
Expand Down