[RFC] Support multiple endpoints for load balancing and failover #894

zhicwu · 2022-04-10T00:49:38Z

Background

ClickHouse supports clustering and often runs in a cluster. Typical approaches for load balancing and failover are:

- distributed tables
- a proxy/gateway (e.g. chproxy, traefik) sitting between client and server
- or DNS-based service discovery such as Consul

In addition, message queues (federated or not) and client-side customization may require further optimization of reads and writes across servers.

It would be really nice to enhance both the Java client and the JDBC driver to support multiple endpoints, giving people more options to choose from. Yes, we have BalancedClickhouseDataSource in the JDBC driver, and maybe ClickHouseCluster in the Java client, but unfortunately neither is good enough, and the latter was never even tested in a clustered environment.

Concept

- Endpoint - essentially a combination of host, port and protocol. It may carry additional information such as status, role, tags, weight, server revision/version/timezone, and credentials for authentication. (localhost, 8123, http) and (127.0.0.1, 8123, http) are different endpoints.
- Server - an instance of ClickHouse server, which exposes multiple endpoints for clients to connect to. clickhouse-local is a special type of server, which only exists when needed.
- Cluster - a group of ClickHouse servers under the same name, as described in the system.clusters table.
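
To make the endpoint definition concrete, here is a minimal sketch. The `Endpoint` class below is hypothetical (it is not the proposed `ClickHouseEndpoint`), and it assumes identity is exactly the (host, port, protocol) triple with no DNS resolution, which is why `localhost` and `127.0.0.1` compare as different endpoints.

```java
import java.util.Locale;
import java.util.Objects;

// Hypothetical value type for illustration; this is not the proposed ClickHouseEndpoint.
final class Endpoint {
    final String host;
    final int port;
    final String protocol;

    Endpoint(String host, int port, String protocol) {
        if (port < 1 || port > 65535) {
            throw new IllegalArgumentException("port out of range: " + port);
        }
        // Normalize case only; the host is deliberately not resolved via DNS.
        this.host = Objects.requireNonNull(host, "host").toLowerCase(Locale.ROOT);
        this.port = port;
        this.protocol = Objects.requireNonNull(protocol, "protocol").toLowerCase(Locale.ROOT);
    }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof Endpoint)) return false;
        Endpoint other = (Endpoint) o;
        return port == other.port && host.equals(other.host) && protocol.equals(other.protocol);
    }

    @Override
    public int hashCode() {
        return Objects.hash(host, port, protocol);
    }
}
```

Additional attributes such as status, tags or weight would probably live outside `equals`, so that a health-check update does not change an endpoint's identity.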

User Scenario

| # | Operation | Target | Remark |
| --- | --- | --- | --- |
| 1.1 | Connect | an endpoint | remember exactly which server was connected to, based on `X-ClickHouse-Server` in the response |
| 1.2 | Connect | multiple endpoints | either one of them based on the load balancing policy, or multiple based on property matching (e.g. tags/status/distance) |
| 1.3 | Connect | a server | either one of the discovered endpoints, or multiple |
| 1.4 | Connect | a cluster | either one of the discovered endpoints, or multiple |
| 2.1 | Read | an endpoint | remember exactly which server was queried, based on `X-ClickHouse-Server` in the response |
| 2.2 | Read | multiple endpoints | either one of them based on the load balancing policy, or multiple based on property matching and the query (prefer to query multiple nodes in parallel) |
| 2.3 | Read | a server | one of the discovered endpoints, perhaps using the best protocol available |
| 2.4 | Read | a cluster | either one of the discovered endpoints, or multiple, taking the sharding key and replicas into account |
| 3.1 | Write | an endpoint | remember exactly which server was written to, based on `X-ClickHouse-Server` in the response |
| 3.2 | Write | multiple endpoints | either one of them based on the load balancing policy, or multiple based on property matching and the target table (prefer to write into the local table) |
| 3.3 | Write | a server | one of the discovered endpoints, perhaps using the best protocol available |
| 3.4 | Write | a cluster | either one of the discovered endpoints, or multiple, taking the sharding key and replicas into account |

Proposed Solution

To support the user scenarios above, we need to implement the following features:

| # | Feature | Remark |
| --- | --- | --- |
| 1 | Auto discovery | The ability to automatically discover: 1) exposed ports of a ClickHouse server; 2) servers of a cluster; 3) the mapping between a configured endpoint and the server actually connected to (`X-ClickHouse-Server` in the response) |
| 2 | Load balancing | Pick an endpoint from a sorted and filtered list using one of two policies: 1) first alive; and 2) round robin |
| 3 | Failover | Retry only on a network issue (one that happens before the query is sent) or on failure of a SELECT query |
| 4 | Protocol detection | Probe the given host + port to figure out the protocol - example |
| 5 | Health check | Besides liveness detection like ping, we need to understand the "distance" between the client and each endpoint |
| 6 | Parallel query | Query partitioned data stored on multiple servers and return a consolidated (unordered) response to the caller |
| 7 | Dual write | Write the same or distributed data into multiple endpoints. Might need to introduce a configurable client-side consistency level |
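
The failover rule (feature 3) can be sketched as a small predicate. Everything here is an assumption for illustration: the `FailoverRule` class is hypothetical, the caller is assumed to know whether the failure happened before the query was sent, and the SELECT check is deliberately naive (it ignores leading comments and `WITH ... SELECT`).

```java
import java.util.regex.Pattern;

// Hypothetical helper; not part of the Java client or JDBC driver.
final class FailoverRule {
    // Naive check: the statement starts with the keyword SELECT.
    private static final Pattern SELECT =
            Pattern.compile("^\\s*select\\b", Pattern.CASE_INSENSITIVE);

    private FailoverRule() {
    }

    static boolean isSelect(String sql) {
        return SELECT.matcher(sql).find();
    }

    // Retry when nothing reached the server, or when the statement only reads data.
    static boolean shouldRetry(String sql, boolean failedBeforeSend) {
        if (failedBeforeSend) {
            return true;
        }
        return isSelect(sql);
    }
}
```

The point of restricting retries this way is that a write which may already have reached a server is never replayed on another endpoint.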

Potential changes to existing code:

- ClickHouseCluster - represents a cluster, which contains a list of endpoints. The class needs to be rewritten accordingly, with a background thread added for auto-discovery.
- ClickHouseEndpoint - represents an endpoint. Replacement for ClickHouseNode.
- ClickHouseEndpoints - represents a list of endpoints. It contains methods to add ClickHouseEndpoint and ClickHouseCluster, and a background thread for health checks.
- LoadBalancingPolicy - a load balancing policy that can be used in ClickHouseEndpoints to pick an endpoint from a sorted (based on status, weight and maybe distance) and filtered (based on tags etc.) list
- JDBC URL - `jdbc:ch://server1,(grpc://server2?tags=dc2),(tcp://server3:9000),(http://user4:passwd@server4:8124/db4?tags=dc1)/db?cluster=mycluster&tags=dc1&lbPolicy=firstAlive&autoDiscovery=true`
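
As a rough illustration of how a LoadBalancingPolicy might pick from a sorted and filtered list, consider the sketch below. The `Candidate` and `Balancer` classes and their fields are hypothetical, and the ordering (heavier weight first, ties kept in input order) is an assumption; the proposal does not fix these details.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Optional;
import java.util.Set;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical candidate: name, liveness, weight and tags are illustrative fields.
final class Candidate {
    final String name;
    final boolean alive;
    final int weight;
    final Set<String> tags;

    Candidate(String name, boolean alive, int weight, Set<String> tags) {
        this.name = name;
        this.alive = alive;
        this.weight = weight;
        this.tags = tags;
    }
}

final class Balancer {
    private final AtomicInteger counter = new AtomicInteger();

    // Keep alive candidates that carry all required tags, heaviest weight first.
    static List<Candidate> eligible(List<Candidate> all, Set<String> requiredTags) {
        List<Candidate> out = new ArrayList<>();
        for (Candidate c : all) {
            if (c.alive && c.tags.containsAll(requiredTags)) {
                out.add(c);
            }
        }
        out.sort(Comparator.comparingInt((Candidate c) -> c.weight).reversed());
        return out;
    }

    // firstAlive: always the head of the sorted, filtered list.
    Optional<Candidate> firstAlive(List<Candidate> all, Set<String> requiredTags) {
        List<Candidate> list = eligible(all, requiredTags);
        return list.isEmpty() ? Optional.empty() : Optional.of(list.get(0));
    }

    // roundRobin: rotate through the sorted, filtered list on each call.
    Optional<Candidate> roundRobin(List<Candidate> all, Set<String> requiredTags) {
        List<Candidate> list = eligible(all, requiredTags);
        if (list.isEmpty()) {
            return Optional.empty();
        }
        int index = Math.floorMod(counter.getAndIncrement(), list.size());
        return Optional.of(list.get(index));
    }
}
```

`Math.floorMod` keeps the index non-negative even after the counter overflows, so the rotation stays valid for long-lived connections.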

JDBC Connection Properties

| Property | Description | Example |
| --- | --- | --- |
| cluster | name of the cluster | cluster=mycluster |
| tags | comma-separated tags for categorization | tags=dc1,readonly,r1s1 |
| lbPolicy | either `firstAlive` (default) or `roundRobin` | lbPolicy=roundRobin |
| failover | either `true` or `false` (default); retries only on a network issue or failure of a SELECT query | failover=true |
| autoDiscovery | either `true` or `false` (default) | autoDiscovery=true |
| healthCheck | either `true` (default) or `false` | healthCheck=false |
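
The defaults in this table can be illustrated with a small parsing sketch. `ConnectionOptions` is a hypothetical helper, not driver code; it assumes the properties arrive as a plain `key=value&key=value` query string and performs no URL-decoding.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper that applies the documented defaults; not driver code.
final class ConnectionOptions {
    private ConnectionOptions() {
    }

    static Map<String, String> defaults() {
        Map<String, String> options = new LinkedHashMap<>();
        options.put("lbPolicy", "firstAlive");
        options.put("failover", "false");
        options.put("autoDiscovery", "false");
        options.put("healthCheck", "true");
        return options;
    }

    // Parses "k1=v1&k2=v2" on top of the defaults; malformed pairs are skipped.
    static Map<String, String> parse(String query) {
        Map<String, String> options = defaults();
        if (query == null || query.isEmpty()) {
            return options;
        }
        for (String pair : query.split("&")) {
            int eq = pair.indexOf('=');
            if (eq <= 0) {
                continue;
            }
            options.put(pair.substring(0, eq), pair.substring(eq + 1));
        }
        return options;
    }

    // Splits the comma-separated "tags" property into a list.
    static List<String> tags(Map<String, String> options) {
        String raw = options.getOrDefault("tags", "");
        return raw.isEmpty() ? Collections.emptyList() : Arrays.asList(raw.split(","));
    }
}
```

With this sketch, `parse("cluster=mycluster&tags=dc1,readonly,r1s1&lbPolicy=roundRobin")` keeps `failover=false` and `healthCheck=true` from the defaults while overriding `lbPolicy`.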


dynaxis · 2022-04-10T11:50:35Z

@zhicwu it seems I need to plan thoroughly, since it will be nearly impossible to implement all of the features at once. I'll think it over and try to share my thoughts soon. BTW, what do you mean by "Parallel query"? How is it different from the usual distributed table in ClickHouse? I guess it's a means to query multiple partitioned tables without resorting to the Distributed table engine?

Maybe first, I need to sync my understanding of your notes above with your actual intention. Thanks.

zhicwu · 2022-04-10T22:52:11Z

No worries, we can start small :) Perhaps you can come up with an initial pull request to get your changes merged first, and then enhance/refactor gradually over multiple releases. "Fancy" features can be considered at a later stage.

> what do you mean by "Parallel query"? How is it different from usual distributed table in ClickHouse?

Yes, it's similar to a distributed table, but not limited to one cluster. It's just an immature thought - I was thinking of covering multiple clusters across regions and external data sources (e.g. url() and jdbc()).

zhicwu · 2022-05-26T00:56:48Z

Any update @dynaxis? Just trying to avoid duplicated effort here. I'm thinking of adding an alternative to BalancedClickhouseDataSource this coming weekend. Of course, it's not going to be a complete implementation of everything above, just a bare-bones version with a single load-balancing policy and no auto-discovery or failover.

A few things to start with:

1. the ability to probe secure and insecure ports (to figure out the protocol) - feature 4
2. new classes: ClickHouseEndpoint and ClickHouseEndpoints (or ClickHouseNode and ClickHouseNodes instead, for minimum changes)
3. ClickHouseEndpoint can carry ClickHouseConfig settings such as timeout and buffer size, which will override the ones in ClickHouseClient when creating a ClickHouseRequest

4. the ability to connect to a URL in Java client

// host=localhost, protocol=tcp, port=9440, ssl=true, sslmode=NONE
client.connect(ClickHouseEndpoint.of("tcps://localhost/system")); // tcps is short for TCP secure
// host=localhost, protocol=http, port=8123, ssl=false
client.connect("http://localhost/system"); // port is NOT 80 as we're connecting to ClickHouse, not a web server
// host=localhost, protocol=grpc, port=9100, ssl=false
client.connect("localhost:9100/system"); // probe port 9100 to figure out protocol
// host=localhost, protocol=http, port=8123, ssl=false, database=default
client.connect("localhost"); // probe default port 8123 to figure out protocol, try more ports only when autoDiscovery is enabled

// may cover the case below later - cached ClickHouseEndpoints will be shared among clients
// client.connect("localhost, https://localhost:443,grpc://localhost,tcp://localhost");
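
To illustrate how the defaults in these comments could be resolved, here is a rough sketch based on `java.net.URI`. `EndpointUrl` is a hypothetical class, not the actual client API. The default ports are the ones mentioned in the examples in this thread (8123 for http, 9000 for tcp, 9440 for tcps, 9100 for grpc), and a URL without a scheme is assumed to mean "any protocol, to be probed", starting from port 8123.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical parser for illustration; not the actual client API.
final class EndpointUrl {
    // Default ports as mentioned in the examples in this thread.
    private static final Map<String, Integer> DEFAULT_PORTS = new HashMap<>();

    static {
        DEFAULT_PORTS.put("http", 8123);
        DEFAULT_PORTS.put("tcp", 9000);
        DEFAULT_PORTS.put("tcps", 9440);
        DEFAULT_PORTS.put("grpc", 9100);
        DEFAULT_PORTS.put("any", 8123); // assumption: probing starts at 8123
    }

    final String protocol;
    final String host;
    final int port;
    final String database;

    private EndpointUrl(String protocol, String host, int port, String database) {
        this.protocol = protocol;
        this.host = host;
        this.port = port;
        this.database = database;
    }

    static EndpointUrl parse(String url) {
        // Without a scheme, java.net.URI would treat "localhost" as the scheme,
        // so a placeholder scheme is prepended first.
        String normalized = url.contains("://") ? url : "any://" + url;
        URI uri = URI.create(normalized);
        if (uri.getHost() == null) {
            throw new IllegalArgumentException("no host in: " + url);
        }
        String protocol = uri.getScheme().toLowerCase(Locale.ROOT);
        int port = uri.getPort() != -1
                ? uri.getPort()
                : DEFAULT_PORTS.getOrDefault(protocol, 8123);
        String path = uri.getPath();
        String database = (path == null || path.length() <= 1) ? "default" : path.substring(1);
        return new EndpointUrl(protocol, uri.getHost(), port, database);
    }
}
```

In this sketch the protocol `any` only marks an endpoint whose protocol still has to be detected by probing; the probing itself is not shown.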

5. the ability to define nested URLs in JDBC driver

// the outermost protocol (defaults to ANY) and parameters are defaults, which can be overridden by an inner URL
Connection conn = DriverManager.getConnection(
  "jdbc:ch://server1?connection_timeout=3000,grpc://server2,(tcp://server3:9000/default)/?connection_timeout=5000", "default", "default");
 
// change default protocol to TCP(instead of ANY), so we'll probe port 9000 for all 3 servers  
conn = DriverManager.getConnection("jdbc:ch:tcp://server1,server2,server3/?connection_timeout=5000", "default", "");

6. a background thread for health checks when connecting to multiple endpoints - and maybe options to set the check interval and thread count (defaults to a single thread)

Please feel free to submit a PR or review/merge changes later.
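
A background health check along these lines might look like the sketch below. It is only an illustration under stated assumptions: `HealthChecker` is a hypothetical class, liveness is approximated by a plain TCP connect, and the "distance" is simply the connect time in nanoseconds. A real check would more likely use the protocol's own ping.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;
import java.util.List;
import java.util.Map;
import java.util.OptionalLong;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Hypothetical single-threaded health checker; not part of the Java client.
final class HealthChecker implements AutoCloseable {
    private final List<InetSocketAddress> targets;
    private final int timeoutMs;
    private final Map<InetSocketAddress, OptionalLong> latest = new ConcurrentHashMap<>();
    private final ScheduledExecutorService scheduler =
            Executors.newSingleThreadScheduledExecutor(r -> {
                Thread t = new Thread(r, "health-check");
                t.setDaemon(true); // do not keep the JVM alive for health checks
                return t;
            });

    HealthChecker(List<InetSocketAddress> targets, int timeoutMs) {
        this.targets = targets;
        this.timeoutMs = timeoutMs;
    }

    // Starts the periodic check; the first run happens immediately.
    void start(long intervalMs) {
        scheduler.scheduleWithFixedDelay(this::checkOnce, 0, intervalMs, TimeUnit.MILLISECONDS);
    }

    // Probes every target once and records the result.
    void checkOnce() {
        for (InetSocketAddress target : targets) {
            latest.put(target, connectNanos(target, timeoutMs));
        }
    }

    // Last measured connect time, or empty if unreachable or not yet checked.
    OptionalLong distance(InetSocketAddress target) {
        return latest.getOrDefault(target, OptionalLong.empty());
    }

    static OptionalLong connectNanos(InetSocketAddress target, int timeoutMs) {
        long start = System.nanoTime();
        try (Socket socket = new Socket()) {
            socket.connect(target, timeoutMs);
            return OptionalLong.of(System.nanoTime() - start);
        } catch (IOException e) {
            return OptionalLong.empty();
        }
    }

    @Override
    public void close() {
        scheduler.shutdownNow();
    }
}
```

A caller would construct it with the endpoint addresses, call `start(intervalMs)` once, and read `distance(...)` when sorting endpoints. An empty result means the endpoint is considered down or has not been checked yet.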

arickbro · 2022-06-13T02:29:22Z

I think this could be a reference to start with
https://mariadb.com/kb/en/failover-and-high-availability-with-mariadb-connector-j/
https://dev.mysql.com/doc/connector-j/5.1/en/connector-j-config-failover.html

zhicwu · 2022-06-13T03:20:23Z

> I think this could be a reference to start with https://mariadb.com/kb/en/failover-and-high-availability-with-mariadb-connector-j/ https://dev.mysql.com/doc/connector-j/5.1/en/connector-j-config-failover.html

Thanks @arickbro. It's a bit late, as I've already implemented a better-than-nothing version :p

I was expecting a better result, but unfortunately not much progress was made in the past week, so I'm going to release the new patch anyway and hopefully gain more feedback for future enhancements.

tisonkun · 2022-12-06T15:21:39Z

I can see that the load balancing policy is supported in 0.3.2-patch11. But the patch version isn't published to Maven Central.

Shall I wait for a 0.3.3 release, or is there some way I can depend on the patch release?

zhicwu · 2022-12-06T23:55:05Z

Hi @tisonkun, it was published to Maven Central months ago - see here. Perhaps you were using the legacy groupId ru.yandex.clickhouse?

tisonkun · 2022-12-07T00:03:29Z

@zhicwu Thank you! This is exactly the case.

zhicwu · 2022-12-07T01:12:36Z

> @zhicwu Thank you! This is exactly the case.

It's highly recommended to upgrade the JDBC driver to 0.3.2 for better performance and stability. As for the legacy driver, it has been removed, so hopefully things will be less confusing starting with 0.3.3.

tisonkun · 2022-12-07T01:54:56Z

@zhicwu I made a patch to upgrade the dependency for Pulsar ClickHouse Connector apache/pulsar#18774.

I'd appreciate it if you could also check whether the upgrade is correct and transparent to users.

zhicwu · 2023-01-18T15:57:21Z

Will consider parallel read/write etc. in bridge server.

tisonkun · 2023-01-18T16:52:05Z

@zhicwu cool! Do you have an estimate for 0.4.0 (the next feature release)?

zhicwu added the enhancement label Apr 10, 2022

zhicwu added this to the 0.3.3 release milestone Apr 10, 2022

This was referenced Apr 10, 2022

Roadmap 2022 #784

Closed

Why package com.clickhouse.client.grpc.impl isn't in master branch? #799

Closed

This was referenced May 22, 2022

Multi-protocol support for ClickHouse dbeaver/dbeaver#16545

Closed

clickhouse-go compatible DSN #524

Closed

zhicwu mentioned this issue Jun 18, 2022

Multi-endpoint support #956

Merged

zhicwu linked a pull request Jul 15, 2022 that will close this issue

Fix slowness in performance mode and failover not working when protocol is unsupported #995

Merged

This was referenced Dec 6, 2022

[Issue 12998][io] enhance clickhouse cluster sink support apache/pulsar#12999

Closed

Pulsar IO support Clickhouse Load Balance apache/pulsar#12998

Closed

zhicwu closed this as completed Jan 18, 2023
