Dual Data Center Config #1368
Replies: 2 comments
-
Why not use only one cluster in one DC? Never run a cluster across datacenters.

"I thought I had seen this discussed here before. Is there a documented way to address this? I was thinking of having a spare server node in each DC that would be added to the cluster if the other DC is unreachable, but it would also need to be taken down when the DC returns. Also, if I tag jobs for NJ and CHI, how would I get the NJ jobs to run in CHI if NJ goes down?"

No, there is no document on how to approach this; it should be considered case by case. What I would do in your case is run a single three-node cluster in Chicago with a good backup system in place, so that in the extremely unlikely event of a DC failure you can spin up the cluster in the other DC and load the backup in a matter of minutes.
-
We have two data centers, and I had run them as separate clusters because we have jobs that run locally in each. However, some jobs can run in either DC, and I had them running from one or the other rather arbitrarily. We had one DC go offline for more than 48 hours, and there were jobs on the down DC's cluster that we had to run manually. With one cluster across two DCs, I believe we can configure jobs to run in DC1 or DC2 with the DC tag. That would allow us, in times of failure, to simply change the DC on the job if we felt the outage was going to be long enough. Right now I have six nodes in DC1 and five in DC2. If DC1 goes down, I suppose I can spin up another node in DC2 to achieve quorum.
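For illustration, changing the DC on a job could look roughly like the sketch below. It only rewrites the job definition in memory; resubmitting the result to the cluster is left out. The tag name `dc`, the values `nj` and `chi`, the job name, and the helper `retarget_job` are all hypothetical, and the `value:count` tag syntax is an assumption about how job tags are written, so check it against your Dkron version.

```python
import copy


def retarget_job(job: dict, tag: str, new_value: str) -> dict:
    """Return a copy of `job` whose `tag` points at `new_value`.

    Assumes tag values look like "nj:1" (value, then an optional node count).
    The count suffix, if present, is preserved.
    """
    updated = copy.deepcopy(job)
    old = updated.get("tags", {}).get(tag)
    if old is None:
        raise KeyError(f"job has no tag named {tag!r}")
    _, sep, count = old.partition(":")
    updated["tags"][tag] = f"{new_value}{sep}{count}"
    return updated


# Hypothetical job definition pinned to the NJ data center.
job = {"name": "nightly_report", "schedule": "@daily", "tags": {"dc": "nj:1"}}

# During an NJ outage, point the same job at Chicago instead.
moved = retarget_job(job, "dc", "chi")
print(moved["tags"])
```

The original `job` dict is left untouched, so the NJ definition can be restored once the DC is back.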
-
I thought I had seen a discussion of this here previously but can't find it. We have two data centers with a separate Dkron cluster in each. We had the EXTREMELY unlikely event of one of our data centers going down for 51 hours, and jobs that were in our NJ DC had to be run manually in Chicago.
Our Chicago jobs were fine, and most of the NJ jobs could have run from Chicago had they been configured to do so. But because we had two separate clusters, this was not possible.
When I set this up I thought of using a single cluster with tags for each DC, but we didn't go that route because of quorum; if we have 5 servers in each DC, and a DC goes down, there's no quorum.
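The quorum concern can be checked with a little arithmetic. The sketch below assumes the usual Raft-style majority rule (more than half of the voting servers must be reachable) and that every node counted is a voting server; the function names are made up for this example.

```python
def quorum(servers: int) -> int:
    """Smallest majority of `servers` voting members."""
    return servers // 2 + 1


def survives_dc_loss(layout: tuple, lost: int) -> bool:
    """True if the servers left after losing DC number `lost` still form a majority.

    `layout` holds the number of voting servers in each data center.
    """
    total = sum(layout)
    remaining = total - layout[lost]
    return remaining >= quorum(total)


for layout in [(5, 5), (6, 5), (3, 3, 3)]:
    results = [survives_dc_loss(layout, i) for i in range(len(layout))]
    print(layout, "quorum:", quorum(sum(layout)), "survives loss of each DC:", results)
```

With two data centers, at least one of them always holds half or fewer of the servers, so losing the other one breaks quorum. Only a layout spread over three or more sites can lose any single site and keep a majority.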
I thought I had seen this discussed here before. Is there a documented way to address this? I was thinking of having a spare server node in each DC that would be added to the cluster if the other DC is unreachable, but it would also need to be taken down when the DC returns. Also, if I tag jobs for NJ and CHI, how would I get the NJ jobs to run in CHI if NJ goes down?
Thanks in advance!
sean