
Enable graceful shutdown of server processes #5193

Open
dlmarion wants to merge 10 commits into base: 2.1

Conversation

dlmarion (Contributor):

Added a ShutdownHook via Hadoop's ShutdownHookManager that interrupts the main server thread and sets the shutdownRequested variable to true. Removed variables from subclasses that were used to track shutdown requests. Modified the server threads' run methods to attempt an orderly shutdown.

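As a reference for the approach described above, here is a minimal sketch using Hadoop's ShutdownHookManager; the class and field names (other than ShutdownHookManager itself) and the priority value are illustrative, not the PR's actual code.

import java.util.concurrent.atomic.AtomicBoolean;
import org.apache.hadoop.util.ShutdownHookManager;

class GracefulShutdownSketch {
  private final AtomicBoolean shutdownRequested = new AtomicBoolean(false);

  // Register a hook that flags shutdown and interrupts the main server
  // thread; Hadoop runs registered hooks in descending priority order
  // when the JVM shuts down. The priority (100) is arbitrary here.
  void registerHook(Thread mainServerThread) {
    ShutdownHookManager.get().addShutdownHook(() -> {
      shutdownRequested.set(true);
      mainServerThread.interrupt();
    }, 100);
  }

  boolean isShutdownRequested() {
    return shutdownRequested.get();
  }
}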
dlmarion added this to the 2.1.4 milestone Dec 17, 2024
dlmarion self-assigned this Dec 17, 2024
dlmarion (Contributor, Author):

Marked as draft because I want to do some more testing.

ctubbsii (Member) left a comment:

A smarter person than me (I don't remember their name, but they were introduced to me as an expert on robust systems) once suggested that it's often not worth the complexity to introduce graceful shutdowns if you already have to handle ungraceful ones. It's better to spend time making sure the ungraceful ones are handled robustly; if they are, there's no harm in making ungraceful shutdowns (SIGKILL) the normal shutdown, and it's not worth the effort to even code graceful shutdowns... especially for a server process that you expect to stay running for long periods of time as part of your backend architecture, where shutdowns are abnormal anyway. I wouldn't take that as an absolute rule (because nothing is), of course, but I'm reminded of that opinion every once in a while, because I think there's some truth to it that's worth thinking about whenever we try to add complexity to handle some edge case for starting/stopping/upgrading/etc.

It's also not clear to me that shutdown hooks are always triggered by a SIGTERM or similar. It seems they also run when System.exit is called, or maybe even when the main method finishes normally... and possibly if the main method throws an Exception or even an Error. We'll have to make sure that whatever we do to handle the shutdown request is suitable for all such cases, whatever they happen to be.

There are also other considerations to be made, like how uncaught exceptions are expected to be handled in the shutdown hooks themselves, and we need to be very careful to shut down quickly. There can be no guarantee that these will even run, as the OS may determine a process is taking too long to stop and could just kill it at any time. So it's possible any complexity added to do this may not even be worth it.

Overall, I kind of like the idea, as long as it shuts down very quickly, doesn't add too much complexity, doesn't add risk by slowing down our emergency Halt measures, and actually gets used enough to be worth the effort. I would strongly prefer we avoid the Hadoop API, though. We've been slowly decoupling/disentangling ourselves from a strict dependency on Hadoop over the years, and we should avoid further mandatory entanglements.

dlmarion (Contributor, Author):

> A smarter person than me (I don't remember their name, but they were introduced to me as an expert on robust systems) once suggested that it's often not worth the complexity to introduce graceful shutdowns if you already have to handle ungraceful ones. It's better to spend time making sure the ungraceful ones are handled robustly; if they are, there's no harm in making ungraceful shutdowns (SIGKILL) the normal shutdown, and it's not worth the effort to even code graceful shutdowns... especially for a server process that you expect to stay running for long periods of time as part of your backend architecture, where shutdowns are abnormal anyway. I wouldn't take that as an absolute rule (because nothing is), of course, but I'm reminded of that opinion every once in a while, because I think there's some truth to it that's worth thinking about whenever we try to add complexity to handle some edge case for starting/stopping/upgrading/etc.

Accumulo is very good at handling ungraceful shutdowns of server processes. However, there exists a case where a user may want to inform the server process to finish what it's doing, then shut down. A good example of this is a long running compaction in progress, but the user wants to scale things differently. The user might want to let the Compactor process finish what it's doing, then shut down, with the alternative that if it takes too long they can always kill it.

> It's also not clear to me that shutdown hooks are always triggered by a SIGTERM or similar. It seems they also run when System.exit is called, or maybe even when the main method finishes normally... and possibly if the main method throws an Exception or even an Error. We'll have to make sure that whatever we do to handle the shutdown request is suitable for all such cases, whatever they happen to be.

  • Runtime.addShutdownHook says that "When the virtual machine begins its shutdown sequence it will start all registered shutdown hooks in some unspecified order and let them run concurrently", and defines the shutdown sequence as being initiated when System.exit is called, when the program exits normally, or in response to an interrupt (see the sketch after this list).
  • This table says that SIGTERM, SIGINT, and SIGHUP will execute the shutdown hooks.
  • Runtime.halt does not start shutdown hooks.
  • I didn't see anything regarding shutdown hooks running on an Exception or Error that terminates the VM; I'm assuming that they won't run.
  • Shutdown hooks won't be executed on SIGKILL.
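For reference, a minimal self-contained sketch of the behavior listed above, using a plain JVM shutdown hook: it fires on normal exit, on System.exit, and on SIGTERM/SIGINT/SIGHUP, but not on Runtime.halt or SIGKILL.

public class ShutdownHookDemo {
  public static void main(String[] args) throws InterruptedException {
    // Registered hooks start concurrently, in no specified order, when
    // the JVM begins its shutdown sequence.
    Runtime.getRuntime().addShutdownHook(new Thread(() -> {
      System.out.println("shutdown hook running");
    }, "demo-shutdown-hook"));

    System.out.println("working... send SIGTERM (kill <pid>) to watch the hook fire");
    Thread.sleep(10_000);
    // Returning from main normally also triggers the hook.
  }
}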

> There are also other considerations to be made, like how uncaught exceptions are expected to be handled in the shutdown hooks themselves, and we need to be very careful to shut down quickly. There can be no guarantee that these will even run, as the OS may determine a process is taking too long to stop and could just kill it at any time. So it's possible any complexity added to do this may not even be worth it.

Uncaught exceptions in shutdown hooks are addressed in the javadoc for Runtime.addShutdownHook.

> Overall, I kind of like the idea, as long as it shuts down very quickly, doesn't add too much complexity, doesn't add risk by slowing down our emergency Halt measures, and actually gets used enough to be worth the effort. I would strongly prefer we avoid the Hadoop API, though. We've been slowly decoupling/disentangling ourselves from a strict dependency on Hadoop over the years, and we should avoid further mandatory entanglements.

dlmarion (Contributor, Author):

@ctubbsii - Thanks for the comments. In answering them, I realized that we need a different way to signal normal shutdown and not use SIGTERM. The reason is that we would need to ensure our shutdown hook thread does not return until the main thread has exited, so that the FileSystem does not get closed out from under us. A long-running shutdown hook thread like that is likely something we don't want.
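To illustrate the concern (an editorial sketch, not code from this PR): a hook that interrupts the main thread and then waits for it holds the entire shutdown sequence open for as long as the main thread takes to exit.

public class BlockingHookDemo {
  public static void main(String[] args) {
    Thread main = Thread.currentThread();
    Runtime.getRuntime().addShutdownHook(new Thread(() -> {
      main.interrupt(); // ask the server loop to stop
      try {
        // The hook cannot return until main exits, so everything else
        // that runs at shutdown (e.g. FileSystem close hooks) waits too.
        main.join();
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
      }
    }, "wait-for-main"));

    try {
      Thread.sleep(60_000); // stand-in for the server's main loop
    } catch (InterruptedException e) {
      System.out.println("main thread interrupted; finishing up");
    }
  }
}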

I was trying to avoid creating a Thrift shutdown endpoint for all of the server processes by using SIGTERM instead, but at this point I think that's what I'm going to have to do. The idea already discussed was to create a Thrift RPC endpoint that can be used to signal graceful shutdown, and a Java application (KeywordExecutable) that calls the RPC method given a host/port.

dlmarion (Contributor, Author):

I removed the shutdown hook in aec6d47 and replaced it with a Thrift RPC mechanism.

dlmarion changed the title from "Handle SIGTERM for graceful shutdown of server processes" to "Enable graceful shutdown of server processes" Dec 18, 2024
dlmarion marked this pull request as ready for review December 18, 2024 21:43
dlmarion (Contributor, Author):

This is ready for review. I have kicked off a full IT build.

keith-turner (Contributor) left a comment:

I've only partially looked through this; I've posted the comment I have so far.

LOG.error("Compactor lost lock (reason = {}), exiting.", reason);
gcLogger.logGCInfo(getConfiguration());
});
if (isShutdownRequested()) {
Contributor:

If a compactor process loses its lock, anything it was working on may be cleaned up in the metadata table, and after that cleanup any work the compactor completes would be discarded. So we may want to always halt when the lock is lost.

dlmarion (Contributor, Author):

This is guarding against this Watcher being fired when the Compactor process performs ServiceLock.unlock as part of shutting down at the end of the run method.

keith-turner (Contributor) commented Dec 20, 2024:

We could try to determine whether this is happening in the correct time window, like the tserver does. However, it's not as important for the compactor as for the tserver. If it's still compacting, did not delete its own lock, and its lock is gone, then there is a chance that further work done is pointless, but it's not harmful.

dlmarion (Contributor, Author):

Addressed in 0d5f014

gcLogger.logGCInfo(getConfiguration());
});
if (isShutdownRequested()) {
LOG.warn(
Contributor:

The tablet server should always halt when its lock is lost, as the manager may reassign its tablets after the lock is lost.

dlmarion (Contributor, Author):

This is guarding against this Watcher being fired when the TabletServer process performs ServiceLock.unlock as part of shutting down at the end of the run method.

Contributor:

That makes sense to do; however, the check is too broad. There is a period of time where isShutdownRequested is true but the tablet server has not deleted its lock, is working on shutting down, and could even be stuck. If it loses its lock during this time period, it should still halt.
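A sketch of the narrower guard described here, assuming a separate flag that is set only once the process has deleted its own lock; all names are illustrative, not the PR's actual fields.

import java.util.concurrent.atomic.AtomicBoolean;

class LockLossGuard {
  private final AtomicBoolean shutdownRequested = new AtomicBoolean(false);
  // Set only after this process has itself called ServiceLock.unlock.
  private final AtomicBoolean ownLockDeleted = new AtomicBoolean(false);

  void requestShutdown() {
    shutdownRequested.set(true);
  }

  void markOwnLockDeleted() {
    ownLockDeleted.set(true);
  }

  // Called from the lock-loss watcher: halt unless the loss is explained
  // by our own unlock at the end of an orderly shutdown. A shutdown that
  // is merely requested (or stuck) does not excuse a lost lock.
  void onLockLost(Runnable halt) {
    if (!ownLockDeleted.get()) {
      halt.run();
    }
  }
}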

dlmarion (Contributor, Author):

Addressed in 0d5f014

dlmarion (Contributor, Author):

The full IT build was successful.

ctubbsii (Member) left a comment:

I spent more time reviewing the API and design of the sending and receiving of the shutdown signal than I did on the implementation handling the shutdown request in each server. I skimmed those, but will leave them to others to review more comprehensively.

For the design of the send/receive of the signal, I made a few comments, but overall it looks good.

My main concern would be that adding it to 2.1.4 changes the thrift RPC in a bugfix release, which can have substantial implications if somebody does a rolling restart or for some other reason has a heterogeneous cluster of different 2.1 patch versions. We strive to maintain both forward and backward compatibility within a bugfix release, so people can do things like rolling upgrades. So... if a 2.1.3 instance were to receive a signal like what would be sent by an admin command from 2.1.4, I'd want to know that it has been tested, and that 2.1.3 would ignore the signal, rather than experience a fault.

Additionally, I'm concerned about security. I made a comment about that below.

return ThriftUtil.getClientNoTimeout(this, serverProcess, context);
} catch (TTransportException tte) {
Throwable cause = tte.getCause();
if (cause != null && cause instanceof UnknownHostException) {
Member:

null check is redundant here

Suggested change
if (cause != null && cause instanceof UnknownHostException) {
if (cause instanceof UnknownHostException) {

Throwable cause = tte.getCause();
if (cause != null && cause instanceof UnknownHostException) {
// do not expect to recover from this
throw new RuntimeException(tte);
Member:

UnknownHostException is an IOException, so you could use that:

Suggested change
throw new RuntimeException(tte);
throw new UncheckedIOException((UnknownHostException) cause);

or to also preserve the original:

Suggested change
throw new RuntimeException(tte);
var x = new UncheckedIOException((UnknownHostException) cause);
x.addSuppressed(tte);
throw x;

return coordinatorProcess == null ? Set.of() : Set.of(coordinatorProcess);
case COMPACTOR:
return compactorProcesses == null ? Set.of()
: Set.of(compactorProcesses.toArray(new Process[] {}));
Member:

I think this syntax might be slightly shorter.

Suggested change
: Set.of(compactorProcesses.toArray(new Process[] {}));
: Set.of(compactorProcesses.toArray(new Process[0]));

But this is probably better: since you already have a collection, there's no need to make an intermediate array:

Suggested change
: Set.of(compactorProcesses.toArray(new Process[] {}));
: Set.copyOf(compactorProcesses);

Unless you wanted to check for duplicates and throw an exception, in which case the first option is better. But I don't really think duplicates are a risk here.

}

protected void requestShutdown() {
shutdownRequested.compareAndSet(false, true);
Member:

No need to do the compare in this base implementation, since you're ignoring the return value. You can just set it.

Suggested change
shutdownRequested.compareAndSet(false, true);
shutdownRequested.set(true);

requestShutdown();
}

protected void requestShutdown() {
Member:

I'm not sure why the second method exists. It seems that subclasses can just extend gracefulShutdown instead.

Alternatively, instead of extending either of these, make the gracefulShutdown method call a registered graceful-shutdown hook that's passed in via the constructor. You can even run it as a separate thread, launched from the gracefulShutdown method, so that you can guarantee it isn't blocking the main thread.
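A rough sketch of that alternative, with illustrative names: the hook is supplied at construction time and launched on its own thread, so it can never block the caller, and the compare-and-set ensures it runs at most once.

import java.util.concurrent.atomic.AtomicBoolean;

abstract class AbstractServerSketch {
  private final AtomicBoolean shutdownRequested = new AtomicBoolean(false);
  private final Runnable shutdownHook;

  protected AbstractServerSketch(Runnable shutdownHook) {
    this.shutdownHook = shutdownHook;
  }

  // Invoked by the Thrift handler when a graceful shutdown is requested.
  public final void gracefulShutdown() {
    if (shutdownRequested.compareAndSet(false, true)) {
      new Thread(shutdownHook, "graceful-shutdown-hook").start();
    }
  }

  public final boolean isShutdownRequested() {
    return shutdownRequested.get();
  }
}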

Comment on lines +112 to +116
@Parameter(names = {"-h", "--host"}, description = "<host>")
String hostname = null;

@Parameter(names = {"-p", "--port"}, description = "<port>")
int port = 0;
Member:

It seems unnecessary to use JCommander to split these into separate options. Why not just do:

bin/accumulo admin signalShutdown <host:port>

No need for a separate -h and -p option; they go together anyway. It'd be weird to split them up only to pass them as separate params, so that the code combines them again to act upon it.
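A minimal sketch of parsing the combined argument, splitting on the last colon so the two halves never need to travel separately (bracketed IPv6 handling omitted):

public class HostPortArg {
  public static void main(String[] args) {
    String arg = args.length > 0 ? args[0] : "localhost:9995";
    int idx = arg.lastIndexOf(':');
    if (idx < 0) {
      throw new IllegalArgumentException("expected <host:port>, got: " + arg);
    }
    String host = arg.substring(0, idx);
    int port = Integer.parseInt(arg.substring(idx + 1));
    System.out.println("host=" + host + ", port=" + port);
  }
}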

@@ -170,7 +175,8 @@
 import io.opentelemetry.api.trace.Span;
 import io.opentelemetry.context.Scope;

-public class TabletServer extends AbstractServer implements TabletHostingServer {
+public class TabletServer extends AbstractServer
+    implements TabletHostingServer, ServerProcessService.Iface {
Member:

A lot of these have redundant interface declarations since it's already declared on the AbstractServer super class. It'd be better to omit ServerProcessService.Iface from the child classes, so it's easier to see where things are coming from in the class hierarchy.

dlmarion (Contributor, Author):

That doesn't work with Thrift. You have to put the Thrift interface declarations on the class being used. I agree that they are redundant, but I have run into this many times, especially when overriding behavior of a Thrift server in a test.

Member:

Hmm, interesting. I wasn't aware there was a problem with that. Then, perhaps it should be put on a separate "handler" class, like the other handlers, and be a member of the AbstractServer?
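A sketch of that handler pattern, using a hypothetical stand-in interface (the real ServerProcessService.Iface is Thrift-generated and has a different shape): the handler carries the Thrift interface and delegates to the server, so server subclasses never re-declare it.

// Hypothetical stand-in for the Thrift-generated service interface.
interface ServerProcessIface {
  void gracefulShutdown();
}

// Owned by the server (e.g. a field of AbstractServer); it implements
// the Thrift interface and simply delegates back to the server.
class ServerProcessHandler implements ServerProcessIface {
  private final Runnable requestShutdown;

  ServerProcessHandler(Runnable requestShutdown) {
    this.requestShutdown = requestShutdown;
  }

  @Override
  public void gracefulShutdown() {
    requestShutdown.run(); // e.g. server::requestShutdown
  }
}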

// Don't interrupt the server thread, that will cause
// IO operations to fail as the servers are finishing
// their work.
requestShutdown();
Member:

I think there needs to be some kind of security on this. Since it's coming via thrift, it should be possible to use the ServerContext with the SystemCredentials to verify that the request is coming from an admin utility with access to this cluster's instance.secret / config file.
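One possible shape for such a check, sketched as a generic shared-secret comparison; a real implementation would presumably verify the request against SystemCredentials via the ServerContext rather than this hypothetical helper.

import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;

class ShutdownAuthSketch {
  private final byte[] expectedSecret;

  ShutdownAuthSketch(String instanceSecret) {
    this.expectedSecret = instanceSecret.getBytes(StandardCharsets.UTF_8);
  }

  // Constant-time comparison, so the check doesn't leak timing info;
  // the RPC handler would reject the shutdown call when this is false.
  boolean isAuthorized(byte[] presentedSecret) {
    return MessageDigest.isEqual(expectedSecret, presentedSecret);
  }
}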
