Python debugging support #3075

niloc132 · 2022-11-07T22:06:42Z

This commit adds a configuration option to customize engine-created
threads, and provides a default implementation that will register those
threads for debugging with pydevd if python is enabled.

As of this commit, pydevd debugging seems to work correctly, but
VSCode's debugging doesn't work for all threads yet.

Partial #2997

devinrsmith · 2022-11-07T22:21:19Z

Util/src/main/java/io/deephaven/util/thread/ThreadInitializationFactory.java

+    /* private */ String[] CONFIGURED_INITIALIZATION_TYPES =
+            Configuration.getInstance().getStringArrayFromProperty("thread.initialization");


Does this work if the property doesn't exist or is empty?

It fails if it does not exist, but succeeds if empty (with an empty array).

Due to how defaults work (or rather, don't work), I was under the impression we wanted to generally avoid using them, so dh-defaults.prop now contains this property.

py/server/deephaven/__init__.py

devinrsmith · 2022-11-07T22:27:37Z

server/src/main/java/io/deephaven/server/console/python/DebuggingInitializer.java

+            // First call in to create a custom function that has the same name as the Java thread (plus a prefix)
+            PyObject runnableResult = py_deephaven.create_thread_entry(Thread.currentThread().getName());
+            // runnable.run();
+            // Invoke that function directly from Java, so that we have only this one initial frame
+            runnableResult.call("__call__", runnable);


I think we could do this with a single method. The naming would probably change:

py_deephaven.run_for_java_thread(Thread.currentThread().getName(), runnable) or similar

Potentially, and that's how the first implementation worked. However, if we wanted the top-most frame to have a specific name (so that you can tell what Java thread something was running on by looking at the py stack trace, which doesn't come with thread info), we would need to create a custom def to return in the call - the debugger at least loses the __name__ and __qualname__ properties that are assigned to rename defs. As such, we may need to return a pyobject so that it can be called again.

And, if we just use the runnable.run() above (presently commented out), then we don't need to pass runnable at all, so the signature stays the same. In that case however it appears to the debugger that any pydevd.settrace() call in the web IDE actually pauses in the create_change_list function.

Would this provide the same benefits?

def run_for_java_thread(name, runnable): ... def JavaThread(): runnable.run() JavaThread()

Changing the function's name (via declaring JavaThread not by name, but through eval) so the thread appears: technically yes, but by adding two frames instead of one, so the topmost frame isn't named in this way.

Adding one py frame earlier in the stack than the current code: technically yes, but again, because it adds two frames instead of one, we might be making it harder to understand instead of easier

Remember, these run once per thread, when the thread is created, so on the order of 10ish times on startup and once when the first python command is executed from the web console, so paying the extra jni/py round trip is pretty cheap.

devinrsmith · 2022-11-07T22:34:24Z

server/src/main/java/io/deephaven/server/console/python/DebuggingInitializer.java

+            // First call in to create a custom function that has the same name as the Java thread (plus a prefix)
+            PyObject runnableResult = py_deephaven.create_thread_entry(Thread.currentThread().getName());
+            // runnable.run();
+            // Invoke that function directly from Java, so that we have only this one initial frame
+            runnableResult.call("__call__", runnable);


This technique of ensuring the java frame includes python in the Runnable stack does rely on the fact that python drops the GIL when calling into java. jpy-consortium/jpy#48 nothing actionable, but good to note it wouldn't work otherwise.

True, but the fact that DHaaL works at all relies on that fact as well, the entire library would deadlock on the gil and ugp if we didn't make this assumption.

devinrsmith · 2022-11-07T22:37:08Z

Util/src/main/java/io/deephaven/util/thread/ThreadInitializationFactory.java

+public interface ThreadInitializationFactory {
+    /* private */ String[] CONFIGURED_INITIALIZATION_TYPES =
+            Configuration.getInstance().getStringArrayFromProperty("thread.initialization");
+    /* private */ List<ThreadInitializationFactory> INITIALIZERS = Arrays.stream(CONFIGURED_INITIALIZATION_TYPES)


I'm wondering, do we think there are occasions for more than 1 ThreadInitializationFactory?

I have assumed for now that each factory should be created once, but it if wants per-thread state, it should do so by putting that state in the createInitializer method itself, or in the returned Runnable. By only instantiating these once, we grant control to each implementation what kind of scope it wants to have.

For example, the current DebuggingInitializer never closes the deephaven module (in its current form, it should), but could be modified to instead avoid paying the cost of re-creating that proxy for each thread (not just UGP threads, but also the DeephavenApiServerModule scheduler threads too).

rcaudy · 2022-11-14T22:12:48Z

Util/src/main/java/io/deephaven/util/thread/ThreadInitializationFactory.java

+    static Runnable wrapRunnable(Runnable runnable) {
+        Runnable acc = runnable;
+        for (ThreadInitializationFactory INITIALIZER : INITIALIZERS) {
+            acc = INITIALIZER.createInitializer(acc);
+        }
+        return acc;
+    }


Just to be sure I understand, the reason for this pattern is to allow for initializer-specific cleanup on exit? Otherwise, a pattern of iterative runnable invocations would seem less error-prone. I suppose either version allows arbitrary hijacking of the intended run method; the current pattern could skip calling the wrapped runnable, but the iterative approach could just never return or throw an exception on termination.

It isn't that we need to do cleanup on the way out per se, but that we want to insert a stack frame in python. If we end up dropping this frame (which has some weird side effects when you "step out" of the apparent top-level frame), then we could just run one after the other, no wrapping.

The current impl has its option of how it wants to do it - I played with an API that would offer a DSL for either wrapping or prefixing, but didn't really love it, I can try bringing that back if you'd prefer.

Okay, in light of the arm64 issues, a wrapper is necessary. The question then will be how we name that, if JavaThread is clear and simple enough, or if we want to use it to get some more info to the user.

engine/updategraph/src/main/java/io/deephaven/engine/updategraph/UpdateGraphProcessor.java

server/src/main/java/io/deephaven/server/runner/DeephavenApiServerModule.java

rcaudy · 2022-11-14T22:22:54Z

server/src/main/java/io/deephaven/server/console/python/DebuggingInitializer.java

+            // python not enabled, don't accidentally start it
+            return runnable;
+        }
+        DeephavenModule py_deephaven = (DeephavenModule) PyModule.importModule("deephaven")


Note from @rcaudy : I haven't reviewed the bottom of this file past this point, or py/server/deephaven/__init__.py yet.

default

debug

rcaudy

Approving for Python server codeowners, no interface code here.

niloc132 added the python-server-side label Nov 7, 2022

niloc132 added this to the Nov 2022 milestone Nov 7, 2022

niloc132 requested review from jmao-denver, chipkent, rcaudy and devinrsmith November 7, 2022 22:06

niloc132 added the NoDocumentationNeeded label Nov 7, 2022

devinrsmith reviewed Nov 7, 2022

View reviewed changes

rcaudy reviewed Nov 14, 2022

View reviewed changes

niloc132 force-pushed the py-debugging-in-ugp branch from 1b5cc40 to cb3bbb4 Compare November 28, 2022 16:40

niloc132 force-pushed the py-debugging-in-ugp branch from cb3bbb4 to 552bc50 Compare December 8, 2022 21:56

niloc132 added the ReleaseNotesNeeded Release notes are needed label Dec 8, 2022

niloc132 marked this pull request as ready for review December 9, 2022 20:08

niloc132 force-pushed the py-debugging-in-ugp branch from eed53de to fa33665 Compare December 15, 2022 16:01

niloc132 force-pushed the py-debugging-in-ugp branch from 7181b37 to cb4076f Compare January 13, 2023 18:20

niloc132 added 11 commits January 19, 2023 15:26

Remove unused threadfactory types

a69328d

Expand usage of NamingThreadFactory, make the usual value for daemon

27b1c22

default

Move the default script/console lang to config as source of truth

c7be4df

Wrap thread initialization with optional extra wiring, including py

a55c57f

debug

Move java+py threading to its own file, start deephaven module later

ad531d8

Draft at checking other debug libraries, needs debugging

42bed8c

Attempt at correctly setting up debugging in current thread

9fd1a57

Handle absence of initializers, and add better defaults

e8146c8

review feedback

98d6553

remove test debugging property

a1c2d53

Lazily create the py module for java threads

2a7b4b1

niloc132 force-pushed the py-debugging-in-ugp branch from cb4076f to 2a7b4b1 Compare January 19, 2023 21:46

niloc132 requested a review from devinrsmith January 20, 2023 14:14

Merge remote-tracking branch 'upstream/main' into py-debugging-in-ugp

a9b501d

devinrsmith approved these changes Feb 3, 2023

View reviewed changes

devinrsmith requested a review from rcaudy February 3, 2023 22:54

niloc132 changed the title ~~Py debugging in ugp~~ Python debugging support Feb 3, 2023

rcaudy approved these changes Feb 3, 2023

View reviewed changes

niloc132 merged commit 46ef898 into deephaven:main Feb 3, 2023

github-actions bot locked and limited conversation to collaborators Feb 3, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python debugging support #3075

Python debugging support #3075

niloc132 commented Nov 7, 2022 •

edited

Loading

devinrsmith Nov 7, 2022

niloc132 Nov 7, 2022

devinrsmith Nov 7, 2022

niloc132 Nov 7, 2022

devinrsmith Nov 7, 2022

niloc132 Nov 14, 2022

devinrsmith Nov 7, 2022

niloc132 Nov 7, 2022

devinrsmith Nov 7, 2022

niloc132 Nov 7, 2022

rcaudy Nov 14, 2022

niloc132 Nov 15, 2022

niloc132 Dec 9, 2022

rcaudy Nov 14, 2022

rcaudy left a comment

		/* private */ String[] CONFIGURED_INITIALIZATION_TYPES =
		Configuration.getInstance().getStringArrayFromProperty("thread.initialization");

Python debugging support #3075

Python debugging support #3075

Conversation

niloc132 commented Nov 7, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rcaudy left a comment

Choose a reason for hiding this comment

niloc132 commented Nov 7, 2022 •

edited

Loading