Concept of Diagnostic Runtimes #2082

pepyakin · 2019-03-22T14:13:39Z

This issue #1790 made me to come up with an idea of a diagnostic runtime.

To recap: In contracts we have the lack of debuggability problem. In the best case, if the top-level contract fails you get the execution failure error. In the worst case, because contract failures don't cascade by default (i.e. if a leaf contract call fails it doesn't propagate to the root call) you don't get anything. There are even no way to log a message.

The simplest way of solving it is by adding some little features such as posting an event every time a contract generates an error, or e.g. add a special function that logs message that is available only for a testnet.

The problem with a testnet only debugging capabilities that they are no use for the production nets. Mixing debugging code into the runtime is also not the best idea either, because it adds overhead, possibly pessimizes the normal path, increases binary size and enlarges security surface.

It seems to me that a better solution would be to provide this functionality off chain.

Enter diagnostic runtime.

Imagine we had:

A special cfg feature in a runtime, which enables some off-chain debugging capabilities.
An RPC method that is the same as state_call, but that takes one extra parameter: a file path to a wasm runtime to use to execute.
Potentially, a way for a runtime to output a large chunk of data that can be returned by the aforementioned RPC method. Possibly in a file.

Having this, we could add a special path (behind a cfg feature) in the contract module logic which output a trace information.

This, for example, can be done exclusively on the runtime level without touching the substrate host. Here are a few ideas:

We can log traps in the trace.
We can record calls/instantiations, their parameters such as transferred value, consumed gas, input/output buffers.
At the present, the contract module performs instrumentation before executing a contract (e.g. to inject gas metering statements). We could alter the instrumentation so as to add extra statements and tracing. Here are few examples:
1. We can insert a call to an ext_begin_fn function in the prologue and a call to an ext_end_fn in the epilogue of each function. With this we can construct stack traces inside of contracts in the case of errors.
2. Or could can instrument every execution path to
3. We could log every parameter
4. In the extreme, we could instrument every instruction and get every operand trace.
5. or as an another extreme option, we could alter wasmi and compile it in wasm and use it instead of using sandboxing.
we can add a function ext_print to the contract runtime. In the normal path it just doesn't do anything (i.e. don't even charge for gas for reading the code), but if the diagnostic feature is enabled the function reads the given buffer and records it in the trace. I think @Robbepop could be interested in this one.

After the execution we can assemble this as a trace and spew it out somehow, which could be decoded by tools (or alternatively, we could use human-readable format) or block explorers.

This setup gives us sheer number of possibilities for diagnostics without burdening the substrate host APIs. There is a caveat though: diagnostic runtimes likely would want to be state-compatible (i.e. produce the same state root) with the on-chain runtime just for the sake of being exchangable (i.e. for doing execute_block) and that might be tricky in one cases and limiting in other. However, this is not a hard requirement.

I think it might be possible that this mechanism could be used outside of the contracts module. For instance, we could use this use mechanism for running quick experiments with on-chain data, quickly testing new versions of runtimes at the development time or before an upgrade.

The text was updated successfully, but these errors were encountered:

xlc · 2019-03-22T21:56:12Z

I would like to have ability to run this diagnostic runtime natively so it is possible to use debugger to debug it. It will be even better if there is a way to run the contracts natively as well.

kianenigma · 2021-07-07T08:30:30Z

An RPC method that is the same as state_call, but that takes one extra parameter: a file path to a wasm runtime to use to execute.
Potentially, a way for a runtime to output a large chunk of data that can be returned by the aforementioned RPC method. Possibly in a file.

@emostov once we

implement arbitrary runtime api calls
proper integration into wasm-override instead of the current hack of writing into :code

we basically achieve the same thing through CLI in try-runtime.

liamaharon · 2023-08-14T06:06:10Z

@kianenigma @xlc @pepyakin I wonder if we can condense this issue down into some concrete feature requests for try-runtime and try-runtime-cli?

I see potentially:

Ability to execute the runtime natively so it can hook into debugging tools
Patterns for adding verbose diagnostic logs behind a try-runtime feature flag (that actually works for all types - I've experienced weird issues with existing logging where many variables are logged as empty strings)
Perhaps some patterns for debugging SCs with try-runtime/-cli?

xlc · 2023-08-14T07:56:00Z

(1) will be useful.
(2) is most likely you need sp-debug-derive/force-debug enabled wasm build

liamaharon · 2023-08-14T08:19:16Z

(2) is most likely you need sp-debug-derive/force-debug enabled wasm build

is there any reason not to implicitly enable this when building with --features try-runtime?

pepyakin added J0-enhancement An additional feature request. Z5-epic Can only be fixed by John Skeet. labels Mar 22, 2019

pepyakin mentioned this issue Mar 22, 2019

Scaffolding for contract VM error codes #1935

Closed

ascjones mentioned this issue Apr 9, 2019

Introduce ext_println to contract runtime #2239

Merged

pepyakin mentioned this issue Jun 12, 2019

seal: dedicated function for deliberate revert #2852

Closed

pepyakin mentioned this issue Jul 2, 2019

Avoid constant rehashing #2988

Closed

jimpo mentioned this issue Jul 5, 2019

[RFC] srml-contracts: Remove ext_return #3038

Closed

pepyakin added this to the Ideas milestone Nov 28, 2019

xlc mentioned this issue Dec 12, 2019

Improved Pallet Error Tracing #4385

Closed

pepyakin mentioned this issue Feb 17, 2020

Is it possible to tell what are changed state by a transaction? #4939

Closed

pepyakin mentioned this issue Mar 12, 2020

Enrich detail of Seal contract trap errors #5239

Closed

xlc mentioned this issue Apr 15, 2020

It is hard to distinguish events from a utility.batch call #5639

Closed

xlc mentioned this issue Sep 7, 2020

Wasm Executor local-blob overwrite #7035

Closed

pepyakin mentioned this issue Oct 26, 2020

The Router Module paritytech/polkadot#1679

Closed

2 tasks

kianenigma mentioned this issue Sep 22, 2021

follow-chain testing mode for try-runtime (and revamp CLI configs). #9788

Merged

5 tasks

pepyakin mentioned this issue Oct 11, 2022

Ability to fork network for testing purpose #12442

Closed

pepyakin mentioned this issue Nov 24, 2022

RFC: AbortHandler w3f/polkadot-spec#587

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Concept of Diagnostic Runtimes #2082

Concept of Diagnostic Runtimes #2082

pepyakin commented Mar 22, 2019

xlc commented Mar 22, 2019

kianenigma commented Jul 7, 2021

liamaharon commented Aug 14, 2023

xlc commented Aug 14, 2023

liamaharon commented Aug 14, 2023

Concept of Diagnostic Runtimes #2082

Concept of Diagnostic Runtimes #2082

Comments

pepyakin commented Mar 22, 2019

xlc commented Mar 22, 2019

kianenigma commented Jul 7, 2021

liamaharon commented Aug 14, 2023

xlc commented Aug 14, 2023

liamaharon commented Aug 14, 2023