fix: further stack reductions for nanox support #45

ryankurte · 2023-05-20T02:21:52Z

it turns out the nanox has a more severe stack use limitation, faulting on -memory access- when the stack pointer exceeds 8k, rather than only when invoking syscalls on the nanosplus.

it is expected that this limit will be resolved in future firmware, for now we should be able to avoid the issue by pre-allocating and using out-pointer tricks to avoid rust's lack of working copy elision / NRVO.

following #23, this PR:

refactors events to be self contained (simplifying passing these by reference)
moves ui, event, and object storage to the heap w/ pointer-based initialisation
splits out identity and key image functions so these can no-longer be inlined in Engine::update

this reduces our worst-case stack depth by ~2k down to ~7524 bytes which appears to get us under the limit on the nanox. this seems to be okay in the simulator, however, needs to be validated on an actual device.

- move UI context to heap - split out key image and ident frames from Engine::update approx 9124 -> 8412 at ring_signing call

…lobal

eranrund · 2023-05-22T16:34:09Z

I'm curious, how does no-inlining help reduce stack usage?

eranrund

Changes all look good to me. I'd still like to understand how no inlining helps here.
Thanks!

ryankurte · 2023-05-22T21:54:12Z

thanks for the review!

I'm curious, how does no-inlining help reduce stack usage?

so LLVM's optimiser really likes to inline functions which makes a lot of sense for performance, but when you inline functions the stack frames get merged. say you have two functions called sequentially that both use 4k of local storage, when inlined that becomes 8k in the caller stack frame whereas non-inlined they'll each have their own 4k frame and when called sequentially will re-use the same (SP+4k) space on the stack.

none of this usually matters that much but it's exacerbated by rust not yet having working copy elision / NRVO and the rather severe 8k limit on the stack pointer (vs. ~20k of available memory) in the nanox OS.

eranrund · 2023-05-22T22:16:37Z

thanks for the review!

I'm curious, how does no-inlining help reduce stack usage?

so LLVM's optimiser really likes to inline functions which makes a lot of sense for performance, but when you inline functions the stack frames get merged. say you have two functions called sequentially that both use 4k of local storage, when inlined that becomes 8k in the caller stack frame whereas non-inlined they'll each have their own 4k frame and when called sequentially will re-use the same (SP+4k) space on the stack.

none of this usually matters that much but it's exacerbated by rust not yet having working copy elision / NRVO and the rather severe 8k limit on the stack pointer (vs. ~20k of available memory) in the nanox OS.

Gotcha, thanks for explaining!

ryankurte added 3 commits May 20, 2023 11:33

working to reduce stack allocations

f6811fe

- move UI context to heap - split out key image and ident frames from Engine::update approx 9124 -> 8412 at ring_signing call

refactor events to be self-contained, make event and output objects g…

c12a522

…lobal

fix debug makefile commands

6a38e8d

ryankurte added the bug Something isn't working label May 20, 2023

ryankurte self-assigned this May 20, 2023

ryankurte requested a review from a team as a code owner May 20, 2023 02:21

ryankurte added 3 commits May 20, 2023 14:24

block digest update inlining

941ed09

lint

0eba844

fix lints

003a698

eranrund approved these changes May 22, 2023

View reviewed changes

ryankurte merged commit 3dc218e into main May 22, 2023

ryankurte deleted the fix/nanox-stack-reduction branch May 22, 2023 21:55

ryankurte mentioned this pull request May 22, 2023

fix application on nanox devices #27

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: further stack reductions for nanox support #45

fix: further stack reductions for nanox support #45

ryankurte commented May 20, 2023 •

edited

Loading

eranrund commented May 22, 2023

eranrund left a comment

ryankurte commented May 22, 2023

eranrund commented May 22, 2023

fix: further stack reductions for nanox support #45

fix: further stack reductions for nanox support #45

Conversation

ryankurte commented May 20, 2023 • edited Loading

eranrund commented May 22, 2023

eranrund left a comment

Choose a reason for hiding this comment

ryankurte commented May 22, 2023

eranrund commented May 22, 2023

ryankurte commented May 20, 2023 •

edited

Loading