refactor: proposal: Modularise VM Ops. #794

ajnavarro · 2023-05-02T17:13:53Z

Description

This is a proposal to refactor the VM and modularize code on the nodes themselves, offloading complexity from the VM.

Why?

IMHO the Machine struct is too complex and will benefit from a refactoring modularizing code into its own nodes.

This modularization will allow to evolve the VM much faster when adding new language expressions, tests, or fixing bugs.

Also, after implementing this PoC, I discovered that a lot of duplicated code can be avoided.

Another benefit will be that Operations, Nodes using these operations, and costs will all be in the same place.

How?

We can define a new method on the Node interface:

	Exec(m *Machine, o Op) (int64, error)

This method will be in charge of analyzing the operation, adding or removing elements into the stack, and returning the CPU usage depending on the executed instructions. This method is temporal, and eventually, instead of receiving the *Machine pointer, it will use a context containing the program stack (that can be serialized and mutated), the store (that can be queried), GC, Allocator..., making the VM stateless:

	Exec(ctx *Context, o Op) (int64, error)

Also, all specific code for each node when calling OpExec can ve moved to its specific Node implementation too, simplifying func (m *Machine) doOpExec(op Op) method.

I think that this change, instead of having a list of Statements and Expressions, will also allow to have everything on the same list of Nodes. We can iterate over that list of nodes and call Exec method on each, simplifying Run() method.

On this PR you can see an example of only refactoring Assign Statements, to let you have an idea about how it will look.

Some cases are not fully covered yet (the main one is related to some logic on PopStmt() method), that's why some tests are still failing.

I would love to know what you think about this.

Checklist (for contributors)

I have assigned this PR to myself
I have added at least 1 reviewer
I have added the relevant labels
I have updated the official documentation
I have added sufficient documentation in code

Example showing a proposal to modularise VM functionality into nodes and offload complexity from the VM itself. Signed-off-by: Antonio Navarro Perez <antnavper@gmail.com>

ajnavarro · 2023-05-02T17:15:03Z

gnovm/pkg/gnolang/machine.go

 //----------------------------------------
 // main run loop.

+func (m *Machine) doExec(op Op) bool {


this method will eventually disappear. It is made to make it compatible with old behavior.

moul · 2023-05-03T16:52:09Z

My initial impression is that this approach could increase cognitive complexity.
It requires navigating through a complex switch/if structure and constantly switching between multiple files, which can be challenging.

While the complete plan may have merit, the current partial implementation seems inadequate. Could you provide more information about your vision for this pull request and how it will look when fully implemented?

Any example of how much a future change will be simplified?

ajnavarro · 2023-05-04T13:10:57Z

It requires navigating through a complex switch/if structure and constantly switching between multiple files, which can be challenging.

Navigating through a complex switch statement is exactly what I want to avoid :). These are all the switch cases removed just after refactoring one statement: https://github.com/gnolang/gno/pull/794/files#diff-0830c8c26a4042165bbd447eb4955f11a0789f85ddfe85b794f6db2f26115aa5L1249-L1287

Also this switch cases will disappear and they will integrated with the new structure called by Exec method: https://github.com/gnolang/gno/blob/master/gnovm/pkg/gnolang/op_exec.go#L435-L482

While the complete plan may have merit, the current partial implementation seems inadequate. Could you provide more information about your vision for this pull request and how it will look when fully implemented?

Actual Machine.Run method:

Details

func (m *Machine) Run() {
	for {
		op := m.PopOp()
		// TODO: this can be optimized manually, even into tiers.
		switch op {
		/* Control operators */
		case OpHalt:
			m.incrCPU(OpCPUHalt)
			return
		case OpNoop:
			m.incrCPU(OpCPUNoop)
			continue
		case OpExec:
			m.incrCPU(OpCPUExec)
			m.doOpExec(op)
		case OpPrecall:
			m.incrCPU(OpCPUPrecall)
			m.doOpPrecall()
		case OpCall:
			m.incrCPU(OpCPUCall)
			m.doOpCall()
		case OpCallNativeBody:
			m.incrCPU(OpCPUCallNativeBody)
			m.doOpCallNativeBody()
		case OpReturn:
			m.incrCPU(OpCPUReturn)
			m.doOpReturn()
		case OpReturnFromBlock:
			m.incrCPU(OpCPUReturnFromBlock)
			m.doOpReturnFromBlock()
		case OpReturnToBlock:
			m.incrCPU(OpCPUReturnToBlock)
			m.doOpReturnToBlock()
		case OpDefer:
			m.incrCPU(OpCPUDefer)
			m.doOpDefer()
		case OpPanic1:
			m.incrCPU(OpCPUPanic1)
			m.doOpPanic1()
		case OpPanic2:
			m.incrCPU(OpCPUPanic2)
			m.doOpPanic2()
		case OpCallDeferNativeBody:
			m.incrCPU(OpCPUCallDeferNativeBody)
			m.doOpCallDeferNativeBody()
		case OpGo:
			m.incrCPU(OpCPUGo)
			panic("not yet implemented")
		case OpSelect:
			m.incrCPU(OpCPUSelect)
			panic("not yet implemented")
		case OpSwitchClause:
			m.incrCPU(OpCPUSwitchClause)
			m.doOpSwitchClause()
		case OpSwitchClauseCase:
			m.incrCPU(OpCPUSwitchClauseCase)
			m.doOpSwitchClauseCase()
		case OpTypeSwitch:
			m.incrCPU(OpCPUTypeSwitch)
			m.doOpTypeSwitch()
		case OpIfCond:
			m.incrCPU(OpCPUIfCond)
			m.doOpIfCond()
		case OpPopValue:
			m.incrCPU(OpCPUPopValue)
			m.PopValue()
		case OpPopResults:
			m.incrCPU(OpCPUPopResults)
			m.PopResults()
		case OpPopBlock:
			m.incrCPU(OpCPUPopBlock)
			m.PopBlock()
		case OpPopFrameAndReset:
			m.incrCPU(OpCPUPopFrameAndReset)
			m.PopFrameAndReset()
		/* Unary operators */
		case OpUpos:
			m.incrCPU(OpCPUUpos)
			m.doOpUpos()
		case OpUneg:
			m.incrCPU(OpCPUUneg)
			m.doOpUneg()
		case OpUnot:
			m.incrCPU(OpCPUUnot)
			m.doOpUnot()
		case OpUxor:
			m.incrCPU(OpCPUUxor)
			m.doOpUxor()
		case OpUrecv:
			m.incrCPU(OpCPUUrecv)
			m.doOpUrecv()
		/* Binary operators */
		case OpLor:
			m.incrCPU(OpCPULor)
			m.doOpLor()
		case OpLand:
			m.incrCPU(OpCPULand)
			m.doOpLand()
		case OpEql:
			m.incrCPU(OpCPUEql)
			m.doOpEql()
		case OpNeq:
			m.incrCPU(OpCPUNeq)
			m.doOpNeq()
		case OpLss:
			m.incrCPU(OpCPULss)
			m.doOpLss()
		case OpLeq:
			m.incrCPU(OpCPULeq)
			m.doOpLeq()
		case OpGtr:
			m.incrCPU(OpCPUGtr)
			m.doOpGtr()
		case OpGeq:
			m.incrCPU(OpCPUGeq)
			m.doOpGeq()
		case OpAdd:
			m.incrCPU(OpCPUAdd)
			m.doOpAdd()
		case OpSub:
			m.incrCPU(OpCPUSub)
			m.doOpSub()
		case OpBor:
			m.incrCPU(OpCPUBor)
			m.doOpBor()
		case OpXor:
			m.incrCPU(OpCPUXor)
			m.doOpXor()
		case OpMul:
			m.incrCPU(OpCPUMul)
			m.doOpMul()
		case OpQuo:
			m.incrCPU(OpCPUQuo)
			m.doOpQuo()
		case OpRem:
			m.incrCPU(OpCPURem)
			m.doOpRem()
		case OpShl:
			m.incrCPU(OpCPUShl)
			m.doOpShl()
		case OpShr:
			m.incrCPU(OpCPUShr)
			m.doOpShr()
		case OpBand:
			m.incrCPU(OpCPUBand)
			m.doOpBand()
		case OpBandn:
			m.incrCPU(OpCPUBandn)
			m.doOpBandn()
		/* Expression operators */
		case OpEval:
			m.incrCPU(OpCPUEval)
			m.doOpEval()
		case OpBinary1:
			m.incrCPU(OpCPUBinary1)
			m.doOpBinary1()
		case OpIndex1:
			m.incrCPU(OpCPUIndex1)
			m.doOpIndex1()
		case OpIndex2:
			m.incrCPU(OpCPUIndex2)
			m.doOpIndex2()
		case OpSelector:
			m.incrCPU(OpCPUSelector)
			m.doOpSelector()
		case OpSlice:
			m.incrCPU(OpCPUSlice)
			m.doOpSlice()
		case OpStar:
			m.incrCPU(OpCPUStar)
			m.doOpStar()
		case OpRef:
			m.incrCPU(OpCPURef)
			m.doOpRef()
		case OpTypeAssert1:
			m.incrCPU(OpCPUTypeAssert1)
			m.doOpTypeAssert1()
		case OpTypeAssert2:
			m.incrCPU(OpCPUTypeAssert2)
			m.doOpTypeAssert2()
		case OpStaticTypeOf:
			m.incrCPU(OpCPUStaticTypeOf)
			m.doOpStaticTypeOf()
		case OpCompositeLit:
			m.incrCPU(OpCPUCompositeLit)
			m.doOpCompositeLit()
		case OpArrayLit:
			m.incrCPU(OpCPUArrayLit)
			m.doOpArrayLit()
		case OpSliceLit:
			m.incrCPU(OpCPUSliceLit)
			m.doOpSliceLit()
		case OpSliceLit2:
			m.incrCPU(OpCPUSliceLit2)
			m.doOpSliceLit2()
		case OpFuncLit:
			m.incrCPU(OpCPUFuncLit)
			m.doOpFuncLit()
		case OpMapLit:
			m.incrCPU(OpCPUMapLit)
			m.doOpMapLit()
		case OpStructLit:
			m.incrCPU(OpCPUStructLit)
			m.doOpStructLit()
		case OpConvert:
			m.incrCPU(OpCPUConvert)
			m.doOpConvert()
		/* GoNative Operators */
		case OpArrayLitGoNative:
			m.incrCPU(OpCPUArrayLitGoNative)
			m.doOpArrayLitGoNative()
		case OpSliceLitGoNative:
			m.incrCPU(OpCPUSliceLitGoNative)
			m.doOpSliceLitGoNative()
		case OpStructLitGoNative:
			m.incrCPU(OpCPUStructLitGoNative)
			m.doOpStructLitGoNative()
		case OpCallGoNative:
			m.incrCPU(OpCPUCallGoNative)
			m.doOpCallGoNative()
		/* Type operators */
		case OpFieldType:
			m.incrCPU(OpCPUFieldType)
			m.doOpFieldType()
		case OpArrayType:
			m.incrCPU(OpCPUArrayType)
			m.doOpArrayType()
		case OpSliceType:
			m.incrCPU(OpCPUSliceType)
			m.doOpSliceType()
		case OpChanType:
			m.incrCPU(OpCPUChanType)
			m.doOpChanType()
		case OpFuncType:
			m.incrCPU(OpCPUFuncType)
			m.doOpFuncType()
		case OpMapType:
			m.incrCPU(OpCPUMapType)
			m.doOpMapType()
		case OpStructType:
			m.incrCPU(OpCPUStructType)
			m.doOpStructType()
		case OpInterfaceType:
			m.incrCPU(OpCPUInterfaceType)
			m.doOpInterfaceType()
		case OpMaybeNativeType:
			m.incrCPU(OpCPUMaybeNativeType)
			m.doOpMaybeNativeType()
		/* Statement operators */
		case OpAssign:
			m.incrCPU(OpCPUAssign)
			m.doOpAssign()
		case OpAddAssign:
			m.incrCPU(OpCPUAddAssign)
			m.doOpAddAssign()
		case OpSubAssign:
			m.incrCPU(OpCPUSubAssign)
			m.doOpSubAssign()
		case OpMulAssign:
			m.incrCPU(OpCPUMulAssign)
			m.doOpMulAssign()
		case OpQuoAssign:
			m.incrCPU(OpCPUQuoAssign)
			m.doOpQuoAssign()
		case OpRemAssign:
			m.incrCPU(OpCPURemAssign)
			m.doOpRemAssign()
		case OpBandAssign:
			m.incrCPU(OpCPUBandAssign)
			m.doOpBandAssign()
		case OpBandnAssign:
			m.incrCPU(OpCPUBandnAssign)
			m.doOpBandnAssign()
		case OpBorAssign:
			m.incrCPU(OpCPUBorAssign)
			m.doOpBorAssign()
		case OpXorAssign:
			m.incrCPU(OpCPUXorAssign)
			m.doOpXorAssign()
		case OpShlAssign:
			m.incrCPU(OpCPUShlAssign)
			m.doOpShlAssign()
		case OpShrAssign:
			m.incrCPU(OpCPUShrAssign)
			m.doOpShrAssign()
		case OpDefine:
			m.incrCPU(OpCPUDefine)
			m.doOpDefine()
		case OpInc:
			m.incrCPU(OpCPUInc)
			m.doOpInc()
		case OpDec:
			m.incrCPU(OpCPUDec)
			m.doOpDec()
		/* Decl operators */
		case OpValueDecl:
			m.incrCPU(OpCPUValueDecl)
			m.doOpValueDecl()
		case OpTypeDecl:
			m.incrCPU(OpCPUTypeDecl)
			m.doOpTypeDecl()
		/* Loop (sticky) operators */
		case OpBody:
			m.incrCPU(OpCPUBody)
			m.doOpExec(op)
		case OpForLoop:
			m.incrCPU(OpCPUForLoop)
			m.doOpExec(op)
		case OpRangeIter:
			m.incrCPU(OpCPURangeIter)
			m.doOpExec(op)
		case OpRangeIterArrayPtr:
			m.incrCPU(OpCPURangeIterArrayPtr)
			m.doOpExec(op)
		case OpRangeIterString:
			m.incrCPU(OpCPURangeIterString)
			m.doOpExec(op)
		case OpRangeIterMap:
			m.incrCPU(OpCPURangeIterMap)
			m.doOpExec(op)
		case OpReturnCallDefers:
			m.incrCPU(OpCPUReturnCallDefers)
			m.doOpReturnCallDefers()
		default:
			panic(fmt.Sprintf("unexpected opcode %s", op.String()))
		}
	}
}

After the changes:

Details

func (m *Machine) Run() {
	for {
		op := m.PopOp()
		// TODO: this can be optimized manually, even into tiers.
		switch op {
		/* Control operators */
		case OpHalt:
			m.incrCPU(OpCPUHalt)
			return
		case OpNoop:
			m.incrCPU(OpCPUNoop)
			continue
        }

        cycles := m.PopNode().Exec(m, op)
		m.incrCPU(cycles)
	}
}

Any example of how much a future change will be simplified?

You won't need to change the VM code to be able to parse a new expression, all the logic will be encapsulated on the Node type itself. The only thing that you will need to do is to add the parsing logic on GoToGno (something that is needed now too). Example: add support for goroutines, or new types.
We can benchmark every node Exec method in an isolated way, making it possible to tweak the actual CPU cost for every node and change it if we improve that Node's performance.

thehowl · 2023-05-31T13:17:04Z

gnovm/pkg/gnolang/op_assign.go

+
+var ErrOpNotSupported = errors.New("operation not supported")
+
+var assignOps map[Op]func(s *AssignStmt, m *Machine) (cpuCycles int64) = map[Op]func(s *AssignStmt, m *Machine) int64{


maps are expensive and Op is a uint8, so change to map to [256]func(...) ...?

moul · 2023-05-31T21:39:03Z

@mvertes, interesting topic for you here.

mvertes · 2023-06-01T16:18:06Z

As @moul gently asked, here is my view on this proposal. It's sure a good way for me to learn more on this project that I'm just starting to discover.

At first, I was reluctant to this change. Why?
Because the affected function func (m *Machine) Run() exactly looks like a classical virtual machine main processing function. It has a single forever infinite loop (with no branch, good), and inside a single switch on all the opcodes. Fine. Perfect for the Go compiler to optimise (recent versions of go are able to transform large cases in jump tables). Everything at the same place. The pattern is super simple, and is easy to understand despite the length. No reason to change that.

What is expected then (outside the scope of this PR) is that all the opcode processing functions are so trivial and simple that the compiler would inline them in Run(), removing another important source of overhead, the cost of function call in the most critical part of the VM, the exec loop.

And this is where I changed my mind.

It would work with a real byte code architecture, where all instructions are atomic, or close to. But the instruction set here is not of a real or virtual CPU. It maps the AST instead, with all the control flow constructs (if, for, statement blocks, expression blocks, etc). Here the AST is the VM (and it's perfectly fine, it's the architecture used in Awk or Perl interpreters).
The thing is that many opcode functions are not trivial, and some even bear most of the complexity of processing (see for ex: doOpExec()).
So in the end, improving Run() doesn't harm, nor matter.

Probably this PR improves slightly the situation. I am new to this code and was confused to see a VM processing function as a virtual CPU, not an AST processor.

Or maybe the refactoring should clarify this ? I'm not sure.

ajnavarro · 2023-06-14T10:27:21Z

As we delayed the effort needed to tackle this, I will share some slides I had to help with the explanation in future meetings. You can have a look here: https://hackmd.io/@ajnavarro/BJDfoONL2

refactor: proposal: Modularise VM Ops.

0132d5c

Example showing a proposal to modularise VM functionality into nodes and offload complexity from the VM itself. Signed-off-by: Antonio Navarro Perez <antnavper@gmail.com>

ajnavarro added don't merge Please don't merge this functionality temporarily investigating This behavior is still being tested out labels May 2, 2023

ajnavarro self-assigned this May 2, 2023

ajnavarro commented May 2, 2023

View reviewed changes

moul assigned ajnavarro and unassigned ajnavarro May 4, 2023

piux2 requested a review from jaekwon May 8, 2023 07:02

michelleellen mentioned this pull request May 16, 2023

Update Agenda {17.05.2023}_#6 gnolang/meetings#9

Merged

michelleellen mentioned this pull request May 30, 2023

Create agenda #7 {31.05.2023}.md gnolang/meetings#13

Merged

10 tasks

thehowl reviewed May 31, 2023

View reviewed changes

zivkovicmilos self-requested a review June 5, 2023 15:30

moul requested a review from mvertes June 26, 2023 15:21

moul added this to the 💡Someday/Maybe milestone Sep 6, 2023

ajnavarro closed this Mar 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: proposal: Modularise VM Ops. #794

refactor: proposal: Modularise VM Ops. #794

ajnavarro commented May 2, 2023 •

edited

Loading

ajnavarro May 2, 2023

moul commented May 3, 2023

ajnavarro commented May 4, 2023

thehowl May 31, 2023

moul commented May 31, 2023

mvertes commented Jun 1, 2023

ajnavarro commented Jun 14, 2023


		var ErrOpNotSupported = errors.New("operation not supported")

		var assignOps map[Op]func(s AssignStmt, m Machine) (cpuCycles int64) = map[Op]func(s AssignStmt, m Machine) int64{

refactor: proposal: Modularise VM Ops. #794

refactor: proposal: Modularise VM Ops. #794

Conversation

ajnavarro commented May 2, 2023 • edited Loading

Description

Why?

How?

Checklist (for contributors)

ajnavarro May 2, 2023

Choose a reason for hiding this comment

moul commented May 3, 2023

ajnavarro commented May 4, 2023

thehowl May 31, 2023

Choose a reason for hiding this comment

moul commented May 31, 2023

mvertes commented Jun 1, 2023

ajnavarro commented Jun 14, 2023

ajnavarro commented May 2, 2023 •

edited

Loading