New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[StackAnalyzer] Analyze the static Stack for composite. #486

Open

spadek67424 wants to merge 130 commits into gwsystems:main from spadek67424:Stackanalyzer

spadek67424 commented Aug 14, 2024 •

edited

Loading

Summary of this Pull Request (PR)

Add description here.

Intent for your PR

Choose one (Mandatory):

This PR is for a code-review and is intended to get feedback, but not to be pulled yet.
This PR is mature, and ready to be integrated into the repo.

Reviewers (Mandatory):

(Specify @<github.com username(s)> of the reviewers. Ex: @user1, @user2)

Code Quality

As part of this pull request, I've considered the following:

Comments adhere to the Style Guide (SG)
Spacing adhere's to the SG
Naming adhere's to the SG
All other aspects of the SG are adhered to, or exceptions are justified in this pull request
I have run the auto formatter on my code before submitting this PR (see doc/auto_formatter.md for instructions)

Code Craftsmanship:

I've made an attempt to remove all redundant code
I've considered ways in which my changes might impact existing code, and cleaned it up
I've formatted the code in an effort to make it easier to read (proper error handling, function use, etc...)
I've commented appropriately where code is tricky
I agree that there is no "throw-away" code, and that code in this PR is of high quality

Testing

I've tested the code using the following test programs (provide list here):

micro_booter
unit_pingpong
unit_schedtests
...(add others here)

spadek67424 added 30 commits

September 29, 2023 13:24


          first commit

60bce86


          add parser.

d650df8


          add sub instruction.

9fdccb3


          add push pop instruction.

6e0096d


          add recurrence.

635a7e6


          restructure.

f6130fd


          fix the bugs of recursive.

c085719


          refactor the code.

394fedb


          refactor the code.

9728df3


          try to use pyelftools.

0cd3544


          use pyelftools to disasm.

5a90a4a


          add symbol.

04e1d56


          add push.

403d921


          migrate to capstone to parse instruction.

b68b1a7


          add elf.

765da7d


          add the read write reg.

8260f58


          add catch operands.

411e692


          split decode and execute stage.

7722dcf


          handle more instruction.

67ad88b


          minor improvement.

1f5f29d


          modify the stack size output.

9803b72


          change push and pop as 8.

782a3ed


          add dhrystone testbench.

af3d12f


          remove useless code.

5dd0a67


          nothing.

76c990c


          add DAG.

934b2e7


          change the ret.

d1414ca


          Merge pull request #1 from spadek67424/feature/DAG

59a592f

Feature/dag


          clean the code.

3c99a77


          Merge pull request #2 from spadek67424/feature/DAG

b81919f

clean the code.

spadek67424 added 5 commits

August 10, 2024 01:57


          I need to fix stack.

81eb39c


          fix the stack address calculation.

f54ddbb


          fix the stack calculation.

8f3833a


          add test.

0c6aa45


          Merge remote-tracking branch 'r1remote/main' into Stackanalyzer

6908ab1

spadek67424 changed the title ~~[StackAnalyzer] T~~ [StackAnalyzer] Analyze the static Stack for composite.

spadek67424 added 2 commits

August 14, 2024 00:54


          remove the unrelated files.

6f55729


          remove unrelated files.

78cc449

mbai1010 reviewed

View reviewed changes

pyelftool_parser/testbench/composite/system_binaries/cos_build-ping/test.py Outdated

@@ @@ -0,0 +1,36 @@ @@
+              import unittest   # The test framework
+              import sys
+              sys.path.insert(0, '/home/minghwu/work/StackAnalyzer/pyelftool_parser/src')

Contributor

mbai1010 Aug 15, 2024 •

edited

Loading

I feel that we may better use a relative path here

mbai1010 reviewed

View reviewed changes

pyelftool_parser/testbench/composite/system_binaries/cos_build-ping/test.py Outdated

+                      ## test printc
+                      self.assertEqual(self.stacklist[12], -416)
+              if __name__ == '__main__':
+                  path = "./tests.unit_pingpong.global.ping"

Contributor

mbai1010 Aug 15, 2024 •

edited

Loading

maybe we should remove the part between the if statement and unittest.main(). They are duplicated with setUp(self) above

mbai1010 and others added 13 commits

August 22, 2024 15:31


          generate the initargs.c based on toml file

bd80d7b


          Merge branch 'gwsystems:main' into Stackanalyzer

1242f1b


          integrate the feedback from mao.

4e0f8d9


          Merge branch 'Stackanalyzer' of github.com:spadek67424/composite into…

9e0fc3e

… Stackanalyzer


          change composer.


          change composer.

943c37c


          try to trigger python script.


          Merge branch 'composite-mao' into main

7f93f1a


          Merge pull request #1 from mbai1010/main

8f80fe3

Add mao's composite to my own.


          change analyzer.

9c21982


          add define Stack_analyzer.

4fc2b92


          clean code.

fd3e1e9


          revert Makefile.

77130cb

gparmer requested changes

View reviewed changes

pyelftool_parser/src/analyzer.py

		@@ -0,0 +1,203 @@
		import sys

Collaborator

gparmer Aug 27, 2024

I think that you likely want the entire pyelftool to be in tools/

Author

spadek67424 Aug 28, 2024

I am not sure why I try to merge from my own repo. Then it keeps moving it to main folder. I think it is my command line problems. We could move it afterward after we decide how to integrate with system.

pyelftool_parser/src/analyzer.py

+                                  pc_flag = 1
+                              if (pc_flag == 1):               ## catch the point until the next symbol, means it is exit point.
+                                  self.exit_pc = i.address
+                              log(f'0x{i.address:x}:\t{i.mnemonic}\t{i.op_str}')

Collaborator

gparmer Aug 27, 2024

I haven't run this, but does this log quite a bit to stdout/err? We'd like the output to be relatively quiet in most cases.

Author

spadek67424 Aug 28, 2024

Yes, there is a debug.py. It is actually a log system. We could decide what level log to stdout.

pyelftool_parser/src/analyzer.py

+                                      self.entry_pc = symbol['st_value']
+                                      log("Set up entry point")
+                                      log(hex(self.entry_pc))
+                                  if(symbol.name == 'custom_acquire_stack'):

Collaborator

gparmer Aug 27, 2024

Am I understanding correctly that you don't treat the __cosrt_s_* symbols as entry points? Is the logic that they each call custom_acquire_stack, so it is OK to start tracing from there? That might miss some stack operations. I think that __cosrt_upcall_entry also uses that symbol, so you might do redundant computations.

I'm likely just missing something here ;-)

Author

spadek67424 Aug 28, 2024

Sorry for that. I just test the __cosrt_upcall_entry. I will give it a look.

pyelftool_parser/src/analyzer.py

+                          if self.register.reg["pc"] in self.symbol.keys():  ## check function block (as basic block but we use function as unit.)
+                              self.stackfunction.append(self.symbol[self.register.reg["pc"]])
+                              logstack(self.symbol[self.register.reg["pc"]])   ## TODO: here is error.
+                              self.register.updatestackreg(self.symbol[self.register.reg["pc"]] == 'custom_acquire_stack') ## if it is acquiring stack address, do not setting the stack size.

Collaborator

gparmer Aug 27, 2024

I can't follow a lot of this logic. Not sure why a stackfunction is doing operations with the pc, or why the pc indexes a symbol correctly. I'm just having a hard time following all of this, which means I likely can't give a useful review.

Author

spadek67424 Aug 28, 2024

I will try to fix the comment here. The comments is misleading. I did not notice that. Thank you for giving the ideas of it. But the idea is that I will calculate the stack size whenever we jump or flow to a new function block. I track how much the stack pointer changed, and I store the changing into an array.

pyelftool_parser/src/analyzer.py


		#### set up next instruction pc

		if (self.index == index_list.index(self.register.reg["pc"])): ## fetch next instruction

Collaborator

gparmer Aug 27, 2024

This is all quite a bit more complicated than I was expecting. Great job figuring it all out!

pyelftool_parser/src/analyzer.py

+                              self.stackfunction.append(self.symbol[self.register.reg["pc"]])
+                              logstack(self.symbol[self.register.reg["pc"]])   ## TODO: here is error.
+                              self.register.updatestackreg(self.symbol[self.register.reg["pc"]] == 'custom_acquire_stack') ## if it is acquiring stack address, do not setting the stack size.
+                              self.stacklist.append(self.register.reg["stack"])

Collaborator

gparmer Aug 27, 2024

We found an API in the library that told you if an instruction modifies stacks. Is this the API? I'm looking through here trying to find the stack logic.

Author

spadek67424 Aug 28, 2024

Yes, there is Executor(executor.py). I pass each instruction into it. And I use capstone to check if it is modifing the rsp. There is a "flagrsp" in executor to check it.

pyelftool_parser/src/execute.py

+                      ############################################
+                      ##------------------------------------------
+                      ## execute stage.
+                      if flagrsp:  ## if rsp is in the instruction

Collaborator

gparmer Aug 27, 2024

Isn't there a part of the API that tells you if an instruction modifies the stack? Aren't those the only instructions you need to worry about?

Author

spadek67424 Aug 28, 2024

I use capstone's "(regs_read, regs_write) = inst.regs_access()". Check that is it reading/writing rsp? This regs_access can also specify the implication read/write rsp.

src/composer/src/main.rs Outdated

                       sys.add_objs_iter(&c_id, ElfObject::transition_iter(c_id, &sys, &mut build)?);
                       sys.add_invs_iter(&c_id, Invocations::transition_iter(c_id, &sys, &mut build)?);
+                      println!("path:{}", obj.get_path());
+                      let output = Command::new("python3")
+                      .arg("/home/minghwu/work/minghwu/composite/pyelftool_parser/src/analyzer.py")

Collaborator

gparmer Aug 27, 2024

You'll want to use relative paths here so that you can avoid hardcoding the absolute path.

Author

spadek67424 Aug 28, 2024

Yes, this is indeed a problem. I do not figure out how to put into tools. Therefore, I put absolute path for this moment. I will fix it afterward. Thank you for pointing out.

src/composer/src/main.rs Outdated

                       sys.add_objs_iter(&c_id, ElfObject::transition_iter(c_id, &sys, &mut build)?);
                       sys.add_invs_iter(&c_id, Invocations::transition_iter(c_id, &sys, &mut build)?);
+                      println!("path:{}", obj.get_path());

Collaborator

gparmer Aug 27, 2024

Unless a print is for a specific purpose (other than debugging), we don't want to add them.

Author

spadek67424 Aug 28, 2024

Oh sorry. It is a debugging mistake.

src/composer/src/main.rs

+                      } else {
+                          let stderr = String::from_utf8_lossy(&output.stderr);
+                          eprintln!("Script error: {}", stderr);
+                      }

Collaborator

gparmer Aug 27, 2024

For all of this, we'll want to add a pass. This is likely in your other code, so I'm hesitate to say much here.

Collaborator

gparmer commented Aug 27, 2024

I'd like to understand a couple of things:

The complexity of the analysis of the stack usage -- I'm likely just missing where you're using the library API feature to tell you if a stack is updated in an instruction.
Simple updates to structure (put it in tools/, remove absolute paths, figure out how to add a pass in the composer and use the output of this program.)

spadek67424 added 5 commits

August 30, 2024 17:01


          change the composer.

27870ad


          clean code.

b11974d


          fix the captialization.

98f2818


          add log2.

0734aeb


          change the define name.

4080cee

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet