Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Mellanox][Smartswitch] Add no_wait option for dpu reboot and add platform information parsing #20943

Merged
merged 9 commits into from
Dec 4, 2024

Conversation

gpunathilell
Copy link
Contributor

Why I did it

Changes for dpuctlplat.py:

  • Added option to invoke systemctl rshim start/stop from the pmon container (Using dbus)
  • Added no_wait option for reboot (Since we do not need to wait for the dpu to be ready if NPU+DPU reboot is ongoing)
  • Added platform JSON parsing for rshim and pcie information
Work item tracking
  • Microsoft ADO (number only):

How I did it

Changed dpuctlplat.py to support systemctl commands from pmon container using the dbus-send command

How to verify it

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106
  • 202111
  • 202205
  • 202211
  • 202305

Tested branch (Please provide the tested image version)

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

@@ -169,23 +189,32 @@ def get_open_fd(self, path, flag):
finally:
os.close(fd)

def get_pci_dev_path(self):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@gpunathilell any specific reason why this PCI remove could not be done in common code? @vvolam fyi

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we discussed this at the meeting. We have an extra dependency on the rshim here. Before removing PCI we need to stop rshim. After rescaning the PCI we need to start the rshim. Common code can't cover this.

@liat-grozovik liat-grozovik merged commit 3cc4c11 into sonic-net:master Dec 4, 2024
12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants