Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tool for monitoring Arc GPU VRAM usage on Ubuntu? #9594

Closed
kyang-06 opened this issue Dec 4, 2023 · 4 comments
Closed

Tool for monitoring Arc GPU VRAM usage on Ubuntu? #9594

kyang-06 opened this issue Dec 4, 2023 · 4 comments
Assignees

Comments

@kyang-06
Copy link

kyang-06 commented Dec 4, 2023

Hi, thank you for excellent efforts on supporting LLM running on Arc GPU.
Do you know if there exists any tool for this feature? Sadly I only found their answer is no.

@rnwang04
Copy link
Contributor

rnwang04 commented Dec 5, 2023

You can use xpu-smi tool to monitor your Arc GPU VRAM usage.
Install it by sudo apt-get install xpu-smi
Then monitor your Arc GPU VRAM usage by sudo xpu-smi stats -d 0, you will see something like:

+-----------------------------+--------------------------------------------------------------------+
| Device ID                   | 0                                                                  |
+-----------------------------+--------------------------------------------------------------------+
| GPU Utilization (%)         | 0                                                                  |
| EU Array Active (%)         |                                                                    |
| EU Array Stall (%)          |                                                                    |
| EU Array Idle (%)           |                                                                    |
|                             |                                                                    |
| Compute Engine Util (%)     | 0; Engine 0: 0, Engine 1: 0, Engine 2: 0, Engine 3: 0              |
| Render Engine Util (%)      | 0;                                                                 |
| Media Engine Util (%)       | 0                                                                  |
| Decoder Engine Util (%)     |                                                                    |
| Encoder Engine Util (%)     |                                                                    |
| Copy Engine Util (%)        | 0;                                                                 |
| Media EM Engine Util (%)    |                                                                    |
| 3D Engine Util (%)          |                                                                    |
+-----------------------------+--------------------------------------------------------------------+
| Reset                       |                                                                    |
| Programming Errors          |                                                                    |
| Driver Errors               |                                                                    |
| Cache Errors Correctable    |                                                                    |
| Cache Errors Uncorrectable  |                                                                    |
| Mem Errors Correctable      |                                                                    |
| Mem Errors Uncorrectable    |                                                                    |
+-----------------------------+--------------------------------------------------------------------+
| GPU Power (W)               | 45                                                                 |
| GPU Frequency (MHz)         | 1500                                                               |
| GPU Core Temperature (C)    | 52                                                                 |
| GPU Memory Temperature (C)  |                                                                    |
| GPU Memory Read (kB/s)      |                                                                    |
| GPU Memory Write (kB/s)     |                                                                    |
| GPU Memory Bandwidth (%)    |                                                                    |
| GPU Memory Used (MiB)       | 26                                                                 |
| GPU Memory Util (%)         | 0                                                                  |
| Xe Link Throughput (kB/s)   |                                                                    |
+-----------------------------+--------------------------------------------------------------------+

@kyang-06
Copy link
Author

kyang-06 commented Dec 5, 2023

Thank you for soon reply!
But watch -n 1 xpu-smi -d 0 is always showing GPU memory used = 0.
I also tried pti-gpu/sysmon, showing running process unknown, as same as this
image

image

@kyang-06
Copy link
Author

kyang-06 commented Dec 5, 2023

Sorry to bother, solved by using root privilege for xpu-smi. even though sysmon is still unavailable.
Thank you for the help!
image

@hkvision
Copy link
Contributor

hkvision commented Dec 7, 2023

Closing this issue. Feel free to tell us if you have further issues :)

@hkvision hkvision closed this as completed Dec 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants