-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"ze_peak" freezes on DG1 with latest drm-tip kernel + drivers #20
Comments
@eero-t : could you check if it is just that it is taking a long time? please execute with reduced number of iterations
|
With In total it took 3.5 mins with What's the default iteration count? With that, I see this in dmesg:
Which may explain why it freezes. With default iteration count, there are no numbers shown for "Single Precision Compute" even after 40 mins, so I think that test is really frozen. Especially as numbers for the two earlier categories came only with few second delays. Benchmarks code may be missing some error checks and warnings for the errors (when to skip given thing). As to PS. why both of these GPU benchmarking programs take constantly 100% CPU, and need to allocate 32TB of virtual memory?
|
Latest
There are again these kernel driver warnings:
|
Setup:
Bug:
./ze_peak
freezes with 99% CPU usage after showing:Single Precision Compute (GFLOPS)
(I.e. half precision and global BW tests before it worked fine.)
It can be quit with ^C, so it's not in 100% CPU loop.
Gdb shows:
perf
showed most of the time being spent insidelibze_intel_gpu.so.1
. I.e. it could be driver issue, but I thought it better to start from the app.ze_image_copy
,ze_nano
andze_pingpong
work fine.ze_bandwidth
gets slower and slower, and I did not wait for it to complete.The text was updated successfully, but these errors were encountered: