run_ltp: Print tasktrace on test failure #18909

Merged 3 commits on Mar 19, 2024

mdoucha (Contributor) commented Mar 18, 2024

Add support for optionally printing process backtraces and blocked tasks on test failure.

Also add minor fixes for corner cases in tcpdump and crashdump handling.

mdoucha force-pushed the ltp_tasktrace branch 2 times, most recently from 7e020a7 to 7d92678 (March 18, 2024 16:50)

pevik (Contributor) left a comment

LGTM, thanks for this!

I looked through the SysRq documentation (https://docs.kernel.org/admin-guide/sysrq.html) for anything else that would be useful to dump, but it looks like only t and w qualify. Or maybe also l (shows a stack backtrace for all active CPUs)?

BTW, on my laptop many of these only report "this sysrq option is disabled".

mdoucha (Contributor, Author) commented Mar 18, 2024

> BTW, on my laptop many of these only report "this sysrq option is disabled".

Yes, SysRq features can be disabled via /proc/sys/kernel/sysrq but that should not be the case on our test systems.
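
To make the /proc/sys/kernel/sysrq point concrete, here is a minimal sketch of how one might check whether dump-style SysRq keys are enabled. It is not part of this PR. The function names are hypothetical, and it assumes that t, w and l are gated by the "debugging dumps of processes" bit (0x8) described in the kernel admin guide, with the value 1 meaning "enable everything".

```python
# Values documented for /proc/sys/kernel/sysrq in the kernel admin guide.
SYSRQ_ENABLE_ALL = 1      # the value 1 enables every SysRq function
SYSRQ_ENABLE_DUMP = 0x8   # bit for "debugging dumps of processes etc."


def dumps_allowed(sysrq_value: int) -> bool:
    """Return True if keyboard-triggered debugging dumps are enabled.

    Assumption: the t, w and l keys are all gated by the 0x8 bit.
    """
    if sysrq_value == SYSRQ_ENABLE_ALL:
        return True
    return bool(sysrq_value & SYSRQ_ENABLE_DUMP)


def read_sysrq(path: str = "/proc/sys/kernel/sysrq") -> int:
    """Read the current SysRq setting (Linux only)."""
    with open(path, encoding="ascii") as handle:
        return int(handle.read().strip())
```

On a Linux host, `dumps_allowed(read_sysrq())` would tell you whether the keyboard shortcuts are expected to work. A value such as 176 (0x80 | 0x20 | 0x10) leaves the dump bit unset, which would explain a "disabled" message. According to the kernel documentation, this setting only restricts keyboard invocation; writing to /proc/sysrq-trigger as root is always allowed.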

But I guess I'll have to take a look at PPC support later; crashdump and tasktrace don't work as expected there:
https://openqa.suse.de/tests/13814428

mdoucha added 3 commits March 19, 2024 10:52

The run_ltp debug features need an active VNC console to run some optional commands. However, logging in after test failure may not be possible due to the system being frozen. Preinitialize the VNC console after boot so that at least SysRq shortcuts can be used to generate a crashdump.

Add support for optionally printing process backtraces and blocked tasks on test failure.

If the command to terminate tcpdump times out, restore the previous console. Otherwise OpenQA may get confused about which console should be active after VM snapshot reload.

mdoucha (Contributor, Author) commented Mar 19, 2024

I've increased the tasktrace wait_serial() timeout to avoid issues with a slow console. The crashdump problem on PPC was caused by crashdump being already enabled but misconfigured in the base image, so there is nothing to do here.

czerw merged commit 39f2c9d into os-autoinst:master on Mar 19, 2024 (10 checks passed)
mdoucha deleted the ltp_tasktrace branch on March 22, 2024 10:45