Failed to collect some domain info while migrating vm #53

Open
laushunyu opened this issue Nov 15, 2024 · 2 comments

@laushunyu

I'm running OpenStack 23.2 and installed prometheus-libvirt-exporter (version 1.6.0) with apt install prometheus-libvirt-exporter.

While live-migrating a VM, I see these warnings in the journal:

Nov 15 07:49:48 xfusion-3 prometheus-libvirt-exporter[4037]: ts=2024-11-15T07:49:48.773Z caller=prometheus-libvirt-exporter.go:461 level=warn warn="failed to get DomainBlockStats" domain=instance-00001045 msg="Timed out during operation: cannot acquire state change lock (held by monitor=remoteDispatchDomainMigratePrepare3Params)"
Nov 15 07:49:48 xfusion-3 prometheus-libvirt-exporter[4037]: ts=2024-11-15T07:49:48.773Z caller=prometheus-libvirt-exporter.go:445 level=warn warn="failed to collect some domain info" domain=instance-00001045 msg="Timed out during operation: cannot acquire state change lock (held by monitor=remoteDispatchDomainMigratePrepare3Params)"
Nov 15 07:49:48 xfusion-3 prometheus-libvirt-exporter[4037]: ts=2024-11-15T07:49:48.805Z caller=prometheus-libvirt-exporter.go:575 level=warn warn="failed to get DomainMemoryStats" domain=instance-00001045 msg="Timed out during operation: cannot acquire state change lock (held by monitor=remoteDispatchDomainMigratePrepare3Params)"
Nov 15 07:49:48 xfusion-3 prometheus-libvirt-exporter[4037]: ts=2024-11-15T07:49:48.805Z caller=prometheus-libvirt-exporter.go:445 level=warn warn="failed to collect some domain info" domain=instance-00001045 msg="Timed out during operation: cannot acquire state change lock (held by monitor=remoteDispatchDomainMigratePrepare3Params)"
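
To see which domains and collectors are affected during a migration, warnings like the ones above can be grouped by domain and warning text. The following is a minimal Python sketch, not part of the exporter: it assumes the journal lines keep the key=value layout shown above, and the helper names (parse_line, lock_holder, summarize) are made up for illustration.

```python
import re
from collections import Counter

# Matches key=value and key="quoted value" pairs as they appear in the
# exporter's journal output (layout assumed from the lines in this issue).
PAIR_RE = re.compile(r'(\w+)=("(?:[^"\\]|\\.)*"|\S+)')
# Extracts the job name from libvirt's "held by monitor=<job>" hint.
HOLDER_RE = re.compile(r'held by monitor=(\w+)')

LOCK_TEXT = "cannot acquire state change lock"

# Two of the lines reported in this issue, used as sample input.
SAMPLE = [
    'Nov 15 07:49:48 xfusion-3 prometheus-libvirt-exporter[4037]: '
    'ts=2024-11-15T07:49:48.773Z caller=prometheus-libvirt-exporter.go:461 '
    'level=warn warn="failed to get DomainBlockStats" domain=instance-00001045 '
    'msg="Timed out during operation: cannot acquire state change lock '
    '(held by monitor=remoteDispatchDomainMigratePrepare3Params)"',
    'Nov 15 07:49:48 xfusion-3 prometheus-libvirt-exporter[4037]: '
    'ts=2024-11-15T07:49:48.773Z caller=prometheus-libvirt-exporter.go:445 '
    'level=warn warn="failed to collect some domain info" domain=instance-00001045 '
    'msg="Timed out during operation: cannot acquire state change lock '
    '(held by monitor=remoteDispatchDomainMigratePrepare3Params)"',
]


def parse_line(line):
    """Return the key/value pairs of one journal line as a dict."""
    fields = {}
    for key, value in PAIR_RE.findall(line):
        # Strip the surrounding quotes from quoted values.
        if len(value) >= 2 and value.startswith('"') and value.endswith('"'):
            value = value[1:-1]
        fields[key] = value
    return fields


def lock_holder(msg):
    """Return the job named after 'held by monitor=', or None if absent."""
    match = HOLDER_RE.search(msg)
    return match.group(1) if match else None


def summarize(lines):
    """Count state-change-lock timeouts per (domain, warning) pair."""
    counts = Counter()
    for line in lines:
        fields = parse_line(line)
        if LOCK_TEXT in fields.get("msg", ""):
            counts[(fields.get("domain"), fields.get("warn"))] += 1
    return counts


if __name__ == "__main__":
    for (domain, warn), n in summarize(SAMPLE).items():
        print(domain, warn, n)
```

Feeding it the output of journalctl for the exporter unit should show whether the timeouts are limited to the domain being migrated, and which job holds the lock.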

Prometheus shows some failed metric data points:

image

@frittentheke
Collaborator

This is a known issue. I started a discussion about it on the libvirt users mailing list a while back: https://lists.libvirt.org/archives/list/users@lists.libvirt.org/thread/PXKLIU7DPF6MDGEHWQYX3KLGCDNRAIW3/

@swenske

swenske commented Dec 10, 2024

https://lists.libvirt.org/archives/list/users@lists.libvirt.org/message/USIPK3CHB2O35767UDR3CQMBM24HOUKM/

We will look into using this to improve the exporter's robustness.
I'll post an update once I get to it.

👋 @frittentheke

Did you find any leads?

Have a nice day and thank you!
