CI ROS test timeout #11290

Closed
mcsauder opened this issue Jan 24, 2019 · 13 comments

Comments

@mcsauder
Contributor

mcsauder commented Jan 24, 2019

Describe the bug
The CI ROS tests frequently time out. Sometimes a previously passing PR fails after a rebase, and at other times the same tests pass after a rebase against current master.

image

At the time of this writing, the following PR(s) exhibit this failure mode:
http://ci.px4.io:8080/blue/organizations/jenkins/PX4_misc%2FFirmware-SITL_tests/detail/PR-11194/20/pipeline
http://ci.px4.io:8080/blue/organizations/jenkins/PX4_misc%2FFirmware-SITL_tests/detail/PR-11260/11/pipeline

To Reproduce
Rebase a current PR against master and force push to the remote. This restarts the CI tests. Sometimes everything runs smoothly, but sometimes the tests time out.

Expected behavior
@dagar, if a timeout occurs, is there a way to automatically restart the test (perhaps a finite number of times), or to allow users to manually restart failing tests through the web interface?
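
To make the "finite number of times" idea concrete, here is a minimal sketch in Python. It is only an illustration of the requested behavior, not how the PX4 Jenkins pipeline works: `run_with_retries` is a hypothetical helper, and the attempt limit is an assumed value. The 300 second default mirrors the limit reported in the failing test logs.

```python
import subprocess


def run_with_retries(cmd, max_attempts=3, timeout_s=300.0):
    """Run cmd, retrying only when it exceeds timeout_s.

    Hypothetical helper for illustration. Returns (returncode, attempts);
    returncode is None if every attempt timed out.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            # subprocess.run kills the child and raises TimeoutExpired
            # when the timeout elapses.
            result = subprocess.run(cmd, timeout=timeout_s)
        except subprocess.TimeoutExpired:
            continue  # timed out: try again, up to max_attempts
        # The command finished on its own; report its result without retrying,
        # so genuine test failures are not hidden by the retry loop.
        return result.returncode, attempt
    return None, max_attempts
```

Retrying only on timeouts, and not on ordinary non-zero exit codes, keeps real regressions visible while absorbing intermittent hangs.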

@dagar
Member

dagar commented Jan 24, 2019

Duplicate of #11257

dagar marked this as a duplicate of #11257 Jan 24, 2019
@lamping7
Member

This isn't a load-based timeout. Something in the software environment is failing, and the test itself does not monitor for that failure.

@mcsauder
Contributor Author

mcsauder commented Jan 27, 2019

At the time of this writing, another occurrence of this issue appears in PR #11314: http://ci.px4.io:8080/blue/organizations/jenkins/PX4_misc%2FFirmware-SITL_tests/detail/PR-11314/1/pipeline

(I don't believe that the lines of code changed in this PR could affect the ROS tests.)

@mcsauder
Contributor Author

PR #11314 is passing now after a rebase with current PX4/Firmware master.

@mcsauder
Contributor Author

@mcsauder
Contributor Author

@lamping7
Member

#11308 and #11189 are different from the original issue you posted about. Those are preflight failures for the accelerometers.

@mcsauder
Contributor Author

Thanks @lamping7, this is what I see at the bottom of #11308:

ERROR: max time [300.0s] allotted for test [VTOL_mission_1] of type [px4/mission_test.py]

But you're right about #11189:

WARN  [commander] Preflight Fail: Accels inconsistent - Check Cal


Exiting...

I'll delete the reference to #11189; I think I autocompleted an incorrect PR. I'll look again. Thanks!
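
Since the two failure modes are easy to confuse when only the end of a log is visible, a small classifier over the whole log can help tell them apart. This is a rough sketch, assuming the log lines look like the two quoted above; `classify_log` and the regular expressions are hypothetical and not part of the PX4 test tooling.

```python
import re

# Patterns modeled on the two log lines quoted in this thread (assumed formats).
TIMEOUT_RE = re.compile(
    r"ERROR: max time \[(?P<limit>[\d.]+)s\] allotted for test \[(?P<test>[^\]]+)\]"
)
PREFLIGHT_RE = re.compile(r"Preflight Fail: (?P<reason>.+)")


def classify_log(text):
    """Return a list of (kind, detail) findings in the order they appear."""
    findings = []
    for line in text.splitlines():
        match = TIMEOUT_RE.search(line)
        if match:
            findings.append(("timeout", match.group("test")))
            continue
        match = PREFLIGHT_RE.search(line)
        if match:
            findings.append(("preflight", match.group("reason").strip()))
    return findings
```

A log that contains a preflight finding before a timeout finding would suggest the timeout is a symptom of the preflight failure and not an independent problem.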

@lamping7
Member

lamping7 commented Jan 28, 2019

Scroll up on #11308. The accel problem is not caused by these PRs. It seems something crept into master.

#11316 is doing it too.

lamping7 reopened this Jan 28, 2019
@mcsauder
Contributor Author

@lamping7, #11305 still shows the same behavior, so at least that instance is still relevant. Shortly after your post, I watched the #11189 page automatically reload about 20 times and keep scrolling to the bottom, so I honestly can't say whether it showed the same thing when you saw it as when I did. When the CI tools run again, they definitely yield different results; I've seen this many times in the past few months when rebasing my own PRs. Any ideas? How can I help out?

@mcsauder
Contributor Author

PR #11224 is back to the same failure mode after a rebase against current master: http://ci.px4.io:8080/blue/organizations/jenkins/PX4_misc%2FFirmware-SITL_tests/detail/PR-11224/21/pipeline

@mcsauder
Contributor Author

mcsauder commented Jan 28, 2019

PR #11192 is back to the same failure mode after a rebase against current master: http://ci.px4.io:8080/blue/organizations/jenkins/PX4_misc%2FFirmware-SITL_tests/detail/PR-11192/36/pipeline

Update: after an additional rebase against current master, #11192 is passing again.

@mcsauder
Contributor Author

I haven't seen this issue in roughly the past week, so I think recent changes might have fixed it. Thanks for squashing this bug!
