Sometimes Net::OpenTimeout / EOFError #29
Comments
I have no idea. During this period we received the same volume of requests as usual. Our load average and response time are as usual. I checked the open source app and it received at least one request every minute (https://www.rorvswild.com/applications/136101/requests?range=2023-12-30T19:20:00Z..2023-12-30T20:20:00Z). The throughput is even a little higher than the daily average. The timeout is set to 3 seconds. Maybe that is too short when there are perturbations on the network. Thanks, happy new year!
@alexisbernard Strange, I continue to see this in the logs of all my applications. I'm using dockerized apps, running on Hetzner Cloud Shared VMs, located in Nuremberg, Germany. Low traffic, I can't see any bottlenecks on my side. I just increased the timeout to 10 seconds and will see if that solves it.
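For readers unfamiliar with the two timeouts discussed here, the following is a minimal sketch in plain Ruby of how an open timeout is set on a `Net::HTTP` client. It is not RorVsWild's configuration code: the helper name `build_http` and the `example.com` URL are assumptions made for illustration, and the thread does not show the gem's actual option name. `Net::OpenTimeout` is raised when the connection cannot be opened within `open_timeout` seconds.

```ruby
require "net/http"
require "uri"

# Hypothetical helper (not part of RorVsWild): builds a Net::HTTP client
# with explicit timeouts. No connection is opened until a request is made.
def build_http(url, open_timeout: 10, read_timeout: 10)
  uri = URI.parse(url)
  http = Net::HTTP.new(uri.host, uri.port)
  http.use_ssl = (uri.scheme == "https")
  http.open_timeout = open_timeout # seconds allowed to establish the connection
  http.read_timeout = read_timeout # seconds allowed to wait for each read
  http
end

# Placeholder URL, assumed for illustration only.
http = build_http("https://example.com/", open_timeout: 10)
puts http.open_timeout
```

A longer open timeout only hides slow connection setup; it does not help when the peer closes the connection early, which is what surfaces as `EOFError`.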
After the timeout was increased to 10 seconds, the error no longer occurred. Nice, but now I see this:
Sorry for the delay. I suspect that the timeout was triggered from the server side, but the latency between our servers is very constant:
Do you have an idea of the frequency of the EOFError?
The
Further observation: In a low-traffic app, the error can occur even when there have been no requests for several minutes. I do not understand why the
By the way, the exception does not appear in RorVsWild's exception list, nor in HoneyBadger, which I usually have installed in parallel for bug tracking.
Thanks for those details.
Because server metrics are sent each minute.
Because data are sent in a separate thread, and the error is caught directly (https://github.com/BaseSecrete/rorvswild/blob/master/lib/rorvswild/client.rb#L71).
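To illustrate why such an exception never reaches an exception tracker, here is a minimal sketch of the pattern described above: work runs in a background thread and network errors are rescued inside it. This is not the code at the linked line; `deliver_in_background` and the `errors` array are hypothetical names, and the rescued classes are an assumption based on the errors named in this issue.

```ruby
require "net/http"

# Hypothetical helper (not RorVsWild's actual code): runs the given block in
# a background thread and rescues network errors inside that thread.
def deliver_in_background(errors = [])
  Thread.new do
    begin
      yield
    rescue Net::OpenTimeout, Net::ReadTimeout, EOFError => e
      # Rescued here, so the error never propagates to the application or to
      # an exception tracker. It is only visible where we record it.
      errors << "#{e.class}: #{e.message}"
      nil
    end
  end
end

errors = []
thread = deliver_in_background(errors) { raise EOFError, "end of file reached" }
thread.join # does not re-raise, because the error was rescued in the thread
puts errors.inspect
```

Since the rescue happens inside the thread, the only trace is the recorded message, which matches the error showing up in the logs but not in HoneyBadger.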
Hm, I don't think this explains the issue. Please take a look at this snippet from the logs with timestamps:
At 11:25 a request comes in. Five minutes (!) later, without further requests, the
https://github.com/BaseSecrete/rorvswild/blob/master/lib/rorvswild/queue.rb#L71 Yes, it's the server metrics :-) You can disable the NetHttp plugin and you should get the same error and same stack trace minus the 3
I should also add an option to disable server metrics. I increased the open timeout as you did, and I got the EOFError 3 times on 9 January. Compared to the volume, that is an extremely low percentage, but it is still embarrassing to miss data. It's the same stack trace as yours, except that it comes from
Thanks for the clarification, I understand that the
Yes, disabling server metrics could be useful. In my case, I run multiple dockerized applications on the same host, so the server metrics are all the same. Here is a list of timestamps (in UTC) where an
I'm currently using RorVsWild in 11 applications (with
Thanks for the grep. So it does not seem to happen at the same time. I'm currently running out of ideas for this issue. I will come back if I have something new to try.
Indeed, in your case, it sends the same server metrics at least 11 times.
Since yesterday (2023-12-30 20:49:00 UTC+1) I get this in all my apps using RorVsWild:
Since I don't know if you have a status page, I'll report it here.
Happy new year ;-)