RPC threads dying w/ "Too many open files" #9102

XertroV · 2018-07-12T03:24:00Z

I'm running:

Which Parity version?: 1.10.9 (observed on 1.10.8 too, though haven't dug through logs)

Which operating system?: Linux

How installed?: binaries

Are you fully synchronized?: yes

Which network are you connected to?: ethereum

Did you try to restart the node?: yes (workaround)

pretty self explanatory - all RCP threads die simultaneously, but everything else keeps working (including websockets according to netstat; RPC ports disappear but websocket ports don't)

(Most of those requests are 2x eth-stats running via pm2)

Logs:

Jul 11 12:16:35 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:16:35 UTC   39/50 peers    978 MiB chain   92 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn, 15 req/s,  42 µs
Jul 11 12:17:14 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:17:14 UTC Imported #5944667 2191…9c17 (201 txs, 7.99 Mgas, 577.74 ms, 28.41 KiB)
Jul 11 12:17:23 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:17:23 UTC Imported #5944668 8712…a45d (225 txs, 8.00 Mgas, 401.54 ms, 32.90 KiB)
Jul 11 12:17:23 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:17:23 UTC   40/50 peers    979 MiB chain   93 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn, 15 req/s, 200849 µs
Jul 11 12:17:34 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:17:34 UTC Imported #5944652 e9ce…5142 (158 txs, 7.98 Mgas, 530.73 ms, 27.99 KiB)
Jul 11 12:17:43 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:17:43 UTC Imported #5944669 3f6e…aa3e (123 txs, 7.98 Mgas, 319.24 ms, 23.49 KiB)
Jul 11 12:17:57 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:17:57 UTC Imported #5944670 00f2…1fc4 (141 txs, 7.96 Mgas, 525.96 ms, 30.61 KiB)
Jul 11 12:18:06 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:18:06 UTC Imported #5944671 2a2c…dc79 (180 txs, 7.98 Mgas, 295.11 ms, 31.91 KiB)
Jul 11 12:18:16 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:18:16 UTC Imported #5944672 ad08…bc85 (91 txs, 7.98 Mgas, 467.72 ms, 18.07 KiB)
Jul 11 12:18:25 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:18:25 UTC Imported #5944672 7141…18b3 (128 txs, 7.99 Mgas, 300.18 ms, 29.26 KiB)
Jul 11 12:18:26 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:18:26 UTC   41/50 peers    979 MiB chain   93 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn, 13 req/s, 85801 µs
Jul 11 12:18:42 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:18:42 UTC Imported #5944673 cdc2…490a (178 txs, 7.97 Mgas, 334.38 ms, 29.28 KiB)
Jul 11 12:18:52 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:18:52 UTC Imported #5944674 1312…9ce7 (161 txs, 7.98 Mgas, 402.45 ms, 27.58 KiB)
Jul 11 12:19:01 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:19:01 UTC Imported #5944675 b50d…d2ea (151 txs, 7.97 Mgas, 288.44 ms, 29.28 KiB)
Jul 11 12:19:09 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:19:09 UTC Imported #5944676 6751…fcd2 (110 txs, 8.00 Mgas, 185.48 ms, 23.90 KiB)
Jul 11 12:19:13 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:19:13 UTC   44/50 peers    979 MiB chain   93 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn, 15 req/s, 4483 µs
Jul 11 12:19:42 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:19:42 UTC Imported #5944677 03a4…19ab (109 txs, 7.98 Mgas, 309.37 ms, 23.08 KiB)
Jul 11 12:19:46 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:19:46 UTC   42/50 peers    979 MiB chain   93 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn, 18 req/s, 2073 µs
Jul 11 12:25:31 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:31 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:31 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:31 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:31 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:31 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:31 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:31 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:31 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:31 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:31 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:31 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:31 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:31 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:32 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:32 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:32 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:32 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:32 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:32 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:32 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:32 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:32 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:32 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:32 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:32 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:33 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:33 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:25:33 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:25:33 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 11 12:27:18 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:27:18 UTC Syncing #5944677 03a4…19ab     0 blk/s    0 tx/s   0 Mgas/s      0+    0 Qed  #5944678   41/50 peers    979 MiB chain   93 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn,  0 req/s, 108293215 µs
Jul 11 12:27:18 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:27:18 UTC Syncing #5944677 03a4…19ab     0 blk/s    0 tx/s   0 Mgas/s      0+    0 Qed  #5944678   35/50 peers    979 MiB chain   93 MiB db 776 bytes queue  270 KiB sync  RPC:  0 conn,  0 req/s, 108293215 µs
Jul 11 12:27:28 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:27:28 UTC Imported #5944679 0c6c…52bf (164 txs, 8.00 Mgas, 315.46 ms, 30.88 KiB)
Jul 11 12:27:46 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:27:46 UTC Syncing #5944679 0c6c…52bf     0 blk/s    0 tx/s   0 Mgas/s      0+    0 Qed  #5944678    2/50 peers    979 MiB chain   93 MiB db  0 bytes queue  295 KiB sync  RPC:  0 conn,  0 req/s, 108293215 µs
Jul 11 12:27:46 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:27:46 UTC Reorg to #5944680 9ada…5d2d (0c6c…52bf #5944678 73c0…4c05 ff48…870c)
Jul 11 12:28:04 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:28:04 UTC Syncing #5944717 cdcd…0071     3 blk/s  576 tx/s  30 Mgas/s      0+    0 Qed  #5944717    3/50 peers    979 MiB chain   94 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn,  0 req/s, 108293215 µs
Jul 11 12:28:04 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:28:04 UTC Syncing #5944717 cdcd…0071     0 blk/s    0 tx/s   0 Mgas/s      0+    0 Qed  #5944717    3/50 peers    979 MiB chain   94 MiB db  0 bytes queue  271 KiB sync  RPC:  0 conn,  0 req/s, 108293215 µs
Jul 11 12:28:11 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:28:11 UTC Syncing #5944718 a475…6508     0 blk/s   22 tx/s   1 Mgas/s      0+    0 Qed  #5944718    4/50 peers    979 MiB chain   94 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn,  0 req/s, 108293215 µs
Jul 11 12:28:21 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:28:21 UTC Imported #5944720 e519…b8fd (157 txs, 7.94 Mgas, 291.49 ms, 21.24 KiB) + another 3 block(s) containing 248 tx(s)
Jul 11 12:28:31 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:28:31 UTC Imported #5944721 f565…2c7d (76 txs, 7.99 Mgas, 292.40 ms, 26.64 KiB)
Jul 11 12:28:47 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:28:47 UTC    6/50 peers    980 MiB chain   94 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn,  0 req/s, 108293215 µs
Jul 11 12:28:47 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:28:47 UTC    6/50 peers    980 MiB chain   94 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn,  0 req/s, 108293215 µs
Jul 11 12:28:58 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:28:58 UTC Imported #5944721 f876…f4b7 (90 txs, 7.99 Mgas, 478.49 ms, 16.77 KiB)
Jul 11 12:29:08 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:29:08 UTC Imported #5944722 6608…88b0 (85 txs, 7.97 Mgas, 288.22 ms, 26.36 KiB)
Jul 11 12:29:12 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:29:12 UTC Reorg to #5944723 f887…a9ab (9879…957e #5944721 f565…2c7d 6608…88b0)
Jul 11 12:29:21 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:29:21 UTC Imported #5944724 f254…0257 (44 txs, 7.98 Mgas, 630.93 ms, 9.78 KiB) + another 1 block(s) containing 146 tx(s)
Jul 11 12:29:22 eth-aws-nv-node-03 parity[11186]: 2018-07-11 12:29:22 UTC    5/50 peers    980 MiB chain   94 MiB db  0 bytes queue  270 KiB sync  RPC:  0 conn,  0 req/s, 108293215 µs

The text was updated successfully, but these errors were encountered:

Tbaut · 2018-07-12T09:31:54Z

Is it something that you were able to reproduce? Any way to give us more logs (-l debug could probably give us a hint ).

ascjones · 2018-07-12T11:21:23Z

Related: #8123 and #8813. It might be fixed in 1.11.5?

XertroV · 2018-07-12T12:10:57Z

@Tbaut - not sure how to reproduce except to leave nodes running long term. Will add -l debug and see if that captures some more interesting detail.

As an aside, I've looked before for how to increase the log level and it wasn't easy to find (well, at least when I looked I didn't find anything). Might be worth adding a wiki page on debugging / increased logging.

@ascjones I think the above issues were different for a few reasons:

when I had RPC issues before in parity crashes with too many open files #8813 I had LOTS of RPC requests coming in, and haven't noticed that this time.
The log outputs are different this time - all the RPC threads die simultaneously, but this isn't consistent with what I experienced with parity crashes with too many open files #8813. Particularly the node itself keeps running fine atm; it's just RPC that fails, whereas before it was a complete panic and node-restart.

So unless something particularly significant changed between 1.10.6 and 1.10.9 I suspect it's a different issue.

XertroV · 2018-07-12T12:47:44Z

Happened to another node a few hours ago - just checked.

A thing I noticed:

Jul 12 04:31:05 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:31:05 UTC Imported #5948726 19c5…bca6 (129 txs, 7.99 Mgas, 386.64 ms, 24.63 KiB)
Jul 12 04:36:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:50 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }

There are ~5-6 minutes between those two log lines. 🤔

Jul 12 04:29:06 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:29:06 UTC Imported #5948724 3731…6aa7 (173 txs, 7.98 Mgas, 74.41 ms, 28.90 KiB)
Jul 12 04:29:10 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:29:10 UTC   49/50 peers    340 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn, 18 req/s, 3544 µs
Jul 12 04:29:10 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:29:10 UTC   49/50 peers    340 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn, 18 req/s, 3544 µs
Jul 12 04:29:46 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:29:46 UTC Imported #5948725 15cb…f4d9 (41 txs, 7.98 Mgas, 204.38 ms, 13.59 KiB)
Jul 12 04:29:48 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:29:48 UTC   50/50 peers    340 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn, 13 req/s, 126805 µs
Jul 12 04:29:48 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:29:48 UTC   50/50 peers    340 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn, 13 req/s, 126805 µs
Jul 12 04:30:20 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:30:20 UTC   52/500 peers    340 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn, 17 req/s,  60 µs
Jul 12 04:30:51 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:30:51 UTC Imported #5948726 e81f…991b (43 txs, 4.79 Mgas, 208.14 ms, 9.81 KiB)
Jul 12 04:30:52 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:30:52 UTC   51/500 peers    341 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn, 13 req/s, 541023 µs
Jul 12 04:30:52 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:30:52 UTC   51/500 peers    341 MiB chain   20 MiB db  0 bytes queue   51 KiB sync  RPC:  0 conn, 13 req/s, 140738 µs
Jul 12 04:31:05 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:31:05 UTC Imported #5948726 19c5…bca6 (129 txs, 7.99 Mgas, 386.64 ms, 24.63 KiB)
Jul 12 04:36:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:50 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:50 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:50 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:50 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:50 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:50 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:50 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:50 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:50 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:51 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:51 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:51 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:51 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:51 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:51 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:52 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:52 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:52 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:52 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:36:53 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:36:53 UTC Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }
Jul 12 04:37:36 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:37:36 UTC Reorg to #5948728 1d2d…d6e1 (f4eb…9717 #5948726 e81f…991b f330…5f8b)
Jul 12 04:37:42 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:37:42 UTC Imported #5948727 f330…5f8b (208 txs, 7.98 Mgas, 286.16 ms, 36.06 KiB)
Jul 12 04:37:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:37:50 UTC Imported #5948728 1d2d…d6e1 (166 txs, 7.98 Mgas, 401.44 ms, 28.28 KiB) + another 1 block(s) containing 124 tx(s)
Jul 12 04:38:02 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:38:02 UTC    2/50 peers    341 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:38:02 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:38:02 UTC    2/50 peers    341 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:38:08 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:38:08 UTC Syncing #5948732 a20b…a53c     0 blk/s   59 tx/s   6 Mgas/s      0+   21 Qed  #5948753    5/50 peers    341 MiB chain   20 MiB db    2 MiB queue   50 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:38:08 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:38:08 UTC Syncing #5948732 a20b…a53c     0 blk/s    0 tx/s   0 Mgas/s      0+   21 Qed  #5948753    5/50 peers    341 MiB chain   20 MiB db    2 MiB queue   50 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:38:21 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:38:21 UTC Imported #5948753 4194…840d (92 txs, 8.01 Mgas, 161.89 ms, 21.29 KiB)
Jul 12 04:38:40 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:38:40 UTC    5/50 peers    341 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:38:50 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:38:50 UTC Syncing #5948753 4194…840d     0 blk/s    0 tx/s   0 Mgas/s      0+    0 Qed  #5948752    8/50 peers    341 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:39:00 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:39:00 UTC Syncing #5948753 4194…840d     0 blk/s    0 tx/s   0 Mgas/s      0+    0 Qed  #5948752   10/50 peers    341 MiB chain   20 MiB db  0 bytes queue   57 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:39:10 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:39:10 UTC Syncing #5948753 4194…840d     0 blk/s    0 tx/s   0 Mgas/s      0+    0 Qed  #5948752   14/50 peers    341 MiB chain   20 MiB db  0 bytes queue   57 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:39:20 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:39:20 UTC Syncing #5948761 33d3…a147     0 blk/s   72 tx/s   6 Mgas/s      0+    2 Qed  #5948764   17/50 peers    341 MiB chain   20 MiB db  219 KiB queue   50 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:39:44 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:39:44 UTC Imported #5948765 fadc…cb14 (111 txs, 7.98 Mgas, 428.37 ms, 21.68 KiB) + another 2 block(s) containing 203 tx(s)
Jul 12 04:39:55 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:39:55 UTC Imported #5948766 1a68…e067 (101 txs, 7.05 Mgas, 318.83 ms, 22.62 KiB)
Jul 12 04:40:04 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:40:04 UTC   16/50 peers    341 MiB chain   20 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:40:04 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:40:04 UTC Imported #5948767 c3a4…5f00 (15 txs, 4.34 Mgas, 54.35 ms, 6.80 KiB)
Jul 12 04:40:13 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:40:13 UTC Imported #5948766 06bf…d6c6 (108 txs, 7.99 Mgas, 268.07 ms, 20.53 KiB)
Jul 12 04:40:24 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:40:24 UTC Imported #5948771 0e13…6167 (66 txs, 7.98 Mgas, 619.36 ms, 19.76 KiB) + another 2 block(s) containing 135 tx(s)
Jul 12 04:40:35 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:40:35 UTC Imported #5948772 af5c…c366 (94 txs, 7.97 Mgas, 87.97 ms, 19.50 KiB)
Jul 12 04:40:37 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:40:37 UTC   14/50 peers    342 MiB chain   21 MiB db  0 bytes queue   50 KiB sync  RPC:  0 conn,  0 req/s, 40488942 µs
Jul 12 04:41:03 eth-aws-syd-node-02 parity[5414]: 2018-07-12 04:41:03 UTC Imported #5948773 2884…a3ac (93 txs, 7.99 Mgas, 166.64 ms, 19.20 KiB)

Tbaut · 2018-07-12T14:42:22Z

As an aside, I've looked before for how to increase the log level and it wasn't easy to find (well, at least when I looked I didn't find anything). Might be worth adding a wiki page on debugging / increased logging.

What keyword did you use? it's here: https://wiki.parity.io/FAQ.html#how-can-i-make-parity-to-write-logs

XertroV · 2018-07-13T01:33:33Z

What keyword did you use? it's here: https://wiki.parity.io/FAQ.html#how-can-i-make-parity-to-write-logs

I think I must have just derped pretty hard. Some vague memories: might have been before the current wiki (was it on github before?) and I think I used google more than searching a particular site. In any case thanks for linking me.

XertroV · 2018-07-14T02:49:43Z

@Tbaut - so good news! 2 of my nodes seem to have lost RPC again. I'll try and get those logs up later today (though they might be large) and filter through for anything relevant I can find.

XertroV · 2018-07-17T01:24:09Z

okay so seems like they actually didn't lose RPC - was just that the log files clobbered the little AWS / partition and eth-stats stopped working. Logging to a file on SSD now so shouldn't be an issue anymore, but no news on the RPC worker threads dying. Also seems like I only got a few hours of logs so no particularly good deets

debris · 2018-07-19T13:46:43Z

@XertroV are you using rpc over http or ipc?

XertroV · 2018-07-20T00:40:00Z

@debris HTTP - the nodes are all remote (publicly accessible) and don't run anything else on them besides stats, healthcheck, etc. Didn't check if IPC was affected, and haven't been able to replicate. Actually just disabled debug logging because the log files were about 20GB

vogelito · 2018-07-20T12:58:02Z

RPC in one of our parity nodes stopped working for us a few hours ago. The following log entry lines up perfectly with our software being unable to talk to parity:

2018-07-20 08:14:10 UTC  WARN jsonrpc_http_server  Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }

We had never seen this issue before and not entirely sure that it is related to @XertroV's problems but wanted to post it here. This node had been running for a month:

2018-07-20 03:12:15 UTC main INFO parity::run  Starting Parity/v1.10.6-unstable-bc0d134-20180605/x86_64-linux-gnu/rustc1.25.0

We're currently upgrading to v1.11.7 and will restart our node.

AyushyaChitransh · 2018-07-25T13:04:48Z

Facing this issue in v1.11.7(latest stable version). As is the case of other users, I too am running parity along with its monitoring via pm2 using eth-net-intelligenceAPI. So they are the onlyRPC connections to my parity node.

uluhonolulu · 2018-10-19T10:21:09Z

@Tbaut we're still experiencing this problem. We're doing stress tests over IPC, but sometimes need to check something over HTTP, and then we discover that RPC won't respond.

Our version is v2.0.7-stable-db3a989-20181010/x86_64-linux-gnu/rustc1.29.0

Here's a piece of the log. You can see that there was only one RPC connection just before the server stopped responding.

Oct 18 21:01:10 dev.prime.galaxias.io parity[19121]: 2018-10-18 21:01:10  Imported #534283 0xb0e8…6648 (0 txs, 0.00 Mgas, 6 ms, 0.53 KiB) + another 1 block(s) containing 0 tx(s)
Oct 18 21:01:36 dev.prime.galaxias.io parity[19121]: 2018-10-18 21:01:36     3/25 peers      7 MiB chain   86 MiB db  0 bytes queue   22 KiB sync  RPC:  2 conn,  631 req/s,  260 µs
Oct 18 21:02:11 dev.prime.galaxias.io parity[19121]: 2018-10-18 21:02:11     3/25 peers      2 KiB chain   86 MiB db  0 bytes queue   22 KiB sync  RPC:  1 conn,  604 req/s,  225 µs
Oct 18 21:02:28 dev.prime.galaxias.io parity[19121]: 2018-10-18 21:02:28  Incoming streams error, closing sever: Os { code: 24, kind: Other, message: "Too many open files" }

Tbaut · 2018-10-19T13:24:31Z

The PR that fixes the problem (paritytech/jsonrpc#321) has not been used in any Parity-Ethereum release yet. You can see in the cargo.toml that the jsonrpc branch used is still old. You need to wait until this is included before testing.

uluhonolulu · 2018-11-06T12:12:05Z

@Tbaut as far as I understood, #9786 added this fix to the 2.0.9. Is that correct?

Tbaut · 2018-11-06T13:55:58Z

Yes, the list of commits from jsonrpc v1.11 branch includes the fix, this same branch was used and back ported to stable.

XertroV changed the title ~~RPC threads dying randomly~~ RPC threads dying w/ "Too many open files" Jul 12, 2018

Tbaut added F2-bug 🐞 The client fails to follow expected behavior. M6-rpcapi 📣 RPC API. labels Jul 12, 2018

Tbaut added this to the 2.0 milestone Jul 12, 2018

5chdn modified the milestones: 2.0, 2.1 Jul 17, 2018

5chdn modified the milestones: 2.1, 2.2 Sep 11, 2018

5chdn added the P5-sometimesoon 🌲 Issue is worth doing soon. label Sep 25, 2018

seunlanlege self-assigned this Oct 4, 2018

seunlanlege mentioned this issue Oct 4, 2018

Augmented tokio::net::Incoming with Suspendable stream paritytech/jsonrpc#321

Merged

seunlanlege closed this as completed in paritytech/jsonrpc#321 Oct 5, 2018

tomusdrw mentioned this issue Oct 15, 2018

Node stops responding to RPC calls #8480

Closed

This was referenced Oct 19, 2018

deps: update jsonrpc-core #9780

Closed

Backports: parity stable 2.0.9 #9786

Merged

jam10o-new mentioned this issue Nov 19, 2018

Crash with "Too many open files" #9831

Closed

jam10o-new mentioned this issue Feb 7, 2019

Thread 'IO Worker #1' panicked at 'db get failed' #10308

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RPC threads dying w/ "Too many open files" #9102

RPC threads dying w/ "Too many open files" #9102

XertroV commented Jul 12, 2018

Tbaut commented Jul 12, 2018

ascjones commented Jul 12, 2018

XertroV commented Jul 12, 2018

XertroV commented Jul 12, 2018 •

edited

Loading

Tbaut commented Jul 12, 2018

XertroV commented Jul 13, 2018

XertroV commented Jul 14, 2018

XertroV commented Jul 17, 2018

debris commented Jul 19, 2018

XertroV commented Jul 20, 2018

vogelito commented Jul 20, 2018 •

edited

Loading

AyushyaChitransh commented Jul 25, 2018

uluhonolulu commented Oct 19, 2018

Tbaut commented Oct 19, 2018

uluhonolulu commented Nov 6, 2018

Tbaut commented Nov 6, 2018 •

edited

Loading

RPC threads dying w/ "Too many open files" #9102

RPC threads dying w/ "Too many open files" #9102

Comments

XertroV commented Jul 12, 2018

Tbaut commented Jul 12, 2018

ascjones commented Jul 12, 2018

XertroV commented Jul 12, 2018

XertroV commented Jul 12, 2018 • edited Loading

Tbaut commented Jul 12, 2018

XertroV commented Jul 13, 2018

XertroV commented Jul 14, 2018

XertroV commented Jul 17, 2018

debris commented Jul 19, 2018

XertroV commented Jul 20, 2018

vogelito commented Jul 20, 2018 • edited Loading

AyushyaChitransh commented Jul 25, 2018

uluhonolulu commented Oct 19, 2018

Tbaut commented Oct 19, 2018

uluhonolulu commented Nov 6, 2018

Tbaut commented Nov 6, 2018 • edited Loading

XertroV commented Jul 12, 2018 •

edited

Loading

vogelito commented Jul 20, 2018 •

edited

Loading

Tbaut commented Nov 6, 2018 •

edited

Loading