Unknown Event Turning Devices On/Off - PLEASE HELP! #498

jb1228 · 2022-03-15T04:30:43Z

jb1228
Mar 15, 2022

Some kind of event is triggering random Insteon devices to turn on (or off), and I have been pulling my hair out for months trying to narrow it down. This affects all my wired devices (plugin modules, wall dimmers/switches, I/O link, etc.).

It seems to happen when some other Insteon device is controlled. I originally thought my motion sensors were triggering it, but after disabling them (and the devices they controlled) it was still happening... although less so.

This is what I know so far:

There is no record of the device states changing in Home Assistant
There are only "CLEANUP_ACK" messages and the corresponding "No read handler found..." messages in the log (always the same grp/cmd - see below)
Sometimes all my lights turn on
Sometimes just a few specific lights turn on
At least one dimmer switch always turns off instead of on like the others, which I thought was weird
On a few occasions even my garage door would open (I/O Link)
I have factory reset and re-synced every device, which did not help
Only started happening when I switched to Home Assistant / Insteon-MQTT (I was previously using Indigo for many years)

This is a sample of the messages I see in the log for each device when it happens:

2021-12-30 22:55:21.188 INFO Protocol: Read 0x50: Std: 2f.c6.c6 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2021-12-30 22:55:21.189 INFO Protocol: No read handler found for message type 0x50: Std: 2f.c6.c6 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2021-12-30 22:55:21.443 INFO Protocol: Read 0x50: Std: 2f.d2.03 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2021-12-30 22:55:21.444 INFO Protocol: No read handler found for message type 0x50: Std: 2f.d2.03 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2021-12-30 22:55:21.683 INFO Protocol: Read 0x50: Std: 34.b7.8c Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2021-12-30 22:55:21.684 INFO Protocol: No read handler found for message type 0x50: Std: 34.b7.8c Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01

I can provide full logs with DEBUG enabled if it helps.

Does anyone have any suggestions on what might be causing this? It is driving me crazy! 😖

krkeegan · 2022-03-15T22:43:33Z

krkeegan
Mar 15, 2022
Collaborator

Can you run refresh --force on one of the devices and then post the output of print-db of that device?

I believe Cleanup messages are only sent to the device that sent the command, so this would suggest that your PLM is sending a group/scene command to the device. Since they are all acting at once, it seems like it would be a group that all of these devices are linked to.

I do have a broken PLM that every once in a while sends the wrong group number that would cause my bathroom group to turn on randomnly. I replaced it years ago and haven't had any issues, I always meant to rig up a system to sniff the packets to see if I could figure out why this happened.

4 replies

jb1228 Mar 15, 2022
Author

Sure! The outputs from the 2 commands are below.

Groups 23, 54, 55, 56, and 68 are all valid scenes I have setup. I wasn't sure if the 00.00.00 entry for group 0 was valid though, but all my devices seem to have it.

>>>insteon-mqtt config.yaml refresh --force office_lamp
Commanding dimmer device 47.8a.8c (office_lamp) cmd=refresh
Device 47.8a.8c refresh cmd2 0
Device 47.8a.8c db out of date (got 0 vs 0), refreshing
Entry: 0fff: 44.85.08 (modem)          grp:   1 type: RESP data: 0xff 0x1c 0x00
Entry: 0ff7: 44.85.08 (modem)          grp:   1 type: CTRL data: 0x03 0x00 0x01
Entry: 0fef: 44.85.08 (modem)          grp:  54 type: RESP data: 0xff 0x1b 0x01
Entry: 0fe7: 44.85.08 (modem)          grp:  55 type: RESP data: 0xbf 0x1a 0x01
Entry: 0fdf: 44.85.08 (modem)          grp:  56 type: RESP data: 0xff 0x19 0x01
Entry: 0fd7: 44.85.08 (modem)          grp:  68 type: RESP data: 0x33 0x1a 0x01
Entry: 0fcf: 44.85.08 (modem)          grp:  23 type: RESP data: 0xff 0x1b 0x01
Entry: 0fc7: 00.00.00                  grp:   0 type: RESP data: 0x00 0x00 0x00 (UNUSED) (LAST)
47.8a.8c database download complete
DeviceDb: (delta 0)
  0fcf: 44.85.08 (modem)          grp:  23 type: RESP data: 0xff 0x1b 0x01
  0fd7: 44.85.08 (modem)          grp:  68 type: RESP data: 0x33 0x1a 0x01
  0fdf: 44.85.08 (modem)          grp:  56 type: RESP data: 0xff 0x19 0x01
  0fe7: 44.85.08 (modem)          grp:  55 type: RESP data: 0xbf 0x1a 0x01
  0fef: 44.85.08 (modem)          grp:  54 type: RESP data: 0xff 0x1b 0x01
  0ff7: 44.85.08 (modem)          grp:   1 type: CTRL data: 0x03 0x00 0x01
  0fff: 44.85.08 (modem)          grp:   1 type: RESP data: 0xff 0x1c 0x00
Unused:
  0fc7: 00.00.00                  grp:   0 type: RESP data: 0x00 0x00 0x00 (UNUSED) (LAST)
Last:
  0fc7: 00.00.00                  grp:   0 type: RESP data: 0x00 0x00 0x00 (UNUSED) (LAST)
GroupMap
  1 -> ['44.85.08 (modem)']

Device 47.8a.8c received model information: DIMMABLE_LIGHTING (0x01): '2457D2' (0x0e) 'LampLinc (Dual-Band)' firmware: 0x43
Device 47.8a.8c engine version: i2c
Device refreshed

>>>insteon-mqtt config.yaml print-db office_lamp
Commanding dimmer device 47.8a.8c (office_lamp) cmd=print_db
47.8a.8c (office_lamp) device database
DeviceDb: (delta 0)
  0fcf: 44.85.08 (modem)          grp:  23 type: RESP data: 0xff 0x1b 0x01
  0fd7: 44.85.08 (modem)          grp:  68 type: RESP data: 0x33 0x1a 0x01
  0fdf: 44.85.08 (modem)          grp:  56 type: RESP data: 0xff 0x19 0x01
  0fe7: 44.85.08 (modem)          grp:  55 type: RESP data: 0xbf 0x1a 0x01
  0fef: 44.85.08 (modem)          grp:  54 type: RESP data: 0xff 0x1b 0x01
  0ff7: 44.85.08 (modem)          grp:   1 type: CTRL data: 0x03 0x00 0x01
  0fff: 44.85.08 (modem)          grp:   1 type: RESP data: 0xff 0x1c 0x00
Unused:
  0fc7: 00.00.00                  grp:   0 type: RESP data: 0x00 0x00 0x00 (UNUSED) (LAST)
Last:
  0fc7: 00.00.00                  grp:   0 type: RESP data: 0x00 0x00 0x00 (UNUSED) (LAST)
GroupMap
  1 -> ['44.85.08 (modem)']

Complete

krkeegan Mar 16, 2022
Collaborator

Yeah, I strongly suspect that somewhere the group number is getting corrupted. Do the devices that change ever correlate to one of your groups?

Sadly, there is no way to see this from Insteon-MQTT as the PLM always reports that it did things properly.

When I was having the issue I considered using this rtl_433 to sniff the insteon packets. It would require a cheap sdr and a raspberry pi (total ~$50). The idea was to log all traffic with timestamps and then when I saw something incorrect occur, I would go and checkout the logs.

I chose to buy a new PLM instead and the issue went away.

Sorry I don't have a better suggestion or fix. I don't really have any good explanation for why a switch from indigo would cause this.

krkeegan Mar 16, 2022
Collaborator

One more thought, make sure you don't have anything in your setup that is triggering the PLM Scene 1. Groups 0/1 on the PLM are reserved for linking.

jb1228 Mar 21, 2022
Author

Do the devices that change ever correlate to one of your groups?

They do not. The CLEANUP_ACK messages always reference group 1.

One more thought, make sure you don't have anything in your setup that is triggering the PLM Scene 1. Groups 0/1 on the PLM are reserved for linking.

I made sure group 1 wasn't being used on the PLM. However, group 1 is used on the motion sensors which are linked directly to lights, but I assume that is as intended.

I do have another PLM I can reset and try out. Hopefully that will fix it. I just hope that I can switch the PLM without having to redo all my devices in Home Assistant.

Thank you very much! I appreciate you taking the time to respond! 😃

schirner · 2022-04-21T01:50:53Z

schirner
Apr 21, 2022

I think I have the same issue as originally described. Sometimes when I send a command from the modem (it could be a modem virtual scene or a device control) a bunch of random lights turn on. The devices turned on do not share the same modem group (also not the same group that is created in pair/link process). The number of impacted devices varies from instance to instance.

I am able to increase the chance of this happening by sending something from the modem at the same time as some device induced traffic occurs. This makes me believe it is a race condition somewhere in the software/firmware/hardware stack . I did not spend the time to dissect where it happens. My hunch is that it could be the modem - since it appears both on a device command (turn switch on/off) as well as turn on/off a modem virtual scene. How the modem sends the scene command out is completely in the modem's firmware control. This hunch is also supported by krkeegan's observation that it went away with a new modem (but we can't buy new PLMs anymore #501).

I have decreased the chance that this error is happening. I added a delay to Insteon message triggered automation. Example: bath room light on -> 10s -> turn bath fan on.

3 replies

jb1228 May 1, 2022
Author

I completely agree. The likelihood of this issue happening seems to increase the more simultaneous traffic occurs.

I finally had time to factory reset my PLM and all devices the other day. Then join, pair, and sync them again. Unfortunately, the issue didn't go away.

Luckily I have another PLM as a backup. So I switched to that last night and factory reset everything again.

I'll let it be for a few days and post an update.

I'm hoping it works and my backup PLM can last. Used ones are going for over $500 on eBay now. I've been slowly migrating to Zigbee and Z-Wave; but if I have to replace all my Insteon devices at once, my wallet will cry.

jb1228 May 5, 2022
Author

Update:

Since replacing my PLM, I have had no issues whatsoever. 😁 Finally!

Unfortunately this mean my original PLM is faulty.

I have automations that control my lights in each room when motion is detected. Like you, I previously added a delay hoping that would help. I just tried removing the delay, as well as changed the automation from "queued" to "parallel". Everything is working great, even after rapidly triggering several simultaneous requests.

jb1228 Jun 2, 2022
Author

Since replacing my PLM, I have had no issues whatsoever.

CORRECTION!

Unfortunately the problem still happens after replacing the PLM. However, it is MUCH less frequent than it was.

So either both PLMs are faulty, or the issue lies elsewhere. 😕

EnGamma · 2023-07-31T01:49:17Z

EnGamma
Jul 31, 2023

I am having the same or similar problem. For me, the consequences are painful because in addition to about ten lights and fans that come on there are 4 iolincs that 1) open my garage doors, 2) trigger a home security alarm with remote monitoring and 3) trigger a remote start of my car. ( 1 and 2 are especially harmful to my marriage.)

I have had insteon-mqtt up and running for more than a year and had not had this problem for about the first 8 months. I'm in the process of looking for clues in 13 months of logs. One common element evident in the logs is a flood of "No read handler found" messages for 0x56 All link fail (mostly) and 0x50 (clean up), but so far I think the real tell-tale is the 0x56 messages. I'm not saying these are the cause, just the log evidence the problem occurred. I'm in the process of looking for any common actions in the logs that lead up to this flurry of "No read handler" messages.

Two requests for help while I try to troubleshoot this: 1) can someone tell me what in the normal function of insteon-mqtt triggers "no read hander" messages for All link messages and 2) if anyone has any other ideas on what I should be looking for as I comb through the logs preceding these events, please let me know.

I've run some statistics on the # of "no read handler" messages and there has been a dramatic increase in the number of these in the past 4 months or so of the 13+ months I've had this running.

Here's the log during a minute when one of these events occurred:

2023-06-07 01:08:38.872 INFO Broadcast: Handling all link broadcast for 27.f0.83 'ms_i_tvroom'
2023-06-07 01:08:38.873 INFO Base: Device 27.f0.83 broadcast grp: 1 on: True mode: normal
2023-06-07 01:08:38.874 INFO Base: Setting device 27.f0.83 (ms_i_tvroom) on True level None normal device
2023-06-07 01:08:38.874 INFO StateTopic: MQTT received state 27.f0.83 (ms_i_tvroom) on: {'is_on': True, 'level': None, 'mode': <Mode.NORMAL: 'normal'>, 'button': 1, 'reason': 'device'}
2023-06-07 01:08:38.877 INFO Base: 27.f0.83 (ms_i_tvroom) broadcast to 3d.9a.1c for group 1
2023-06-07 01:08:39.238 INFO Protocol: Read 0x50: Std: 27.f0.83 Type.ALL_LINK_CLEANUP mh:1 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:08:40.743 INFO Mqtt: MQTT message insteon/command/modem b'{ "cmd" : "scene" , "name" : "tv_lamp_scn" , "is_on" : 1 }'
2023-06-07 01:08:40.744 UI Mqtt: Commanding Modem device 3d.9a.1c (modem) cmd=scene
2023-06-07 01:08:40.745 INFO Modem: Modem scene 50 on=on
2023-06-07 01:08:40.745 INFO Protocol: Write message to modem: Modem Scene: grp: 50 cmd: 0x11 0x00
2023-06-07 01:08:40.758 INFO Protocol: Read 0x50: Std: 27.f0.83 Type.ALL_LINK_CLEANUP mh:3 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:40.759 INFO Broadcast: Handling all link broadcast for 27.f0.83 'ms_i_tvroom'
2023-06-07 01:08:40.760 INFO Base: Device 27.f0.83 broadcast grp: 1 on: True mode: normal
2023-06-07 01:08:40.761 INFO Base: Setting device 27.f0.83 (ms_i_tvroom) on True level None normal device
2023-06-07 01:08:40.761 INFO StateTopic: MQTT received state 27.f0.83 (ms_i_tvroom) on: {'is_on': True, 'level': None, 'mode': <Mode.NORMAL: 'normal'>, 'button': 1, 'reason': 'device'}
2023-06-07 01:08:40.764 INFO Base: 27.f0.83 (ms_i_tvroom) broadcast to 3d.9a.1c for group 1
2023-06-07 01:08:41.062 INFO Protocol: Read 0x61: Modem Scene: grp: 50 cmd: 0x11 0x00 ack: True
2023-06-07 01:08:41.285 INFO Protocol: Read 0x50: Std: 27.f0.83 Type.ALL_LINK_BROADCAST mh:3 hl:3 grp: 01 cmd: 06 00
2023-06-07 01:08:41.685 INFO Protocol: Read 0x50: Std: 27.f0.83 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:41.686 INFO Protocol: No read handler found for message type 0x50: Std: 27.f0.83 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:43.957 INFO Protocol: Read 0x56: All link fail: 14.5b.e1 grp: 1
2023-06-07 01:08:43.958 INFO Protocol: No read handler found for message type 0x56: All link fail: 14.5b.e1 grp: 1
2023-06-07 01:08:44.725 INFO Protocol: Read 0x50: Std: 31.db.cd Type.ALL_LINK_BROADCAST mh:3 hl:3 grp: 01 cmd: 06 00
2023-06-07 01:08:44.917 INFO Protocol: Read 0x50: Std: 31.db.cd Type.ALL_LINK_BROADCAST mh:3 hl:3 grp: 01 cmd: 06 00
2023-06-07 01:08:44.917 INFO Protocol: Ignored duplicate Std: 31.db.cd Type.ALL_LINK_BROADCAST mh:3 hl:3 grp: 01 cmd: 06 00
2023-06-07 01:08:47.221 INFO Protocol: Read 0x56: All link fail: 31.7d.29 grp: 1
2023-06-07 01:08:47.222 INFO Protocol: No read handler found for message type 0x56: All link fail: 31.7d.29 grp: 1
2023-06-07 01:08:49.412 INFO Protocol: Read 0x56: All link fail: 40.46.10 grp: 1
2023-06-07 01:08:49.414 INFO Protocol: No read handler found for message type 0x56: All link fail: 40.46.10 grp: 1
2023-06-07 01:08:50.036 INFO Protocol: Read 0x50: Std: 15.8a.5c Type.CLEANUP_ACK mh:2 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:50.038 INFO Protocol: No read handler found for message type 0x50: Std: 15.8a.5c Type.CLEANUP_ACK mh:2 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:50.308 INFO Protocol: Read 0x50: Std: 15.8a.c0 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:50.310 INFO Protocol: No read handler found for message type 0x50: Std: 15.8a.c0 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:52.531 INFO Protocol: Read 0x56: All link fail: 14.45.70 grp: 1
2023-06-07 01:08:52.532 INFO Protocol: No read handler found for message type 0x56: All link fail: 14.45.70 grp: 1
2023-06-07 01:08:53.155 INFO Protocol: Read 0x50: Std: 14.7a.74 Type.CLEANUP_ACK mh:2 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:53.156 INFO Protocol: No read handler found for message type 0x50: Std: 14.7a.74 Type.CLEANUP_ACK mh:2 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:53.427 INFO Protocol: Read 0x50: Std: 15.31.91 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:53.428 INFO Protocol: No read handler found for message type 0x50: Std: 15.31.91 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:55.651 INFO Protocol: Read 0x56: All link fail: 3d.71.67 grp: 1
2023-06-07 01:08:55.651 INFO Protocol: No read handler found for message type 0x56: All link fail: 3d.71.67 grp: 1
2023-06-07 01:08:55.891 INFO Protocol: Read 0x50: Std: 15.bd.bf Type.CLEANUP_ACK mh:1 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:08:55.891 INFO Protocol: No read handler found for message type 0x50: Std: 15.bd.bf Type.CLEANUP_ACK mh:1 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:08:58.067 INFO Protocol: Read 0x56: All link fail: 15.2a.3b grp: 1
2023-06-07 01:08:58.068 INFO Protocol: No read handler found for message type 0x56: All link fail: 15.2a.3b grp: 1
2023-06-07 01:08:58.259 INFO Protocol: Read 0x50: Std: 1d.e7.ff Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:58.260 INFO Protocol: No read handler found for message type 0x50: Std: 1d.e7.ff Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:58.499 INFO Protocol: Read 0x50: Std: 3c.41.86 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:58.500 INFO Protocol: No read handler found for message type 0x50: Std: 3c.41.86 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:08:59.603 INFO Protocol: Read 0x50: Std: 06.e7.ee Type.CLEANUP_ACK mh:3 hl:3 grp: 01 cmd: 11 01
2023-06-07 01:08:59.604 INFO Protocol: No read handler found for message type 0x50: Std: 06.e7.ee Type.CLEANUP_ACK mh:3 hl:3 grp: 01 cmd: 11 01
2023-06-07 01:08:59.635 INFO Mqtt: MQTT message insteon/command/modem b'{ "cmd" : "scene" , "name" : "basement_stairs_hall_scn" , "is_on" : 1 }'
2023-06-07 01:08:59.636 UI Mqtt: Commanding Modem device 3d.9a.1c (modem) cmd=scene
2023-06-07 01:08:59.636 INFO Modem: Modem scene 21 on=on
2023-06-07 01:09:00.060 INFO Mqtt: MQTT message insteon/command/modem b'{ "cmd" : "scene" , "name" : "basement_stairs_hall_scn" , "is_on" : 1 }'
2023-06-07 01:09:00.061 UI Mqtt: Commanding Modem device 3d.9a.1c (modem) cmd=scene
2023-06-07 01:09:00.062 INFO Modem: Modem scene 21 on=on
2023-06-07 01:09:01.922 INFO Protocol: Read 0x56: All link fail: 0d.38.05 grp: 1
2023-06-07 01:09:01.922 INFO Protocol: No read handler found for message type 0x56: All link fail: 0d.38.05 grp: 1
2023-06-07 01:09:02.162 INFO Protocol: Read 0x50: Std: 11.37.0b Type.CLEANUP_ACK mh:1 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:09:02.162 INFO Protocol: No read handler found for message type 0x50: Std: 11.37.0b Type.CLEANUP_ACK mh:1 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:09:04.337 INFO Protocol: Read 0x56: All link fail: 11.38.ef grp: 1
2023-06-07 01:09:04.338 INFO Protocol: No read handler found for message type 0x56: All link fail: 11.38.ef grp: 1
2023-06-07 01:09:06.513 INFO Protocol: Read 0x56: All link fail: 43.76.8e grp: 1
2023-06-07 01:09:06.515 INFO Protocol: No read handler found for message type 0x56: All link fail: 43.76.8e grp: 1
2023-06-07 01:09:08.689 INFO Protocol: Read 0x56: All link fail: 43.76.d8 grp: 1
2023-06-07 01:09:08.690 INFO Protocol: No read handler found for message type 0x56: All link fail: 43.76.d8 grp: 1
2023-06-07 01:09:09.264 INFO Protocol: Read 0x50: Std: 39.a3.ba Type.CLEANUP_ACK mh:2 hl:2 grp: 01 cmd: 11 01
2023-06-07 01:09:09.265 INFO Protocol: No read handler found for message type 0x50: Std: 39.a3.ba Type.CLEANUP_ACK mh:2 hl:2 grp: 01 cmd: 11 01
2023-06-07 01:09:09.408 INFO Protocol: Read 0x50: Std: 39.a3.ba Type.CLEANUP_ACK mh:2 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:09:09.409 INFO Protocol: Ignored duplicate Std: 39.a3.ba Type.CLEANUP_ACK mh:2 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:09:09.632 INFO Protocol: Read 0x50: Std: 1d.ea.60 Type.CLEANUP_ACK mh:1 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:09:09.633 INFO Protocol: No read handler found for message type 0x50: Std: 1d.ea.60 Type.CLEANUP_ACK mh:1 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:09:10.476 INFO Mqtt: MQTT message insteon/command/modem b'{ "cmd" : "scene" , "name" : "tv_lamp_scn" , "is_on" : 1 }'
2023-06-07 01:09:10.476 UI Mqtt: Commanding Modem device 3d.9a.1c (modem) cmd=scene
2023-06-07 01:09:10.476 INFO Modem: Modem scene 50 on=on
2023-06-07 01:09:11.904 INFO Protocol: Read 0x56: All link fail: 28.c3.05 grp: 1
2023-06-07 01:09:11.905 INFO Protocol: No read handler found for message type 0x56: All link fail: 28.c3.05 grp: 1
2023-06-07 01:09:13.792 INFO Protocol: Read 0x56: All link fail: 00.fc.09 grp: 1
2023-06-07 01:09:13.792 INFO Protocol: No read handler found for message type 0x56: All link fail: 00.fc.09 grp: 1
2023-06-07 01:09:14.048 INFO Protocol: Read 0x50: Std: 27.f0.83 Type.ALL_LINK_CLEANUP mh:3 hl:0 grp: 01 cmd: 13 01
2023-06-07 01:09:14.048 INFO Broadcast: Handling all link broadcast for 27.f0.83 'ms_i_tvroom'
2023-06-07 01:09:14.049 INFO Base: Device 27.f0.83 broadcast grp: 1 on: False mode: normal
2023-06-07 01:09:14.049 INFO Base: Setting device 27.f0.83 (ms_i_tvroom) on False level None normal device
2023-06-07 01:09:14.049 INFO StateTopic: MQTT received state 27.f0.83 (ms_i_tvroom) on: {'is_on': False, 'level': None, 'mode': <Mode.NORMAL: 'normal'>, 'button': 1, 'reason': 'device'}
2023-06-07 01:09:14.050 INFO Base: 27.f0.83 (ms_i_tvroom) broadcast to 3d.9a.1c for group 1
2023-06-07 01:09:15.168 INFO Protocol: Read 0x50: Std: 16.99.82 Type.CLEANUP_ACK mh:3 hl:2 grp: 01 cmd: 11 01
2023-06-07 01:09:15.170 INFO Protocol: No read handler found for message type 0x50: Std: 16.99.82 Type.CLEANUP_ACK mh:3 hl:2 grp: 01 cmd: 11 01
2023-06-07 01:09:15.344 INFO Protocol: Read 0x50: Std: 16.99.82 Type.CLEANUP_ACK mh:3 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:09:15.345 INFO Protocol: No read handler found for message type 0x50: Std: 16.99.82 Type.CLEANUP_ACK mh:3 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:09:15.584 INFO Protocol: Read 0x50: Std: 27.70.2b Type.ALL_LINK_CLEANUP mh:3 hl:2 grp: 01 cmd: 13 01
2023-06-07 01:09:15.585 INFO Broadcast: Handling all link broadcast for 27.70.2b 'ms_i_kitchen'
2023-06-07 01:09:15.586 INFO Base: Device 27.70.2b broadcast grp: 1 on: False mode: normal
2023-06-07 01:09:15.587 INFO Base: Setting device 27.70.2b (ms_i_kitchen) on False level None normal device
2023-06-07 01:09:15.587 INFO StateTopic: MQTT received state 27.70.2b (ms_i_kitchen) on: {'is_on': False, 'level': None, 'mode': <Mode.NORMAL: 'normal'>, 'button': 1, 'reason': 'device'}
2023-06-07 01:09:15.590 INFO Base: 27.70.2b (ms_i_kitchen) broadcast to 3d.9a.1c for group 1
2023-06-07 01:09:15.936 INFO Protocol: Read 0x50: Std: 27.70.2b Type.ALL_LINK_BROADCAST mh:3 hl:3 grp: 01 cmd: 06 00
2023-06-07 01:09:20.254 INFO Protocol: Read 0x56: All link fail: 27.70.2b grp: 1
2023-06-07 01:09:20.255 INFO Protocol: No read handler found for message type 0x56: All link fail: 27.70.2b grp: 1
2023-06-07 01:09:20.878 INFO Protocol: Read 0x50: Std: 25.18.d5 Type.CLEANUP_ACK mh:2 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:09:20.879 INFO Protocol: No read handler found for message type 0x50: Std: 25.18.d5 Type.CLEANUP_ACK mh:2 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:09:21.038 INFO Protocol: Read 0x50: Std: 25.18.d5 Type.CLEANUP_ACK mh:2 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:09:21.039 INFO Protocol: No read handler found for message type 0x50: Std: 25.18.d5 Type.CLEANUP_ACK mh:2 hl:0 grp: 01 cmd: 11 01
2023-06-07 01:09:23.374 INFO Protocol: Read 0x50: Std: 35.e6.b0 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:09:23.375 INFO Protocol: No read handler found for message type 0x50: Std: 35.e6.b0 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:09:23.630 INFO Protocol: Read 0x50: Std: 2d.3a.f3 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:09:23.631 INFO Protocol: No read handler found for message type 0x50: Std: 2d.3a.f3 Type.CLEANUP_ACK mh:1 hl:1 grp: 01 cmd: 11 01
2023-06-07 01:09:25.854 INFO Protocol: Read 0x56: All link fail: 2d.85.74 grp: 1
2023-06-07 01:09:25.855 INFO Protocol: No read handler found for message type 0x56: All link fail: 2d.85.74 grp: 1
2023-06-07 01:09:28.029 INFO Protocol: Read 0x56: All link fail: 2c.f4.d6 grp: 1
2023-06-07 01:09:28.030 INFO Protocol: No read handler found for message type 0x56: All link fail: 2c.f4.d6 grp: 1
2023-06-07 01:09:30.205 INFO Protocol: Read 0x56: All link fail: 31.98.b0 grp: 1
2023-06-07 01:09:30.205 INFO Protocol: No read handler found for message type 0x56: All link fail: 31.98.b0 grp: 1
2023-06-07 01:09:32.380 INFO Protocol: Read 0x56: All link fail: 14.5b.e1 grp: 1
2023-06-07 01:09:32.381 INFO Protocol: No read handler found for message type 0x56: All link fail: 14.5b.e1 grp: 1
2023-06-07 01:09:34.588 INFO Protocol: Read 0x56: All link fail: 31.7d.29 grp: 1
2023-06-07 01:09:34.588 INFO Protocol: No read handler found for message type 0x56: All link fail: 31.7d.29 grp: 1
2023-06-07 01:09:36.764 INFO Protocol: Read 0x56: All link fail: 21.5d.5c grp: 1
2023-06-07 01:09:36.765 INFO Protocol: No read handler found for message type 0x56: All link fail: 21.5d.5c grp: 1
2023-06-07 01:09:36.766 INFO Protocol: Read 0x58: All link status ack: 1
2023-06-07 01:09:36.766 INFO Modem: 3d.9a.1c (modem) broadcast to 2f.0f.58 for group 50
2023-06-07 01:09:36.767 INFO ResponderBase: Device 2f.0f.58 (tv_lamp) processing on/off group 50 cmd from 3d.9a.1c
2023-06-07 01:09:36.768 INFO Base: Setting device 2f.0f.58 (tv_lamp) on True level 255 normal scene
2023-06-07 01:09:36.768 INFO StateTopic: MQTT received state 2f.0f.58 (tv_lamp) on: {'is_on': True, 'level': 255, 'mode': <Mode.NORMAL: 'normal'>, 'button': 1, 'reason': 'scene'}
2023-06-07 01:09:36.771 UI Mqtt: Scene command complete

1 reply

jb1228 Aug 2, 2023
Author

Unfortunately I cannot help much since I was never able to resolve the issue using this integration.

Although, I did migrate to the official integration and have not seen the problem since.

However, shortly after migrating, I also decided to replace my I/O Link (garage door opener) and a few problematic dimmer modules with non-Insteon equivalents. So I suppose it is possible the problem still exists, and I simply haven't noticed it.

EnGamma · 2023-08-02T23:37:07Z

EnGamma
Aug 2, 2023

@jb1228 Thanks for replying. I'll check out the HomeAssistant official integration link you provided. Right now, I'm using insteon-mqtt outside of HomeAssistant.

…

On Wed, Aug 2, 2023 at 7:11 PM jb1228 ***@***.***> wrote: Unfortunately I cannot help much since I was never able to resolve the issue using this integration. Although, I did migrate to the official integration <https://www.home-assistant.io/integrations/insteon/> *and have not seen the problem since.* However, shortly after migrating, I *also* decided to replace my I/O Link (garage door opener) and a few problematic dimmer modules with non-Insteon equivalents. So I suppose it is possible the problem still exists, and I simply haven't noticed it. — Reply to this email directly, view it on GitHub <#498 (reply in thread)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACSR2ELE2RJBQPIP6GTICK3XTLNDXANCNFSM5QXPWPPQ> . You are receiving this because you commented.Message ID: ***@***.***>

0 replies

schirner · 2023-08-08T20:48:03Z

schirner
Aug 8, 2023

I have battled with this problem for a while. I might have a work around.

I could fairly regularly produce the issue by turning off a large virtual scene (I believe scene 21 with 16 members) followed by dedicated on commands to a few other devices (all as part of an automation). Then, some other random devices suddenly turn on. I get a flood of "no read hander" messages. I believe those are ACKs from each device that got a spurious message and subsequently changed state (I spot checked a few). I observed that insteon-mqtt does not realize the unintended state change. Since insteon-mqtt does not know about the state change, I believe the outgoing spurious message is probably something like a scene command.

I believe the race condition happens when a message from a device arrives at the modem (e.g., the ACKs for the original scene command), when at the same time the modem tries to send a command out (e.g. my individual on commands).

Observation: all devices are linked as responders to the virtual scene 1. This is as result of either pair or join, I forgot which. Nonetheless, we do not use this modem virtual group for controlling devices typically. My hypothesis is that in the failure event the modem sends something similar to a virtual scene 1 message. The flood of ACKs coming back in would be an indication of this as they all refer to group 1.

Work around: Delete each device's CTRL entry from modem's DB . E.g., by:

<insteon command> db-delete -o modem ctrl <device> 1;

Note, this does not touch the device DB. The device still has a corresponding RESP entry in its table. The device still reacts on commands from the modem - I did not notice any difference in controlling the devices or getting responses.

Since I did this >6 months ago, I have not seen these massive random events anymore. I have even removed the delays in automations that send command(s) in reaction to a particular state update (i.e. received message). And still the system is stable.

I did not change any hardware (no new modem, no changed devices, no change in power line topology). So it is a purely SW change that made the trick for me.

Disclaimer: My work around may not work for you, and I don't know if deleting group 1 CTRL has other side effects. It is plausible that the work around depends on how the modem stores the DB (table). If that would be true, then the sequence on how the modem is programmed may make a difference. I programmed my modem many years ago when moving from one modem to another: I believe: I had a factory reset modem, did first the pair / join on all (or almost all) devices, and then programmed all the links.

Detail background: My assumption is that a race condition (simultaneous send and receive) causes the modem FW to send message(s) to the virtual scene 1 (group 1). By deleting the group 1 CTRL entries, the race condition probably still happens, but the effect of sending the spurious message(s) does not happen anymore. Please take all this with a grain of salt, I am purely speculating from observing black box behavior.

This seems to be a known issue and documented on the Universal Devices (ISY) wiki, and appears in blog posts, e.g., 1, 2.

@EnGamma if you want to give it a shot. You could validate that indeed the devices with the "No read handler found" messages change state. Pick one of them and apply the work around. Then, trigger the situation and see if the device with work around remains unchanged while others change in the error situation.

2 replies

EnGamma Aug 10, 2023

Thanks for your detailed answer. I thought I replied to you immediately, but it looks like I must have forgot to press send.

You've given me some good ideas on where to go next. When I get some time to methodically go after this using your ideas, I'll post the results here.

EnGamma Aug 11, 2023

Update:

I used the workaround @schirner suggested and it seems to be working. I think I have 14 devices that spuriously turn on or change state. I implemented @schirner's workaround first on the most impactful devices, which are 4 iolincs peforming important functions. I had one of these events occur within an hour and the iolincs rode through without activating!! I then moved on to the rest of the 14 with the exception of one ceiling light that is my 'canary' indicator that this problem is occurring. The fix seems to be working on all those devices, but I need to give it some time to confirm for sure.

Helpful tip for others:

I run insteon-mqtt in a docker container so I don't have direct access to the command line interface. Here is the 1-liner command I am using to delete the database entries on at a time:
docker version of the command:

docker exec insteon-mqtt insteon-mqtt config/config.yaml db-delete -o modem ctrl <device_name_or_address> 1

Note that there is no command line feedback and there is a several second delay after executing the command, but if you examine the log you will see the resulting modem db deletion

A few more notes on my particular circumstance:

I believe one or two motion sensors in my basement area are key to initiation; with only a few exceptions this problme occurs when I am in the basement and usually while moving around. I put one of those two motion sensors back up on the wall yesterday after weeks of hiding it because I suspected it was a trigger. Perhaps a coincidence, but I had about 6 or more of these events last evening after putting the MS back up on the wall. It served well as a test of the modem db changes I made, but see my next note.
I don't think your fix is stopping the blizzard of activity associated with the "no read handlers log entries". They are still occurring, but just not triggering spurious actuation of the devices.
That is a big improvement but there are still negative affects that persist, the most significant being that the PLM is much less responsive to other commands while this is happening.

@schirner thanks again for you guidance.

schirner · 2023-08-11T18:52:11Z

schirner
Aug 11, 2023

This is awesome news. Thanks so much for reporting your findings. I was so hesitant in publishing as I thought my workaround might be too specific to my particular setup. Your (partial) success is extremely encouraging.

Thanks also for the additional description and guidance!

It is curious to hear that you still have a blizzard of spurious messages without a handler. Having a few is OK. if multiple responses to one request are received, the first will be handled and all others get the warning. But this certainly would not be a "blizzard of activity".

Some questions to develop my thinking about the error behavior a little more: Do the "no message handler" messages come only from devices that still have a CNTRL entry on the modem side? Or do they also come from devices that you applied the work around to.

I did remove them for all devices in my network (apparently in 12/2022). I just checked my current logs which go back to 5/14/2023 (260k lines). I am surprised: they do not have any unaccounted "no message handler" messages!

To make log checking easier, here is my python script detect_spurious.txt (I renamed to .txt to attach to this issue). It ignores "no message handler" messages for 1 second after a command was sent to the same device.

./detect_spurious.py insteon_log.log

1 reply

EnGamma Aug 11, 2023

@schirner thanks for the python script. I can't stand doing address translations in my head, so I made some minor enhancements. I'm sending along the output in parallel with my own review, but I've already seen some surprising things based on a quick scan:

I added another log filter for the modem database delete actions I took yesterday--those lines tell you what devices have been turning on spuriously. (The one known remaining spurious device that I have not 'fixed' yet is basement_kpd (light controlled by button 1 on that keypadlinc)).
several of the devices this reveals have not been involved in any visible way (e.g., nightstand has not been operated in weeks so not at all clear why this involved)
several light switches with xxxx_2 name suffixes -- these are secondary switches in 3- or 4-way switch setups (i.e., where 2 or 3 wall switches control the same light) -- I'm wondering if these are generating these messages as side effects of the scenes that keep all those switches in sync.

insteon-mqtt_spurious.20230727-20230811.txt

schirner · 2023-08-16T21:59:33Z

schirner
Aug 16, 2023

Your filter script output is very different from how mine looked. I had few events with a very large number of different devices. You have many events with relatively few devices.

My guess is that there is something else going on in your system, which is just flagged by my script. But I would assume there is a different underlying reason.

It is good that my work around helped you for the spurious events. However, I don't think it will avoid the secondary (remaining) problem.

Can you share your original log file? Maybe I can spot something -- albeit I am no expert.

PS: The original problem seems to be a known issue and documented on the Universal Devices (ISY) wiki, and appears in blog posts 1,2. I also added this to my earlier post

1 reply

EnGamma Aug 20, 2023

@schirner Yes, I've since realized that there is more going on here . For one, I've noticed that the vast majority of my No Read Handler messages are "All link fail" log entries, so I modified your script to look for a variety of different "No Read Handler" messages. As I write this I have about 17 hours of logs for today. In that period, I have 34 event groupings of spurious messages from about 60 different devices of the following log entry types:

CLEANUP_ACK 303
All link fail 1271
ALL_LINK_CLEANUP 4

Yes, there are over 1500 spurious messages in less than one day. One grouping of message floods had 267 spurious messages.

I also modified your script to allow adjusting from the command line the 2 timing windows you use. For the action-response window, used to ignore presumed valid responses to actions, I have set to 2s (your default) but I adjusted the grouping window to 60 seconds to distinguish the different events/periods of floods of spurious messages. During these message floods, the system is pretty much unresponsive until they are done. You'll see from the files I upload that they can last for as long as a couple of minutes.

Uploads here:
2023-08-20.Spurious_Output.txt
insteon_mqtt.2023-08-20.log

I don't understand enough about the Insteon communication handshaking to know what would be triggering these all-link messages. Any ideas?

Also, as you have noted the actual state changes of the individual devices are not seen by insteon-mqtt unless a refresh is done.

I'll look into the links you posted in the last message.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unknown Event Turning Devices On/Off - PLEASE HELP! #498

{{title}}

Replies: 7 comments 12 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

Unknown Event Turning Devices On/Off - PLEASE HELP! #498

Replies: 7 comments · 12 replies

krkeegan Mar 15, 2022 Collaborator

jb1228 Mar 15, 2022 Author

krkeegan Mar 16, 2022 Collaborator

krkeegan Mar 16, 2022 Collaborator

jb1228 Mar 21, 2022 Author

jb1228 May 1, 2022 Author

jb1228 May 5, 2022 Author

jb1228 Jun 2, 2022 Author

jb1228 Aug 2, 2023 Author

Replies: 7 comments 12 replies

krkeegan
Mar 15, 2022
Collaborator

jb1228 Mar 15, 2022
Author

krkeegan Mar 16, 2022
Collaborator

krkeegan Mar 16, 2022
Collaborator

jb1228 Mar 21, 2022
Author

jb1228 May 1, 2022
Author

jb1228 May 5, 2022
Author

jb1228 Jun 2, 2022
Author

jb1228 Aug 2, 2023
Author