Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stuck notifications #24392

Open
53 of 84 tasks
Johennes opened this issue Feb 1, 2023 · 157 comments
Open
53 of 84 tasks

Stuck notifications #24392

Johennes opened this issue Feb 1, 2023 · 157 comments
Assignees
Labels
A-Notifications Team: App Z-AirFocus Moving issues from GH to AirFocus purposefully using this tag. Z-Chronic

Comments

@Johennes
Copy link
Contributor

Johennes commented Feb 1, 2023

We are experiencing multiple issues in the area of "stuck" notifications and unread markers. Many are related to the way receipts interact with threaded messages.

Symptoms

We are seeing different symptoms of the problem:

  • "Stuck" unread dots and notification counters: Even when I have read all messages in a room, the room is still marked as unread or having notifications.
  • Unread dots or notification counters returning on app startup: When I restart the app (or refresh the page on Web) rooms which I have already read re-appear as unread.
  • Notification counters growing and shrinking spontaneously after entering or scrolling in a room.

Spec-level causes

Message ordering

Fundamentally, in order to interpret the meaning of a receipt that says "I have read everything up to here", we need to know what order messages are in. This is not clear in the spec, and we propose to make it clear and explicit in MSC4033.

In the meantime, Element Web uses a combination of "sync" order (the implicit order of events arriving via a /sync request) and "timestamp" order (using the ts property within events).

Some of the existing bugs are probably caused by this inconsistency, but it is not clear yet how many: we believe there are also bugs in the implementation that cause additional problems, and this theoretical inconsistency is only the cause of a few problems.

Which thread the root belongs to

The spec has what we consider a bug when it talks about which thread the root message belongs to, which has been reflected in client code, making it inconsistent with the server implementation (at least on the Synapse server). We have a proposal to fix this bug in MSC4037.

Identifying which thread any message is in

It is sometimes difficult for clients to identify which thread an event belongs to, meaning that a receipt pointing to it is sometimes ignored. We have begun drafting MSC4023 to address this.

Other

Previously, we believed that MSC3981 (recursive relations) would solve some of the problems, but since that MSC does not solve the event-ordering problem (because the events from the /relations API are returned in "topological" order) we no longer believe it is important, except as a performance optimisation.
Code-level causes

Code-level causes

We have found and fixed several bugs in the Element Web code that were caused by an incomplete understanding of the meaning of threaded and unthreaded read receipts. We anticipate that some more exist.

(We believe that the primary reason why we're not seeing the same problems on mobile is that the apps persist events they've received whereas Element Web has to re-fetch from scratch after every launch. As a result, any issue in the unread state logic, strikes again and again. The apps also use a single timeline whereas Element Web maintains one timeline per thread in addition to the main timeline in every room.)

High-level plan of actions

Tasks

We believe a lot of progress can still be made without spec changes. So we're slightly deprioritising work on the MSCs.

New issue inbox

The following is a holding area for newly reported issues that require review. Once reviewed, issues should either be moved to one of the other task lists below or, if not applicable, removed from this epic.

Tasks

  1. A-Notif-Panel A-Notifications O-Occasional S-Major T-Defect X-Regression
  2. A-Notifications A-Threads O-Frequent S-Major T-Defect
  3. A-Notifications T-Defect X-Needs-Investigation
  4. A-Notifications A-Threads O-Occasional S-Major T-Defect
  5. A-Notifications O-Occasional S-Major T-Defect
  6. A-Notifications O-Occasional S-Critical T-Defect
  7. A-Notifications A-Threads O-Occasional S-Major T-Defect
  8. A-Notifications O-Occasional S-Critical T-Defect
  9. A-Notifications O-Occasional S-Major T-Defect
  10. A-Notifications A-Threads O-Occasional S-Major T-Defect
  11. A-Threads O-Frequent S-Minor T-Defect
  12. A-Notifications A-Threads O-Occasional S-Minor T-Defect
    t3chguy
  13. A-Notifications O-Occasional S-Minor T-Defect
    t3chguy
  14. A-Notifications A-Redaction A-Threads O-Occasional S-Minor T-Defect
  15. 1 of 2
    A-Notifications A-Threads T-Task
  16. A-Notifications A-Threads O-Occasional S-Major T-Defect X-Cannot-Reproduce X-Needs-Info
  17. A-Read-Marker O-Uncommon S-Major T-Defect X-Regression
  18. A-Notifications O-Occasional S-Minor T-Defect
    t3chguy
  19. A-Room-List O-Frequent S-Minor T-Defect
  20. A-Notifications O-Frequent S-Minor T-Defect
  21. A-Notifications A-Threads O-Occasional S-Major T-Defect
  22. A-Notifications T-Enhancement Z-GetYourUpdates Z-Labs
  23. A-Notifications O-Occasional S-Major T-Defect
  24. O-Frequent S-Minor T-Defect
  25. A-Notifications O-Frequent S-Major T-Defect
  26. A-Notifications O-Occasional S-Minor T-Defect
  27. A-Notifications O-Occasional S-Major T-Defect
  28. A-Notifications A-Threads O-Uncommon S-Major T-Defect

Tasks not blocked by spec work

Tasks

  1. A-DevTools T-Task
    germain-gg
  2. A-Threads O-Uncommon S-Minor T-Defect
    justjanne
  3. A-DMs A-Notifications O-Frequent S-Major T-Defect Z-MadLittleMods
    germain-gg
  4. A-Notifications O-Frequent S-Major T-Defect Z-MadLittleMods Z-Synapse
  5. A-Notifications O-Frequent S-Major T-Defect
    andybalaam
  6. A-Read-Receipts A-Threads O-Frequent S-Minor T-Defect
    weeman1337
  7. A-Notifications A-Read-Receipts A-Threads O-Uncommon S-Minor T-Defect
    weeman1337
  8. A-Read-Receipts
    andybalaam
  9. A-Read-Marker A-Room-List T-Defect
    andybalaam
  10. T-Defect X-Release-Blocker
    weeman1337
  11. O-Uncommon S-Major T-Defect X-Release-Blocker
    t3chguy
  12. A-Notifications T-Enhancement
    dbkr florianduros
  13. 1 of 2
    A-Notifications A-Threads T-Task
  14. A-Reactions O-Uncommon S-Major T-Defect
    t3chguy

Tasks that are related to or dependent on spec work

We've written the following MSCs to try and address the root causes in a reliable and performant way:

Tasks

  1. A-Notifications T-Task
    andybalaam
  2. A-Threads O-Occasional S-Minor T-Enhancement
    clokep
  3. A-Notifications O-Frequent S-Major T-Task Team: App
    justjanne
  4. A-Threads O-Occasional S-Minor T-Defect
  5. A-Notifications A-Threads O-Frequent S-Major T-Defect
  6. A-Notifications O-Frequent S-Major T-Defect
  7. A-Notifications A-Threads O-Occasional S-Major T-Defect
  8. A-Notifications O-Frequent S-Minor T-Defect
  9. A-Notifications A-Threads O-Occasional S-Major T-Defect
  10. App: web
  11. A-Relations-Endpoint A-Threads O-Occasional S-Minor T-Task roadmap
  12. O-Frequent S-Major T-Defect
    t3chguy
  13. A-Notifications O-Uncommon S-Major T-Defect

Issues that are related but out of scope

Time sheeting

WEB: Stuck notifications

@Johennes
Copy link
Contributor Author

Johennes commented Mar 7, 2023

Descoping #24373 as this doesn't have the same level of severity as the other issues.

@Johennes
Copy link
Contributor Author

As mentioned in the issue description, the remaining bugs will require a spec change. We're currently in the process of drafting the MSC but it'll, unfortunately, be more involved than just a simple client-side bug fix.

@jplatte
Copy link

jplatte commented Mar 17, 2023

I don't think that is true. If you look at the comments in #24595, you will find some from rooms where threads were never used and IME clearing cache often fixes these issues.

@Johennes
Copy link
Contributor Author

We're focusing on the most prominent problems first which are in the interplay of threads and other relations. There may be further issues beyond that.

@AlBundy33
Copy link

I had the same issue in some rooms.
Because I don't participate that much in this rooms I tried to write a test-message and it seems that the stuck notifications are gone.

@Johennes
Copy link
Contributor Author

@8227846265 the threads-related stuck notification issues are blocking on matrix-org/matrix-spec-proposals#3981.

@Johennes
Copy link
Contributor Author

We've completed initial implementations for it last week and are now starting to prepare for testing it. Unfortunately, it'll still take some more time to get this landed due to the nature of the spec process.

@leonardehrenfried
Copy link

@Johennes Thanks for this explanation. If there is an environment where a regular user could help test it, I'd love to hear about it.

@Johennes
Copy link
Contributor Author

Thanks @leonardehrenfried. We will most likely be enabling it on beta.matrix.org as to not impact regular users but you can sign-up for a separate account there if you want to help testing.

@alexander-potemkin
Copy link

@Johennes , thanks for the explanation indeed! How soon it could be expected to land to the self hosted solutions?

@Johennes
Copy link
Contributor Author

I'd expect the initial PRs (matrix-org/synapse#15315 & matrix-org/matrix-js-sdk#3248) to land this week.

@leonardehrenfried
Copy link

leonardehrenfried commented May 2, 2023

For everybody subscribed to this ticket: the backend and frontend PRs have been merged now so I expect a rollout quite soon.

(I'm not an Element/Matrix employee, just an interested user.)

@Johennes Johennes changed the title Epic: Fix stuck notifications Stuck notifications May 4, 2023
@Johennes Johennes changed the title Epic: Fix stuck notifications Stuck notifications May 4, 2023
@Johennes
Copy link
Contributor Author

@8227846265 we have one more fix in the making that doesn't depend on the MSC (#25196) and should land soon. If that one doesn't help your case, we'd be interested to hear more details about your particular error scenario.

@mutantcornholio
Copy link

Also, opening chats for the first time after restarting element, when the messages are loaded, marks some of the loaded ancient messages from threads as unread.

One more thing that I've noticed, is a lot of times the ancient messages are stuff like Alice invited Bob, or Alice changed the power level of Bob from Default to Admin, or Alice made the room invite only. Happens far more often in comparison to active chats, where people actually send messages.

@ajkessel
Copy link

Has progress stalled? I've had the same stuck unread notifications for several months now, both on every production release of Element as well as running bleeding edge from develop.element.io. It does seem to be the same rooms/old messages that consistently show as unread. Is there anything I can do as an end-user to troubleshoot the circumstances, particularly where the issue is reproducible at least for some fraction of the time?

@mutantcornholio
Copy link

Any update?

@JMoVS
Copy link

JMoVS commented Aug 14, 2024

Is there still work on threads becoming unread again and again all the time? Happens quite frequently to me nowadays

@MTRNord
Copy link
Contributor

MTRNord commented Aug 14, 2024

Is there still work on threads becoming unread again and again all the time? Happens quite frequently to me nowadays

I also have this. In foundation room I get a loud ping for a mention in a thread each time a new message is added to the room. Its annoying me.

@dreirund
Copy link

Is there still work on threads becoming unread again and again all the time? Happens quite frequently to me nowadays

I have this, too.

@HelderFSFerreira
Copy link
Contributor

I have more than 30 rooms stuck notifications.

Mark everything a read does not work. Let me know if I should provide any logs.

@JMoVS
Copy link

JMoVS commented Aug 30, 2024

are there more logs required or anything? This issue has been around for more than a full year now and recently progress seems to have stalled

@langleyd
Copy link
Member

Thanks all for the continued patience and feedback on this issue and apologies for it becoming stale.

A lot of progress was made in 2023 and Q12024 on spec, bug fixes, tests and also improvements to the UX on notifications including:

  • Removing threads activity(white dots caused by threaded messages) from the main room list
  • Adding the Thread Activity Center(thread button bottom left) specifically for thread notifications.

You can check out this blog post for some more details on the improvements.

We hope these efforts have brought a somewhat calmer and improved experience for notifications, especially on the main room list.

We are aware there are still problems, particularly in the threads experience. We suspect some of the bugs in threads related to activity indicators(white dots) are caused by a bug in synapse and are pursuing this as a next port of call. (@JMoVS We suspect this is the source of the bug you and others have highlighted above).

Aside from ongoing efforts to fix particular known bugs, we know where we are currently with notifications is not the destination. For example, while we believe the new threads activity centre creates a calmer experience there is a risk of missing out on threaded messages. One thing we would like to do in future is introduce Thread Participation to opt in to notifications for specific threads. This would allow us to improve the UX and value delivered by the Threads Activity Center. However before we tackle another major iteration on Notifications we need to first make some foundational improvements to how both room list/spaces and the timeline work. We’ll update again when we have more to share.

@mutantcornholio
Copy link

Love the progress, but this still doesn't say how are we doing on actually fixing bugs related to notifications. Why are there still notifications reappearing after restart, and what's the timeline on fixing those?

@Valodim
Copy link

Valodim commented Sep 25, 2024

Concrete question on this: Will there be a change in behavior of notifications with sliding sync? Given that clients see much less data overall, I suspect they should be less prone to false notifications from local misinterpretations of room state and loaded backlog. Is that the case, or will this be largely the same?

Overall, the current state of affairs is not ideal, but bearable. If the sliding sync overhaul affects this in a fundamental way, it might be wiser to focus efforts on that.

@wielinde
Copy link

Another source of frustration is UX related: I have about 100 rooms in various spaces. The list does not fit on a single screen. The room list is not sorted by last activity which makes me miss notifications below the first fold. If I miss notifications the tool loses the trust. If Element had a room list ordered by activity then this would constitute a real inbox and resolve a lot of anxiety of missing messages.

BTW I also believe that Threads should ideally have titles so that it becomes easier to jump back. Also threads could then be treated similar to rooms within such inbox. However, one should be able to mark a thread as closed/resolved. Zulip does that part very well.

@vranki
Copy link

vranki commented Sep 26, 2024

Another source of frustration is UX related: I have about 100 rooms in various spaces. The list does not fit on a single screen. The room list is not sorted by last activity which makes me miss notifications below the first fold. If I miss notifications the tool loses the trust. If Element had a room list ordered by activity then this would constitute a real inbox and resolve a lot of anxiety of missing messages.

This is not related to "Stuck notifications" issue so better report and discuss it elsewhere. But anyway right-click on "Rooms" header and you can choose to show rooms with activity first.

@langleyd langleyd assigned langleyd and unassigned andybalaam Sep 26, 2024
@mehlkopf
Copy link

That's my Element on iOS. I do not know what is wrong there, as the situation is much better on my Mac client, but neither mark all as read nor deleting the cache helps clearing these unread notifications. So for me personally, this is still a dumpster fire.
IMG_B459D536F468-1

@nebocamin
Copy link

i dont know where it fits but want to share an observation.
i have a lot stuck unreads, but i dont see them with desktop element under macos, but on linux and it doesnt matter if i use stable or nightly element (2nd load after deleting cache).

@leonardehrenfried
Copy link

An observation: For me clearing all cookies and local storage has managed to get rid of a very persistent notification. Not that it should be necessary though...

@nebocamin
Copy link

An observation: For me clearing all cookies and local storage has managed to get rid of a very persistent notification. Not that it should be necessary though...

i just started a completely new session on a new device using linux and after the second start, all my 8 friends (well known stuck notifications) are here again. in the meantime on macos using element-desktop still clean. i have no clue.

@andybalaam
Copy link
Member

I just cleaned my cache from the Settings menu, and a new stuck unread appeared!

image

Note that "user read up to" matches "Last event", which is surprising.

Rageshake is here: https://github.com/element-hq/element-web-rageshakes/issues/28074

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-Notifications Team: App Z-AirFocus Moving issues from GH to AirFocus purposefully using this tag. Z-Chronic
Projects
None yet
Development

No branches or pull requests