Region stall never recover #4475
Labels
C-bug
Category Bugs
Comments
Closed
I added some logs:
When a flush finishes, the stalled requests are processed before the memtable is released (I guess the flush task is what releases the memtable). At that point the global write buffer size is still high, so we may block the write requests again. But when all writers are stalled by the current worker and no other workers are handling write requests, the current worker may stall requests forever.
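To make the ordering problem concrete, here is a minimal Python sketch. It is a toy model under stated assumptions, not the actual storage engine code: `WriteBuffer`, `Worker`, `on_flush_finished`, and the `release_first` flag are hypothetical names invented for illustration, and the byte sizes are arbitrary. It only shows how retrying stalled requests before the flushed memory is subtracted from the global usage can leave requests stalled with nothing left to wake them up.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class WriteBuffer:
    """Hypothetical global write buffer accounting shared by workers."""
    limit: int
    used: int = 0

    def should_stall(self) -> bool:
        # Writes are stalled while usage is at or above the limit.
        return self.used >= self.limit


@dataclass
class Worker:
    """Hypothetical region worker that either applies or stalls writes."""
    buffer: WriteBuffer
    stalled: deque = field(default_factory=deque)
    written: list = field(default_factory=list)

    def handle_write(self, req: str, size: int) -> None:
        if self.buffer.should_stall():
            self.stalled.append((req, size))
        else:
            self.buffer.used += size
            self.written.append(req)

    def retry_stalled(self) -> None:
        # Take the current queue and re-submit each request once.
        pending, self.stalled = self.stalled, deque()
        for req, size in pending:
            self.handle_write(req, size)

    def on_flush_finished(self, flushed_bytes: int, release_first: bool) -> None:
        if release_first:
            # Assumed alternative ordering: free the memory, then retry.
            self.buffer.used -= flushed_bytes
            self.retry_stalled()
        else:
            # Ordering described in the comment above: retry while the
            # flushed memtable is still counted, then release it.
            self.retry_stalled()
            self.buffer.used -= flushed_bytes


def simulate(release_first: bool):
    buffer = WriteBuffer(limit=100)
    worker = Worker(buffer)
    worker.handle_write("a", 100)  # fills the buffer
    worker.handle_write("b", 10)   # stalled because the buffer is full
    worker.on_flush_finished(flushed_bytes=100, release_first=release_first)
    return len(worker.stalled), worker.written


if __name__ == "__main__":
    print(simulate(release_first=False))  # (1, ['a'])
    print(simulate(release_first=True))   # (0, ['a', 'b'])
```

In the first run, request "b" is stalled again during the retry and stays queued even though the buffer is empty afterwards. In this model no other worker or later flush exists to trigger another retry, which mirrors the "stall forever" case described above.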
What type of bug is this?
Locking issue, Performance issue
What subsystems are affected?
Storage Engine
Minimal reproduce step
Ingest a large amount of data into partitioned tables with multiple regions.
What did you expect to see?
Data ingestion is expected to recover once the flush finishes.
What did you see instead?
Region write stalls forever, and it can be observed from the greptime_mito_write_stall_total gauge.
What operating system did you use?
NA
What version of GreptimeDB did you use?
0.9.0
Relevant log output and stack trace
No response