Monitor bluestore fragmentation score #1012

Open
okozachenko1203 opened this issue Mar 11, 2024 · 2 comments · May be fixed by #1951

@okozachenko1203

problem

Last weekend we hit an issue where OSDs kept flapping up and down. The cause was excessive fragmentation. As BlueStore operates, the free space on the underlying storage becomes fragmented. This is normal and unavoidable, but excessive fragmentation causes slowdowns.

solution

For now, expose the fragmentation score as a Prometheus metric and configure alert rules on it.

ref: https://docs.ceph.com/en/latest/rados/operations/health-checks/?highlight=backfillfull%20ratio#bluestore-fragmentation
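To pick alert thresholds, it may help to map the score onto severity bands. The sketch below assumes the bands given in the Ceph documentation referenced above as I recall them (up to 0.4 tiny, 0.4 to 0.7 small, 0.7 to 0.9 considerable but safe, 0.9 to 1.0 severe); please verify them against the docs for your Ceph release. The names `classify` and `needs_alert`, and the default alert threshold, are hypothetical.

```python
import bisect

# Band boundaries and labels, assumed from the Ceph docs; verify for your release.
_BOUNDS = [0.4, 0.7, 0.9]
_LABELS = ["tiny", "small", "considerable", "severe"]


def classify(score: float) -> str:
    """Return the severity band for a fragmentation score in [0, 1]."""
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"score out of range: {score}")
    # bisect_right places a score equal to a boundary into the higher band.
    return _LABELS[bisect.bisect_right(_BOUNDS, score)]


def needs_alert(score: float, threshold: float = 0.9) -> bool:
    """Hypothetical alert condition: fire at or above the threshold."""
    return score >= threshold
```

An alert rule could start at the "severe" boundary and be tightened later if OSDs turn out to degrade earlier.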

@ricolin ricolin self-assigned this Mar 11, 2024
@ricolin

ricolin commented Mar 11, 2024

(Hint: run a simple tool that dumps all of that data, save the output to a file, and use the node_exporter textfile collector to parse that file.)
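A minimal sketch of that approach, under some assumptions: it queries each local OSD with `ceph daemon osd.N bluestore allocator score block`, expects JSON containing a `fragmentation_rating` key, and writes a `.prom` file for the textfile collector. The metric name, the `ceph_daemon` label, and the function names are hypothetical, and the output shape of the command should be checked on a real cluster.

```python
import json
import os
import subprocess
import tempfile
from typing import Dict

METRIC = "ceph_bluestore_fragmentation_score"  # hypothetical metric name


def parse_score(raw_json: str) -> float:
    """Extract the rating from the command's JSON output (assumed key name)."""
    return float(json.loads(raw_json)["fragmentation_rating"])


def query_score(osd_id: int) -> float:
    """Ask a local OSD for its score over the admin socket (must run on the OSD host)."""
    result = subprocess.run(
        ["ceph", "daemon", f"osd.{osd_id}", "bluestore", "allocator", "score", "block"],
        check=True, capture_output=True, text=True,
    )
    return parse_score(result.stdout)


def render(scores: Dict[int, float]) -> str:
    """Format the scores in the Prometheus text exposition format."""
    lines = [
        f"# HELP {METRIC} BlueStore allocator fragmentation score (0 to 1).",
        f"# TYPE {METRIC} gauge",
    ]
    for osd_id in sorted(scores):
        lines.append(f'{METRIC}{{ceph_daemon="osd.{osd_id}"}} {scores[osd_id]}')
    return "\n".join(lines) + "\n"


def write_atomically(path: str, text: str) -> None:
    """Write via a temp file and rename so the collector never reads a partial file."""
    directory = os.path.dirname(path) or "."
    fd, tmp = tempfile.mkstemp(dir=directory, suffix=".tmp")
    with os.fdopen(fd, "w") as fh:
        fh.write(text)
    os.chmod(tmp, 0o644)  # mkstemp defaults to 0600; the exporter needs read access
    os.replace(tmp, path)
```

Run from cron or a systemd timer, something like `write_atomically(path, render({i: query_score(i) for i in osd_ids}))` would refresh the file, where `path` sits in the directory passed to node_exporter via `--collector.textfile.directory` and `osd_ids` lists the OSDs on that host.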

@ricolin

ricolin commented Mar 11, 2024

So Ceph actually has no built-in trigger for this: https://tracker.ceph.com/issues/55474
