Best options for efficient deduplication? #63

Answered by mhx
M-Gonzalo asked this question in Q&A
Interesting use case, thanks for the feedback!

My gut feeling when reading this was that it's simply because the lookback buffer is too small that mkdwarfs isn't able to pick up the redundancies. The rationale behind keeping the lookback buffer size limited is for a more typical use case: say you have a file that's relatively small compared to the configured filesystem block size. Even if you were able to assemble that file mostly from chunks split across 20 different filesystem blocks, you'd rarely want to, because you'd have to decompress 20 filesystem blocks in order to re-assemble the file when mounting the filesystem image.
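The trade-off above can be made concrete with a toy model. This is only a sketch, not how mkdwarfs represents files internally: the `(block, offset, size)` chunk tuples and both helper functions are hypothetical names for illustration, and it assumes that reading any part of a block means decompressing the whole block.

```python
# Toy model (hypothetical, not the DwarFS data structures): a file is a list
# of chunks, each described as (block_index, offset_in_block, size_in_bytes).

def blocks_to_decompress(chunks):
    """Count the distinct filesystem blocks a full read of the file touches."""
    return len({block for block, _offset, _size in chunks})

def read_amplification(chunks, block_size):
    """Bytes decompressed per byte of file data, assuming whole-block decompression."""
    file_size = sum(size for _block, _offset, size in chunks)
    return blocks_to_decompress(chunks) * block_size / file_size

KIB = 1024
BLOCK_SIZE = 16 * 1024 * KIB  # assumed 16 MiB filesystem blocks

# A 1280 KiB file deduplicated into 20 chunks spread over 20 different blocks.
scattered = [(block, 0, 64 * KIB) for block in range(20)]

# The same file stored as one contiguous chunk in a single block.
contiguous = [(0, 0, 1280 * KIB)]

print(blocks_to_decompress(scattered), read_amplification(scattered, BLOCK_SIZE))
print(blocks_to_decompress(contiguous), read_amplification(contiguous, BLOCK_SIZE))
```

Under these assumptions the scattered layout touches 20 blocks and decompresses 256 bytes for every byte of file data, while the contiguous layout touches one block at a ratio of 12.8. That gap is the cost a limited lookback buffer is meant to avoid for small files.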

However, your use case is just begging for a) large filesyst…

Answer selected by M-Gonzalo