Is your feature request related to a problem? Please describe:
Users have found that distinct counts are very slow and can consume a large amount of memory.
Describe the feature you'd like:
SELECT HLL_DISTINCT_COUNT(A) from TBL;
Either the TiKV coprocessor (Cop) or the TiFlash coprocessor computes a HyperLogLog (HLL) sketch of the distinct keys for each region and group. TiDB then merges the HLL results for each group. Essentially, HLL provides a data structure that supports add, count, and merge. The sketches from each region's Cop task / group can be merged together to form a final result, and this final result has a bounded error margin. https://en.wikipedia.org/wiki/HyperLogLog
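To make the add / count / merge idea concrete, here is a minimal sketch in Python. It is only an illustration of the data structure, not TiDB, TiKV, or TiFlash code: the class name `HyperLogLog`, the precision parameter `p`, the use of SHA-1 as the hash, and the three simulated "regions" are all assumptions made for this example.

```python
import hashlib
import math


class HyperLogLog:
    """Illustrative HyperLogLog sketch supporting add, count and merge."""

    def __init__(self, p=12):
        self.p = p                     # number of bits used to pick a register
        self.m = 1 << p                # number of registers (4096 for p=12)
        self.registers = [0] * self.m

    def add(self, value):
        # Derive a 64-bit hash from the value's string form (assumed hash choice).
        digest = hashlib.sha1(str(value).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        idx = h >> (64 - self.p)                   # top p bits select a register
        rest = h & ((1 << (64 - self.p)) - 1)      # remaining 64 - p bits
        # Rank = position of the leftmost 1-bit within the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        if rank > self.registers[idx]:
            self.registers[idx] = rank

    def count(self):
        # Bias-correction constant; this form is valid for m >= 128.
        alpha = 0.7213 / (1 + 1.079 / self.m)
        raw = alpha * self.m * self.m / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros:
            # Small-range correction: fall back to linear counting.
            return self.m * math.log(self.m / zeros)
        return raw

    def merge(self, other):
        if self.p != other.p:
            raise ValueError("cannot merge sketches with different precision")
        # Merging is an element-wise max, so it is order-independent.
        self.registers = [max(a, b) for a, b in zip(self.registers, other.registers)]


# Simulate three regions whose key ranges overlap, then merge the partial sketches.
region_keys = [range(0, 10000), range(5000, 20000), range(15000, 25000)]
merged = HyperLogLog()
for keys in region_keys:
    partial = HyperLogLog()            # what one region's task would compute
    for k in keys:
        partial.add(k)
    merged.merge(partial)              # what the merging node would do

print(round(merged.count()))           # approximate number of distinct keys
```

Because merging is an element-wise maximum over fixed-size registers, each partial result has the same small size regardless of how many rows it summarizes, and keys that appear in several regions are not double-counted. With `m` registers the typical relative error is roughly 1.04 / sqrt(m), which is about 1.6% for the `p=12` assumed here.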
Describe alternatives you've considered:
No.
Teachability, Documentation, Adoption, Migration Strategy:
A simple new function. We just need to tell users that its result is approximate rather than exact, but that it is a lot faster.