Hacker News new | past | comments | ask | show | jobs | submit login
A Charming Algorithm for Count-Distinct (justinjaffray.com)
9 points by foldU on Jan 26, 2023 | hide | past | favorite | 2 comments



> I actually had no idea it was possible to do efficient count-distinct without hashes, but it turns out it is!

Since the final algorithm maintains a Set, doesn't it still perform hashes?


This implementation incidentally uses a hash table, but any set-like data structure would work. This is different from things like HyperLogLog that actually depend on the hash values themselves in order to work.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: