Difference between DBSCAN and [HDBSCAN
- Simply put, HDBSCAN is a system that automatically adjusts the value of DBSCAN eps
- Experiment with real data to observe behavior
https://pberba.github.io/stats/2020/01/17/hdbscan/ Illustration of eom(Excess of Mass), the default cluster selection criteria for HDBSCAN
2024-11-14
- lower left
- HDBSCAN recognizes the lower left âclearly separated clusterâ as a whole cluster, regardless of the parameters.
- DBSCAN ignores the end part as noise, gradually decreasing in size as the parameters change, and finally judging all of it as noise.
- right
- Thereâs not much difference in behavior, but HDBSCANâs are more likely to judge the surrounding noise as part of the cluster and get involved.
DBSCAN â scikit-learn 1.5.2 documentation DBSCAN Revisited, Revisited: Why and How You Should (Still) Use DBSCAN
HDBSCAN â scikit-learn 1.5.2 documentation https://note.com/navy_azalea/n/na859d7ab6ab3
SpectralClustering â scikit-learn 1.5.2 documentation
This page is auto-translated from /nishio/DBSCANă¨HDBSCANăŽéă using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. Iâm very happy to spread my thought to non-Japanese readers.