DBSCAN: setting a limit on maximum cluster span
As I understand DBSCAN, you can specify an epsilon of, say, 100 meters
and, because DBSCAN builds clusters from density-reachability (chains of
neighboring core points) rather than only direct density-reachability, end
up with a cluster in which the maximum distance between any two points is
> 100 meters. In a more extreme case, it seems you could set an epsilon of
100 meters and end up with a cluster spanning 1 kilometer: see [2][6] in
this array of images from scikit-learn for an example of when that might
occur. (I'm more than willing to be told I'm a total idiot and am
misunderstanding DBSCAN if that's what's happening here.)
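To make the chaining concern concrete, here is a minimal sketch in plain Python. It is a simplified DBSCAN written only for illustration, not scikit-learn's implementation; the function name `dbscan_1d`, the 50-meter spacing, and `min_samples=3` are all assumptions I picked for the example. The points sit on a line, so the span of a cluster is simply its maximum minus its minimum.

```python
from collections import deque

def dbscan_1d(xs, eps, min_samples):
    """Illustrative DBSCAN on 1-D coordinates; returns one label per point (-1 = noise)."""
    n = len(xs)
    # eps-neighbourhood of each point (the point itself is included)
    neighbours = [[j for j in range(n) if abs(xs[i] - xs[j]) <= eps] for i in range(n)]
    is_core = [len(nb) >= min_samples for nb in neighbours]
    labels = [-1] * n
    cluster = 0
    for i in range(n):
        if labels[i] != -1 or not is_core[i]:
            continue  # already assigned, or not a core point to start from
        labels[i] = cluster
        queue = deque([i])
        while queue:
            p = queue.popleft()
            if not is_core[p]:
                continue  # border points join a cluster but do not expand it
            for q in neighbours[p]:
                if labels[q] == -1:
                    labels[q] = cluster
                    queue.append(q)
        cluster += 1
    return labels

# Hypothetical data: 21 points spaced 50 m apart, from 0 m to 1000 m.
xs = [50.0 * k for k in range(21)]
labels = dbscan_1d(xs, eps=100.0, min_samples=3)

members = [x for x, lab in zip(xs, labels) if lab == 0]
span = max(members) - min(members)
print(len(set(labels)), span)  # number of distinct labels, span of cluster 0 in meters
```

With these inputs every point is a core point and each one is within epsilon of the next, so the whole chain ends up in a single cluster whose span is 1000 meters, ten times epsilon. Epsilon only bounds the step between neighboring points, not the overall extent of the cluster.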
Is there an algorithm that is density-based like DBSCAN but takes into
account some kind of thresholding for the maximum distance between any two
points in a cluster?