[source]

DBSCAN#

Density-Based Spatial Clustering of Applications with Noise is a clustering algorithm able to find non-linearly separable and arbitrarily-shaped clusters given a radius and density constraint. In addition, DBSCAN also has the ability to mark outliers as noise and thus can be used as a quasi anomaly detector.

Note: Noise samples are assigned the cluster number -1.

Interfaces: Estimator

Data Type Compatibility: Continuous

Parameters#

# Param Default Type Description
1 radius 0.5 float The maximum distance between two points to be considered neighbors.
2 min density 5 int The minimum number of points within radius of each other to form a cluster.
3 tree BallTree object The spatial tree used to run range searches.

Additional Methods#

This estimator does not have any additional methods.

Example#

use Rubix\ML\Clusterers\DBSCAN;
use Rubix\ML\Graph\Trees\BallTree;
use Rubix\ML\Kernels\Distance\Diagonal;

$estimator = new DBSCAN(4.0, 5, new BallTree(20, new Diagonal()));

References#

  • M. Ester et al. (1996). A Densty-Based Algorithm for Discovering Clusters.