Naive Bayes#

Categorical Naive Bayes is a probability-based classifier that uses counting and Bayes' Theorem to derive the probabilities of a class given a sample of categorical features. The term naive refers to the fact that Naive Bayes treats each feature as if it was independent of the others even though this is usually not the case in real life.

Note

Each partial train has the overhead of recomputing the probability mass function for each feature per class. As such, it is better to train with fewer but larger training sets.

Interfaces: Estimator, Learner, Online, Probabilistic, Persistable

Data Type Compatibility: Categorical

Parameters#

#	Name	Default	Type	Description
1	priors	null	array	The class prior probabilities as an associative array with class labels as keys and their prior probabilities as values totalling 1. If null, then priors will automatically be computed from the training data.
2	smoothing	1.0	float	The amount of Laplace smoothing added to the probabilities.

Example#

use Rubix\ML\Classifiers\NaiveBayes;

$estimator = new NaiveBayes(2.5, [
    'spam' => 0.3,
    'not spam' => 0.7,
]);

Additional Methods#

Return the class prior probabilities:

public priors() : float[]|null

Return the counts for each category per class:

public counts() : array[]|null

Last update: 2021-03-27