Skip to content


Dataset generators produce synthetic datasets of a user-specified shape and dimensionality. Synthetic data is useful for a number of tasks including experimentation, testing, benchmarking, and demonstration purposes.

Generate a Dataset#

To generate a Dataset object with n records:

public generate(int $n) : Dataset

use Rubix\ML\Datasets\Generators\HalfMoon;

$generator = new HalfMoon(0.0, 0.0);

$dataset = $generator->generate(1000);