Dataset generators produce synthetic datasets of a user-specified shape and dimensionality. Synthetic data is useful for a number of tasks including experimentation, testing, benchmarking, and demonstration purposes.
Generate a Dataset#
To generate a Dataset object with n records:
public generate(int $n) : Dataset
use Rubix\ML\Datasets\Generators\HalfMoon; $generator = new HalfMoon(); $dataset = $generator->generate(1000);
Return the dimensionality of the samples produced by the generator:
public dimensions() : int