Generate dataset blobs

../../../../_images/dataset_blobs.svg

Generates an artificial dataset useful for testing clustering algorithms

Configuration:

  • n_samples

    The total number of points equally divided among clusters.

  • n_features

    The number of features for each sample.

  • centers

    Number of clusters.

  • cluster_std

    Standard deviation of the clusters.

  • center_min

    Smallest allowed coordinate for the generated centers

  • center_max

    Largest allowed coordinate for the generated centers

  • shuffle

    Shuffle datapoints (otherwise given in cluster order)

Input ports:

Output ports:
X : table
X
Y : table
Y
n_samples (n_samples)
The total number of points equally divided among clusters.
n_features (n_features)
The number of features for each sample.
centers (centers)
Number of clusters.
cluster_std (cluster_std)
Standard deviation of the clusters.
center_min (center_min)
Smallest allowed coordinate for the generated centers
center_max (center_max)
Largest allowed coordinate for the generated centers
shuffle (shuffle)
Shuffle datapoints (otherwise given in cluster order)
class node_io.MakeBlobs[source]