Generate dataset blobs
Generates an artificial dataset of point clusters (blobs), useful for testing clustering algorithms.
Configuration:
n_samples
Total number of points, divided equally among the clusters.
n_features
The number of features for each sample.
centers
Number of clusters.
cluster_std
Standard deviation of the clusters.
center_min
Smallest allowed coordinate for the generated centers.
center_max
Largest allowed coordinate for the generated centers.
shuffle
Shuffle the data points; otherwise they are returned in cluster order.
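To make the interaction of these settings concrete, the following is a minimal standard-library sketch of how such a dataset could be produced. It is not the node's actual implementation: the function name `generate_blobs`, the default values, the `seed` argument, and the way a remainder is distributed when `n_samples` is not divisible by `centers` are all assumptions made for illustration. The parameter names resemble those of scikit-learn's `make_blobs`, but whether the node relies on that function is not stated here.

```python
import random


def generate_blobs(n_samples=100, n_features=2, centers=3, cluster_std=1.0,
                   center_min=-10.0, center_max=10.0, shuffle=True, seed=None):
    """Hypothetical stand-in for the node; returns (X, Y) as plain lists."""
    rng = random.Random(seed)

    # Draw each cluster center uniformly inside [center_min, center_max].
    center_points = [
        [rng.uniform(center_min, center_max) for _ in range(n_features)]
        for _ in range(centers)
    ]

    # Split n_samples as evenly as possible (assumption: the first
    # clusters absorb any remainder).
    base, extra = divmod(n_samples, centers)
    counts = [base + (1 if i < extra else 0) for i in range(centers)]

    X, Y = [], []
    for label, (center, count) in enumerate(zip(center_points, counts)):
        for _ in range(count):
            # Gaussian noise with standard deviation cluster_std per feature.
            X.append([rng.gauss(c, cluster_std) for c in center])
            Y.append(label)

    if shuffle:
        # Permute X and Y with the same order so rows stay aligned.
        order = list(range(len(X)))
        rng.shuffle(order)
        X = [X[i] for i in order]
        Y = [Y[i] for i in order]
    return X, Y
```

With `shuffle` disabled, the labels in `Y` come out grouped by cluster (all of cluster 0, then cluster 1, and so on), which is the "cluster order" referred to above.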
Input ports:
- None
Output ports:
- X : table
- X
- Y : table
- Y