Generate dataset blobs
Generates an artificial dataset of point clusters ("blobs"), useful for testing clustering algorithms.
Ports:
Outputs:
X: table
X
Y: table
Y
Configuration:
- n_samples
- The total number of points, divided equally among the clusters.
- n_features
- The number of features for each sample.
- centers
- Number of clusters.
- cluster_std
- Standard deviation of the clusters.
- center_min
- Smallest allowed coordinate for the generated centers.
- center_max
- Largest allowed coordinate for the generated centers.
- shuffle
- Shuffle the data points (otherwise they are given in cluster order).
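
To make the interaction of these options concrete, the following is a minimal standard-library sketch of how such a generator could work. It is an illustration under stated assumptions, not the node's actual implementation: the function name `generate_blobs`, the `seed` argument, the default values, the uniform placement of centers, and the reading of X as point coordinates and Y as cluster labels are all hypothetical. The option names resemble the parameters of scikit-learn's `make_blobs`, but whether the node wraps that function is not stated here.

```python
import random


def generate_blobs(n_samples=100, n_features=2, centers=3, cluster_std=1.0,
                   center_min=-10.0, center_max=10.0, shuffle=True, seed=None):
    """Hypothetical stand-in for the node; returns (X, Y) as plain lists."""
    rng = random.Random(seed)

    # Assumption: each center coordinate is drawn uniformly
    # from [center_min, center_max].
    center_points = [
        [rng.uniform(center_min, center_max) for _ in range(n_features)]
        for _ in range(centers)
    ]

    X, Y = [], []
    for label, center in enumerate(center_points):
        # Split n_samples as evenly as possible; the first clusters
        # absorb any remainder.
        count = n_samples // centers + (1 if label < n_samples % centers else 0)
        for _ in range(count):
            # Gaussian noise with standard deviation cluster_std around the center.
            X.append([rng.gauss(c, cluster_std) for c in center])
            Y.append(label)

    if shuffle:
        # Permute X and Y with the same order so labels stay aligned with points.
        order = list(range(len(X)))
        rng.shuffle(order)
        X = [X[i] for i in order]
        Y = [Y[i] for i in order]
    return X, Y


# With shuffle disabled, points come out grouped by cluster.
X, Y = generate_blobs(n_samples=10, n_features=2, centers=3, shuffle=False, seed=0)
```

With `shuffle=False` the labels in `Y` appear in cluster order, which is convenient for inspection; a clustering test would normally keep shuffling enabled so that row order carries no information about cluster membership.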