Split Dataset (Experimental)

../../../../_images/split_image_ds.svg

Splits input dataset into a training and a test dataset in a lazy-loaded manner. This means it only adds the indices attribute in the dataset structure, indicating the indices of the dataset that belong to the Training or Test datasets without actually splitting the dataset into two. As a result, you will not see the actual splits in the node’s output ports.

Definition

Input ports

input_ds
Type: dataset
Description: Input dataset

Output ports

train_ds
Type: dataset
Description: Training dataset
test_ds
Type: dataset
Description: Test dataset

Configuration

Create image labels from filepaths (img_path). (labels)

Specify expression to create label column for image datasets or column name for tabular datasets

Choose an output column (output_col)

Label column

Stratify (stratify)

Stratify data using Y as class labels

Test set proportion (test_size)

Test size for train/test split

Examples

The node can be found in:

Implementation

class node_splitdataset.SplitDataset[source]