Sample Data Subsetting Processor

Modified on Tue, 30 Nov 2021 at 03:17 PM

Overview

The Sample Data Subsetting Processor draws a random subset of the input data set. It may be used for splitting the data randomly in training and test data for forecasting evaluation (with e.g. Decision Tree Classification Forecast ProcessorImproved Linear Regression (Forecasting) Processor).


Input

The processor can operate on any dataset.


Configuration



Output

A table with random observations pulled from the input data set according to the configuration.


Example

Example Input



Workflow



Example Configuration


Result

As configured, the output data set contains six rows. Using the seed 10, a random sample consisting of rows 1 and 3 to 7 of the original data was chosen. Every time the workflow gets executed using this configuration and input data, the same result will appear.



Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select atleast one of the reasons

Feedback sent

We appreciate your effort and will try to fix the article