Overview
The Binarization Processor generates multiple binarized columns from one source column for each unique value in it. Each binarized column will contain a 1 in a row when the corresponding nominal value is present in the source column, otherwise a 0 is entered.
Input
The binarization can be applied to any nominal column. However it does not always make sense to create a dummy variable.
Configuration
Selected Column Name: Select the source column for the binarization
Prefix: Specify a prefix for the newly created binarized columns
Note that for the prefix only text is allowed, no special characters or spaces
Output
The binarized columns created via the binarization will contain either the values 1 or 0. In case of a 1, the value is present in the source column. Otherwise the value 0 is displayed. The new columns can be used as a dummy variable for further analyses.
Example
Input
As input, a dataset with one column containing country names is used.
Workflow
Configuration
As an example configuration, the column "Country" is selected. Also the binarized column names should have the prefix "Binarized".
Result
Was this article helpful?
That’s Great!
Thank you for your feedback
Sorry! We couldn't be helpful
Thank you for your feedback
Feedback sent
We appreciate your effort and will try to fix the article