Skip to content

Split and Unfold

Description

The Split and Unfold activity transforms a single text column containing delimited values into multiple binary (1/0) columns. Each unique value from the original column becomes a separate column, where:

  • 1 indicates that the value was present in the original field for that row.
  • 0 indicates absence.

This is useful when you have multi-tag fields (e.g., “red, blue, green”) and need to perform one-hot encoding or binary expansion.

If the separator is left empty, each character is treated as a separate value.

Example: Given a column named Tags with values like "A, B, C" and "A, C", this activity creates new columns Tags_A, Tags_B, and Tags_C, and marks them with 1 or 0 based on whether that tag was present.

Input

  • Data – Required
    Requires tabular structured input with at least one column containing delimited string values.

Output

Output TypeFormatDescription
DataTabularInput with binary columns replacing the original tag column.

Configuration Fields

Field NameDescription
Column To SplitThe name of the input column to split and unfold. Required.
SeparatorThe delimiter used to split the string (e.g., ,, ;, space). Optional.

Sample Input

IDTags
1A, B
2B, C
3A, C

Sample Configuration

FieldValue
ColumnToSplitTags
Separator,

Sample Output

IDTags_ATags_BTags_C
1110
2011
3101