Split URL
Description
This activity splits each URL in the specified column into its components (protocol, host, port, path, query, and fragment) and writes each component to its own column.
Input
Data only
Output
Transformed Data
Configuration Fields
- URL Column: Specifies the column containing URLs.
- Protocol Column Name: Defines the column for storing URL protocols.
- Host Column Name: Defines the column for storing host names.
- Port Column Name: Defines the column for storing ports.
- Path Column Name: Defines the column for storing URL paths.
- Query Column Name: Defines the column for storing URL query parameters.
- Fragment Column Name: Defines the column for storing URL fragments.
Sample Input
employee_id | name | url |
---|---|---|
E001 | John Doe | https://company.com:8080/employee?id=E001&name=John+Doe&department=Sales#profile |
E002 | Marie Dupont | http://marketing.com/employee?id=E002&name=Marie+Dupont&department=Marketing |
E003 | Carlos Gómez | ftp://fileserver.com:21/download?id=E003&name=Carlos+Gómez&department=Engineering |
Sample Configuration
Sample Output
employee_id | name | url | Protocol | Host | Port | Path | Query | Fragment |
---|---|---|---|---|---|---|---|---|
E001 | John Doe | https://company.com:8080/employee?id=E001&name=John+Doe&department=Sales#profile | https | company.com | 8080 | /employee | id=E001&name=John+Doe&department=Sales | profile |
E002 | Marie Dupont | http://marketing.com/employee?id=E002&name=Marie+Dupont&department=Marketing | http | marketing.com | | /employee | id=E002&name=Marie+Dupont&department=Marketing | |
E003 | Carlos Gómez | ftp://fileserver.com:21/download?id=E003&name=Carlos+Gómez&department=Engineering | ftp | fileserver.com | 21 | /download | id=E003&name=Carlos+Gómez&department=Engineering | |
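
The mapping from a URL to the six output columns can be approximated outside the tool for quick checks. The following is a minimal sketch using Python's standard `urllib.parse.urlsplit`, not the activity's actual implementation, which this page does not describe. The function name `split_url` is hypothetical, the dictionary keys simply mirror the column names used in the sample output, and returning an empty string for a missing part is an assumption based on the blank cells in the sample.

```python
from urllib.parse import urlsplit


def split_url(url: str) -> dict:
    """Split a URL into the six parts shown in the sample output.

    Hypothetical helper: missing parts are returned as empty strings
    (an assumption based on the blank cells in the sample table).
    """
    parts = urlsplit(url)
    return {
        "Protocol": parts.scheme,
        # urlsplit lowercases the host name; the activity may or may not.
        "Host": parts.hostname or "",
        "Port": "" if parts.port is None else str(parts.port),
        "Path": parts.path,
        "Query": parts.query,
        "Fragment": parts.fragment,
    }


if __name__ == "__main__":
    sample_urls = [
        "https://company.com:8080/employee?id=E001&name=John+Doe&department=Sales#profile",
        "http://marketing.com/employee?id=E002&name=Marie+Dupont&department=Marketing",
        "ftp://fileserver.com:21/download?id=E003&name=Carlos+Gómez&department=Engineering",
    ]
    for url in sample_urls:
        print(split_url(url))
```

For the three sample URLs this yields the same values as the Sample Output table, with empty strings where the table shows blank Port or Fragment cells. How the activity handles malformed URLs is not stated here, so the sketch makes no attempt to reproduce that behavior.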