Execute Data Quality
Description
This activity processes the specified data source and evaluates the quality of the data, including metrics such as null values, column types, distinct values, and unique counts. The output is provided as a structured JSON object.
Input
Datasource
output
Data
Configuration Fields
- DataSourceIds Specifies the data source(s) to be assessed for data quality.
- UseSampleSize Enables the option to analyze a subset of the data instead of the full dataset.
- SampleSize Defines the number of rows to include in the data quality analysis.
Sample Input
Not applicable
Sample Configuration
Sample Output
Id | DataSourceId | MetaData |
---|---|---|
0 | 1434 | [{“UpdatedOn”:“2025-02-03T11:14:28.8140038Z”,“Data”:[{“TableName”:“feedback”,“ColumnsProfile”:[{“ColumnName”:“user_column”,“ColumnType”:“String”,“DistinctCount”:500,“DistinctValues”:null,“NullCount”:0,“UniqueCount”:500},{“ColumnName”:“review”,“ColumnType”:“String”,“DistinctCount”:444,“DistinctValues”:null,“NullCount”:5,“UniqueCount”:439}…]}]}] |