Execute Data Quality

Description

This activity profiles the specified data source and evaluates the quality of its data. For each column it reports metrics such as the column type, null count, distinct values, and unique count. The output is returned as a structured JSON object.
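As a rough illustration of the kind of per-column metrics described above, the following Python sketch profiles a single column. It is not the activity's implementation: `profile_column` is a hypothetical helper, and the definitions used here (distinct = number of different non-null values, unique = values that occur exactly once, type = the Python type name) are assumptions, since the exact definitions are not stated in this page.

```python
from collections import Counter


def profile_column(name, values):
    """Hypothetical helper: compute simple quality metrics for one column."""
    non_null = [v for v in values if v is not None]
    counts = Counter(non_null)

    # Assumed type rule: the single Python type seen, "Mixed" if several,
    # "Unknown" if the column holds only nulls.
    type_names = sorted({type(v).__name__ for v in non_null})
    if len(type_names) == 1:
        column_type = type_names[0]
    elif type_names:
        column_type = "Mixed"
    else:
        column_type = "Unknown"

    return {
        "ColumnName": name,
        "ColumnType": column_type,
        "NullCount": len(values) - len(non_null),
        # Assumption: distinct = number of different non-null values.
        "DistinctCount": len(counts),
        # Assumption: unique = values that occur exactly once.
        "UniqueCount": sum(1 for c in counts.values() if c == 1),
    }


print(profile_column("review", ["good", "bad", "good", None]))
```

The key names mirror those in the activity's output, but the values produced by this sketch may differ from what the activity reports if its definitions differ.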

Input

Datasource

Output

Data

Configuration Fields

  • DataSourceIds: Specifies the data source(s) to be assessed for data quality.
  • UseSampleSize: When enabled, the analysis runs on a subset of the data instead of the full dataset.
  • SampleSize: Defines the number of rows to include in the data quality analysis.
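The interaction between UseSampleSize and SampleSize can be sketched as follows. This is a minimal illustration only: the configuration is shown as a Python dictionary with illustrative values, `select_rows` is a hypothetical helper, and taking the first SampleSize rows is an assumption, since this page does not say how the activity chooses its sample.

```python
# Illustrative configuration; the values are made up for this example.
config = {
    "DataSourceIds": [1],
    "UseSampleSize": True,
    "SampleSize": 100,
}


def select_rows(rows, config):
    """Hypothetical helper: pick the rows that would be analyzed."""
    if config.get("UseSampleSize"):
        # Assumption: the sample is simply the first SampleSize rows.
        return rows[: config["SampleSize"]]
    # Sampling disabled: analyze the full dataset.
    return rows


rows = list(range(500))
print(len(select_rows(rows, config)))
```

With UseSampleSize set to False, the same call would return all rows, and SampleSize would have no effect in this sketch.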

Sample Input

Not applicable

Sample Configuration

[Image: sample configuration]

Sample Output

Id | DataSourceId | MetaData
0 | 1434 | [{"UpdatedOn":"2025-02-03T11:14:28.8140038Z","Data":[{"TableName":"feedback","ColumnsProfile":[{"ColumnName":"user_column","ColumnType":"String","DistinctCount":500,"DistinctValues":null,"NullCount":0,"UniqueCount":500},{"ColumnName":"review","ColumnType":"String","DistinctCount":444,"DistinctValues":null,"NullCount":5,"UniqueCount":439}…]}]}]
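A downstream step could read the MetaData value with any JSON parser. The Python sketch below lists the columns that contain nulls. The JSON string is a trimmed copy of the sample above containing only the two columns shown (the elided part is left out), and `columns_with_nulls` is a hypothetical helper, not part of the activity.

```python
import json

# Trimmed copy of the sample MetaData, limited to the two columns shown.
metadata_json = """
[{"UpdatedOn": "2025-02-03T11:14:28.8140038Z",
  "Data": [{"TableName": "feedback",
            "ColumnsProfile": [
              {"ColumnName": "user_column", "ColumnType": "String",
               "DistinctCount": 500, "DistinctValues": null,
               "NullCount": 0, "UniqueCount": 500},
              {"ColumnName": "review", "ColumnType": "String",
               "DistinctCount": 444, "DistinctValues": null,
               "NullCount": 5, "UniqueCount": 439}
            ]}]}]
"""


def columns_with_nulls(metadata_text):
    """Hypothetical helper: return (table, column, null count) for columns with nulls."""
    results = []
    for entry in json.loads(metadata_text):
        for table in entry["Data"]:
            for column in table["ColumnsProfile"]:
                if column["NullCount"] > 0:
                    results.append(
                        (table["TableName"], column["ColumnName"], column["NullCount"])
                    )
    return results


print(columns_with_nulls(metadata_json))  # [('feedback', 'review', 5)]
```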