Skip to content

Reference Data Validation

The Reference Data Validation rule ensures that values in a dataset match valid entries from a designated reference dataset. This is typically used to maintain consistency, enforce business rules, and validate the accuracy of values.

Example Usage:

  • Verify that all country codes in the input data exist in a reference country table
  • Validate product categories or region codes against an authoritative list

Configuration Fields

Rule-Specific Configuration

Field NameDescriptionRequiredData Type
Reference TableThe dataset or table containing valid reference valuesYesString
Reference ColumnThe column within the reference table used for comparisonYesString

Success Criteria Configuration

Field NameDescriptionRequiredOptionsDefault
OperatorComparison operation for the result countYesGreaterThan, LessThan, EqualTo, Between
Threshold ValueValue for comparison (for non-Between operators)ConditionalNumber-
Threshold MinMinimum value (for Between operator)ConditionalNumber-
Threshold MaxMaximum value (for Between operator)ConditionalNumber-
Threshold TypeWhether threshold represents an absolute count or percentageNoAbsolute Count / PercentageAbsolute Count
Allow NullsWhether null values should be treated as validNotrue / falsefalse
Check For MatchWhether to validate for match or negation of the lookup conditionNotrue / falsetrue

Sample Input Data

IDNameCountry CodeOther Country Code
1AliceUSNULL
2BobCABR
3CharlieFRUS
4DavidNULLNZ

Sample Reference Table

Table: CountryDetails

Country CodeCountry Name
USUnited States
CACanada
FRFrance
GBUnited Kingdom

Sample Configurations

Example: Country Code Validation

Configuration FieldValue
Reference TableCountryDetails
Reference ColumnCountry Code
OperatorEqualTo
Threshold Value3
Threshold TypeAbsolute Count
Allow Nullsfalse
Check For Matchfalse

Explanation:
Validates that exactly 3 values in the Country Code column do not exist in the reference table (because Check For Match is false). This allows validating for non-matches (e.g., to find invalid entries).


Sample Output

Column NameRule NameSuccess CountFailure CountNull CountWithin Threshold
Country CodeReference Data Validation311Yes
Other Country CodeReference Data Validation131No