Formula Integrity
The formula integrity check in data quality refers to the process of verifying that the formulas used in datasets (such as calculations, aggregations, or derived fields) are correctly implemented and yield accurate results. It ensures that the logic behind the formulas is consistent, reliable, and free from errors, thus maintaining data accuracy.
Rule configurations
The rule configuration defines data validation logic using a formula expression
and an operator
. Operators like equals
, not equals
, greater than
, and others determine how data values are compared or evaluated.
Formula Expression A logical or mathematical expression that determines how data should be evaluated or manipulated.
Operator A comparison or logical symbol used to establish the relationship between values in the formula expression.
Equals
Checks if two values are identical.
Not Equals
Checks if two values are different.
Greater than
Checks if one value is larger than another.
Greater than or Equals
Checks if one value is larger than or equal to another.
Less than
Checks if one value is smaller than another.
Less than or Equals
Checks if one value is smaller than or equal to another.
Success criteria
The success criteria for a monotonic sequence is met when the sequence is consistently either non-increasing or non-decreasing (in the case of non-strict), or strictly increasing or decreasing (in the case of strict), without any reversal in direction.
- The success condition depends on how the
Formula Expression
andOperator
is configured. - The rule configuration checks if the sum of
Value 1
andValue 2
equals the expected result using the formula=Value 1 + Value 2
. The success criteria considers values less than 50% of the threshold as failures.
Configuration fields
-
Operator options
Greater than
Less than
Equal to
Between
(requires specifying a start and end range) -
Operator Defines the comparison operation (Greater Than, Less Than, Equal To, or Between).
-
Value The threshold value used for success criteria. Required for
Greater than
,Less than
, andEqual to
operators. -
Value range Required only when the
Between
operator is selected, specifying thestart
andend
range. -
Threshold type Indicates whether the
Value
orValue Range
to be considered as percentage or an absolute count.
Sample Input
ID | Value1 | Value2 | value1 + Value2 |
---|---|---|---|
1 | 10 | 5 | 15 |
2 | 20 | 3 | 53 |
3 | 15 | 7 | 22 |
4 | 30 | 12 | 22 |
5 | 25 | 8 | 43 |
Sample rule configuration
- Formula expression result=Value1+Value2
- Operator Equals
Sample success criteria configuration
- Operator Less than
- Value 50%
- Threshold type Absolute Count
Sample output
Column Name | Rule Name | Success Count | Failure Count | Within Threshold | Null Count |
---|---|---|---|---|---|
result | Formula Integrity Check | 2 | 3 | Yes | o |