The SQL Pattern Matching rule ensures values in a dataset match specified text patterns using SQL LIKE syntax.
This rule is commonly used to:
- Identify and validate values like email domains, phone number formats, or prefixes.
- Enforce standardized value formats within columns for consistency.
Example Usage:
Check if email addresses belong to Gmail (%@gmail.com) or verify if customer names start with ‘A’.
Configuration Fields
Rule-Specific Configuration
| Field Name | Description | Required | Data Type / Options |
|---|
| SQL Like Pattern | SQL LIKE pattern to apply for value matching (use % and _) | Yes | String |
Success Criteria Configuration
This section defines how the rule’s outcome is measured against expected thresholds.
| Field Name | Description | Required | Options / Format |
|---|
| Operator | Comparison operation for the result count | Yes | GreaterThan, LessThan, EqualTo, Between |
| Threshold Value | Value for comparison (for GreaterThan, LessThan, EqualTo) | Conditional | Number |
| Threshold Min | Minimum value (for Between operator) | Conditional | Number |
| Threshold Max | Maximum value (for Between operator) | Conditional | Number |
| Is Percentage | Whether the threshold represents a percentage or an absolute count | No | true / false (default: false) |
| Allow Nulls | Whether null values are considered valid | No | true / false (default: false) |
| Check For Match | When false, validates for negation of the condition | No | true / false (default: true) |
Sample Configurations
Example 1: Email Domain Check
| Configuration Field | Value |
|---|
| Column | Email |
| SQL Like Pattern | %@gmail.com |
| Operator | GreaterThan |
| Threshold Value | 50 |
| Is Percentage | true |
| Allow Nulls | false |
| Check For Match | true |
Explanation:
Validates that more than 50% of emails belong to Gmail.
Example 2: Name Prefix Check
| Configuration Field | Value |
|---|
| Column | Name |
| SQL Like Pattern | A% |
| Operator | GreaterThan |
| Threshold Value | 0 |
| Is Percentage | false |
| Allow Nulls | false |
| Check For Match | true |
Explanation:
Validates that at least one name starts with ‘A’.
Sample Output
| Column Name | Rule Name | Success Count | Failure Count | Null Count | Within Threshold |
|---|
| Email | SQL Pattern Matching | 3 | 2 | 0 | Yes |
| Name | SQL Pattern Matching | 1 | 4 | 0 | Yes |