Rule Name | Description |
|---|---|
Changes the format of a string field from one format to another (applies to string fields which can be parsed as numbers/boolean types). | |
Changes the case of a string field to specified case (for example upper, lower, title). | |
Masks or encrypts data so that it cannot be linked to an individual, but can still be shared without risk of exposing sensitive personal information. | |
Calculates and modifies a Date or Timestamp field. | |
Removes specified characters or words from a string field. | |
Writes unique records to pass target and duplicate records to fail target using exact match. | |
Writes unique records to pass target and duplicate records to fail target based on fuzzy matching rules and match score filter. | |
Replaces a field value with a new value. | |
Changes numeric values by applying a rounding mode and decimal precision. | |
Converts one or more monetary formats in the source to a single specified format. | |
Converts values from one or more source date formats into a single specified date format. | |
Converts values from one or more source time formats into a single specified time format. | |
Converts one or more phone number formats in the source to a single specified format. | |
Converts one or more postal code formats in the source to a single specified format. | |
Converts US state values into a consistent format. | |
Trims whitespace from a String field. |
Rule Name | A default rule name (<FieldName>ChangeFormat) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, Group (String). |
Rule Type | The type of rule that is applied to the field. That is ChangeFormat (Function rule). Tip... A Red Cross in a rule icon (for example |
Format Type | Select the format type of the source data (this selection populates the options available in the From Format and To Format parameters): • Boolean • Double • Float • Integer • Long • Numeric • String |
From Format | Select source field format(s) to change from, or type into the field, then press Enter or click Add to apply it. Multiple formats can be added. To remove an added format click the Depending on the Format Type you have selected, you can perform the following operations: • Boolean: Replaces the different combinations of source values like yes.no, true.false, True.False, 1.0, etc., into a consistent set of values like True.False. • Double: Converts source values with a format like #.##, .00, .#, etc., into a consistent format like ###. For example, set From Format to #.# and To Format to #.00 to change the format from 50 to 50.00. See Class DecimalFormat. • Float: Converts source values with a format like #.##, .00, .#, etc., into a consistently formatted float value like ###. See Class DecimalFormat. • Integer: Converts source values with a format like #, -#, etc., into a consistently formatted integer value like #. See Class DecimalFormat. • Long: Converts source values with a format like #, -#, etc., into a consistently formatted long value like #. See Class DecimalFormat. • Numeric: Converts source values with a format like #.##, .00, .#, etc., into a consistently formatted numeric value like ###. See Class DecimalFormat. • String: Accepts a list of regex expressions like .+?(\d{4})$, (.*)@(.*) etc., and a replacement expression like $0. Use this option, for example, to retrieve only the last four digits of a value (such as a credit card, phone number or social security number) by setting From Format to .+?(\d{4})$ and To format to $1. Similarly, to change the order of first and last names (for example John Doe) set From Format to ([a-zA-Z]+)\s+([a-zA-Z]+) and To Format to $2 $1 (so that the output becomes Doe John). See Java Regular Expressions. |
To Format | Select the format to change the source to from the drop-down menu, or type into the field. Unlike the From Format, only one format is permitted in the To Format parameter. |
Locale | The language or country-based preferences for a user interface. Among other things, locales represent currency, date format, and numbers according to the protocols in the given region. You can choose from the available options. |
Rule Name | A default rule name (<FieldName>ChangeStringCase) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, Name (String). |
Rule Type | The type of rule that is applied to the field. That is ChangeStringCase (Function rule). |
Change To Case | The side from which to remove white spaces: • Camel Case - Converts string to camel case which is a programming naming convention. In this all words of a phrase are combined together to form one continuous word and each word except the first word is capitalized. • Kebab Case - Converts string to kebab case which is a way of writing phrases without spaces, where spaces are replaced with hyphens -, and the words are typically all lower case. This is a programming naming convention often used for variable names. This is a programming naming convention. • Lower Case - Converts all characters in the string to lowercase. • Pascal Case - Converts string to pascal case which is a programming naming convention. In this all words of a phrase are combined together to form one continuous word and each word is capitalized. • Snake Case - Converts string to snake case which is a way of writing phrases without spaces, where spaces are replaced with underscores _, and the words are typically all lower case. This is a programming naming convention. • Tile Case - Converts string to title case. The first letter of each word is capitalized except for certain small words like “a,” “the,” and “of.” • Upper Case - (default) Converts all characters in the string to uppercase. |
Rule Name | A default rule name (<FieldName>DeidentifyValue) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, Name (String). |
Rule Type | The type of rule that is applied to the field. That is DeidentifyValue (Function rule). |
Operation | Select how the data will change: • Mask - Select this option to replace the value irreversibly, then specify values for the Method, Location, N Value and Replacement Value parameters. • Encrypt - Select this option to encrypt the value. This option requires you to supply a 16-digit encryption key. The same key will be required in order to subsequently decrypt the value (if decryption is necessary). |
Method | This parameter is available when Mask is selected for the Operation. Select the action to perform on the data: • Replace N Characters: Replaces the number of characters specified in N Value. • Keep N Characters: Retains the number of characters specified in N Value. • Replace All: Replaces all characters in the field. When this option is selected, the Location and N Value fields are disabled and disregarded. |
Location | This parameter is available when Mask is selected for the Operation. Select which portion of the field to perform the selected Method on: • Start: Performs the selected Method on the beginning portion of the value. • Middle: Performs the selected Method on the center portion of the value. • End: Performs the selected Method on the end portion of the value. |
N Value | This parameter is available when Mask is selected for the Operation. Specify the number of characters to perform the selected Method on. The maximum value is 9. The default is 4. |
Replacement Value | This parameter is available when Mask is selected for the Operation. Select a replacement character, or enter a character of your choice. Predefined options are: • Asterisk • Lowercase x • Uppercase X • Hyphen (-) • Pound Sign (#) • At Sign (@) • Underscore (_) |
Key | This parameter is available when Encrypt is selected for the Operation. Enter a 16-digit encryption key. This is also the key to use to subsequently decrypt the value (if decryption is necessary). Right-click in the text box to access these options for macros: • Paste Macro: Allows you to copy/paste a macro from the Paste Macro dialog. See Pasting Macro in Map, Profile or Process. • Create Macro: Allows you to create a macro from the Create New Macro dialog. See Creating Macro using Create Macro Menu Option. Note: It is highly recommended that an encrypted macro be used for the key. An encrypted macro is more secure and can be shared with others without transmitting the key in plain text. |
Rule Name | A default rule name (<FieldName>ModifyDate) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, Expiration (Date). |
Rule Type | The type of rule that is applied to the field. That is ModifyDate (Function rule). Tip... A Red Cross in a rule icon (for example |
Time Unit | Select the unit of time for the calculation (only for the Timestamp data type): • Years • Months • Days • Hours • Minutes • Seconds • Milliseconds |
Amount to add/subtract | Enter the amount to add (positive number, for example, 3) or subtract (negative number, for example, -3). |
Rule Name | A default rule name (<FieldName>RemoveChars) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, Name (String). |
Rule Type | The type of rule that is applied to the field. That is RemoveChars (Function rule). Tip... A Red Cross in a rule icon (for example |
Select the character classes to remove | Tells what to remove from the source field value. You can choose one or multiple options. The options are: • Whitespace - Removes spaces. • Digits - Removes numeric characters. • Non Printable Characters - Removes non printable characters. Non printable characters are symbols that do not visibly appear when printed or displayed. Examples include newline (\n) for line breaks and tab (\t) for spacing. • Special Characters - Removes any special characters. Special characters are the punctuation characters on your keyboard, such as: ! @ # $ % ^ & * ( ) - _ = + \ | [ ] { } ; : / ? . > |
Remove Other | Choose from one of the following options: • <None> - Choose this when you want to skip specifying the Remove Other option. • Literal - Allows you to specify literal values to search for and remove. • Regular Expressions - Allows you to specify your own regular expression (regex pattern) to search for and remove. See Remarks. • Words - Allows you to specify a comma separated list of word numbers or a range of word numbers to remove. For example “1, 2-4,” will remove 1st, 2nd through 4th word. |
Rule Name | A default rule name (RemoveDuplicates) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, ZIP (String). |
Rule Type | The type of rule that is applied to the field. That is RemoveDuplicates (Test rule). Tip... A Red Cross in a rule icon (for example |
Dimension | (Optional) Select a dimension to associate the rule with. There is no limit to the number of rules a dimension can be associated with. A rule can be associated with a single dimension. A dimension represents a characteristic of data quality: • Accuracy - The data is correct. • Completeness - The data is present. • Consistency - The data uses the same format or pattern across different sources. • Timeliness - The data is recent and available. • Uniqueness - The data is not duplicated. • Validity - The data conforms to business rules and is within an acceptable range. Each dimension generates a Dimension Score for its associated rules. The score indicates the degree to which the data meets the characteristic. Scores can be viewed in the Statistics tab post profile execution. See Viewing Statistics. For more information about dimensions, see Add New Rule. For information about managing dimensions, see Managing Data Quality Dimensions. |
Weight | (Optional) Select the importance level of the rule. Values are 1-5, where 5 is the most important. The default value is 1. This value is reflected in the Data Quality Index (DQI) score and the Dimension Score (if the rule is associated with a dimension). For more information, see Add New Rule. |
Input Fields | List of available fields that can be used as key fields. |
Key Fields | You can move fields from Input Fields to Key Fields to define a key field. You can specify a combination of Fields to define a key field. Note: Key Fields cannot be selected as Sorting Fields and vice versa. |
Results File | Browse and select a csv file (only csv files are supported) or specify the path and name of a csv file that will be used as the cluster file. You can use macros for filenames. When the profile is executed, you can browse the cluster file in the Cluster Data tab by selecting the Remove Duplicates rule and clicking |
Field | Click the Field box and select a sorting field from the drop-down. |
Sort Order | Click the Sort Order box and select Ascending or Descending for each sorting field. |
Click | |
Select a rule and click | |
Select a rule and click | |
Select a disabled rule and click | |
Select an enabled rule and click | |
Click the up and down icons ( |
Select | This checkbox allows you to select records manually. By selecting the desired records and clicking the Generate Rules button, you will create Unique Record Rules. This feature is particularly helpful if you have few duplicates in your source data or prefer to have Data Profiler generate the rules for you. |
$$ClusterId | The unique ID that is assigned to each cluster. Using this, you can identify and understand how many clusters exist and how many records are in each cluster. The cluster IDs are not retained, so every time the profile is executed or the cluster file is generated, the order of the clusters and the associated IDs will change. |
$$pass | $$Pass gives you information about pass records, if it is blank then the record passed the rule, if marked as “X” then the record failed the rule. |
$$rule_id | This is an ID allocated to each Unique Record Rule. The rule_id is only displayed for the unique records identified from the Unique Record Rules. |
Rule Name | A default rule name (RemoveDuplicatesFuzzyMatching) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, ZIP (String). |
Rule Type | The type of rule that is applied to the field. That is RemoveDuplicatesFuzzyMatching (Test rule). Tip... A Red Cross in a rule icon (for example |
Dimension | (Optional) Select a dimension to associate the rule with. There is no limit to the number of rules a dimension can be associated with. A rule can be associated with a single dimension. A dimension represents a characteristic of data quality: • Accuracy - The data is correct. • Completeness - The data is present. • Consistency - The data uses the same format or pattern across different sources. • Timeliness - The data is recent and available. • Uniqueness - The data is not duplicated. • Validity - The data conforms to business rules and is within an acceptable range. Each dimension generates a Dimension Score for its associated rules. The score indicates the degree to which the data meets the characteristic. Scores can be viewed in the Statistics tab post profile execution. See Viewing Statistics. For more information about dimensions, see Add New Rule. For information about managing dimensions, see Managing Data Quality Dimensions. |
Weight | (Optional) Select the importance level of the rule. Values are 1-5, where 5 is the most important. The default value is 1. This value is reflected in the Data Quality Index (DQI) score and the Dimension Score (if the rule is associated with a dimension). For more information, see Add New Rule. |
Input Fields | List of available fields that can be used as key fields. |
Key Fields | You can move fields from Input Fields to Key Fields to define a key field. You can specify a combination of Fields to define a key field. Note: Key Fields cannot be selected as Sorting Fields and vice versa. |
Results File | Browse and select a csv file (only csv files are supported) or specify the path and name of a csv file that will be used as the cluster file. You can use macros for filenames. When the profile is executed, you can browse the cluster file in the Cluster Data tab by selecting the Remove Duplicates rule and clicking |
Field Name | Select a field whose values are compared with each other using the algorithm specified in Comparison Type. You can specify multiple matching fields and each matching field can be configured to use any of the provided algorithms. This list includes only string fields and those which are not used as Key Fields on the Grouping tab. Only fields within the cluster are compared with each other. |
Comparison Type | A list of supported string matching algorithms/functions. For more information, see Remarks. |
Weight | Weightage given to the rule which helps to calculate the final score. The default is 1 (same weightage). Allowed range is 1 to 100. If same value is assigned to all rules, they will have same weightage. For example, rule1 - 30, rule2 - 30, rule3 - 30 will have same weightage. |
Match Score Filter | Select a decimal values between 0.01 to 1. Comparison score is between 0 to 1. Records with comparison score less than this value are not considered a match for probable duplicates and are discarded. The higher the value you select, more strict is the matching. |
Field | Click the Field box and select a sorting field from the drop-down. |
Sort Order | Click the Sort Order box and select Ascending or Descending for each sorting field. |
Click | |
Select a rule and click | |
Select a rule and click | |
Select a disabled rule and click | |
Select an enabled rule and click | |
Click the up and down icons ( |
Select | This checkbox allows you to select records manually. By selecting the desired records and clicking the Generate Rules button, you will create Unique Record Rules. This feature is particularly helpful if you have few duplicates in your source data or prefer to have Data Profiler generate the rules for you. |
$$ClusterId | The unique ID that is assigned to each cluster. Using this, you can identify and understand how many clusters exist and how many records are in each cluster. The cluster IDs are not retained, so every time the profile is executed or the cluster file is generated, the order of the clusters and the associated IDs will change. |
$$pass | $$Pass gives you information about pass records, if it is blank then the record passed the rule, if marked as “X” then the record failed the rule. |
$$rule_id | This is an ID allocated to each Unique Record Rule. The rule_id is only displayed for the unique records identified from the Unique Record Rules. |







Rule Name | A default rule name (<FieldName><ReplaceValue>) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, ZIP (String). |
Rule Type | The type of rule that is applied to the field. That is ReplaceValue (Function rule). Tip... A Red Cross in a rule icon (for example |
Option | Description |
Replace with Constant | Replaces the field value with the specified Constant. • Constant - Enter the constant value with which to replace. |
Replace with Field | Replaces the field value with the specified alternate source Field value. • Field - Select the field from which to replace. |
Replace with Function Results | (Available for string fields) Replaces the field value with the selected function results. You can select one or both functions: • ChangeStringCase - Changes the case of a string field to specified case (for example upper, lower, title) before writing to target field. See ChangeStringCase for more information. • StringTrim - Trims whitespace from a String field. See StringTrim for more information. |
Replace with Lookup | Replaces a source field value with a value from an incore lookup which is stored in memory (RAM). See Replace with Lookup for more information. |
Replace with Regex | (Available for string fields) Replaces what matches the specified regex expression with the replacement value. See Replace with Regex for more information. |
Select Condition | (Optional) Select the condition that must be met for replacing a value. Options are: • CompareToConstant (see CompareToConstant) • CompareToField (see CompareToField) • FuzzyMatch (see IsNotBlank) • IsBlank - (available for string fields) - Replaces if the field value is blank. • MatchesRegex (see MatchesRegex) |
Parameter | Description |
Lookup Name | Click Add Tip... In the Lookup Configuration Wizard, connect to the lookup as with a source connector. When prompted to select the Lookup Key Field, select the lookup column for comparing against the Matching Key Field in the source or derived field. Click |
Lookup Key Field | The lookup field previously selected in the Lookup Configuration Wizard. This field contains the lookup values to compare against the Matching Key Field values. When the Lookup Key Field and the Matching Key Field values match, the Replacement Field value (from the lookup) is used. If no match is found, the field is replaced with a null or blank. |
Matching Key Field | Select the field in the source (or derived field) to match against the Lookup Key Field. When the Lookup Key Field and the Matching Key Field values match, the Replacement Field value is returned. |
Replacement Field | Select the field in the lookup that contains the replacement value. |
Select Condition | (Optional) Select the condition that must be met for replacing a value. Options are: • CompareToConstant (see CompareToConstant) • CompareToField (see CompareToField) • FuzzyMatch (see IsNotBlank) • IsBlank - (available for string fields) - Replaces if the field value is blank. • MatchesRegex (see MatchesRegex) |
Parameter | Description | |
Pattern | The regex pattern to search for. See Java Regular Expressions. | |
Replacement | The replacement value. | |
Occurrences | The field occurrences that you want to replace. For example “1, 3-4, 7”. The “3-4” (from 3rd through 4th) is a range. You can also specify “all”. If left blank, all instances are replaced. | |
Select Condition | (Optional) Select the condition that must be met for replacing a value. Options are: • CompareToConstant (see CompareToConstant) • CompareToField (see CompareToField) • FuzzyMatch (see IsNotBlank) • IsBlank - (available for string fields) - Replaces if the field value is blank. • MatchesRegex (see MatchesRegex) | |
Rule Name | A default rule name (<FieldName>Rounding) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, Balance (Numeric). |
Rule Type | The type of rule that is applied to the field. That is Rounding (Function rule). |
Decimal Precision | An integer value (0 or greater) indicating the total number of digits that will exist to the right of the decimal point in the final, rounded result. Note that Decimal Precision dictates the final number of decimal places, regardless of the input's original precision: • If the input has fewer decimal places than specified, the output will typically pad with zeros if the system handles the result as a fixed-point number (for example, Input 1.2 with DECIMAL_PRECISION = 4 might output 1.2000). • If the input is an integer and DECIMAL_PRECISION = 0 (for example, rounding to the nearest whole number), the output will also be an integer (for example, Input 49.8 with DECIMAL_PRECISION = 0 outputs 50). |
Rounding Mode | Specifies the method used for rounding numeric values. Tip... If negative values are used for input, see Negative Values and Rounding Behavior. • Half Up: Rounds to the nearest value. If exactly halfway, rounds away from zero. Examples (1 decimal place): 2.25 → 2.3, -2.25 → -2.3. See also: Example Financial Calculations (Currency Rounding), Example: Data Standardization (Reporting) • Half Down: Rounds to the nearest value. If exactly halfway, rounds toward zero. Examples (1 decimal place): 2.25 → 2.2, -2.25 → -2.2. • Up: Always rounds away from zero, regardless of the discarded digits. Examples (1 decimal place): 2.21 → 2.3, -2.21 → -2.3 • Down: Always rounds toward zero at the specified precision, truncating any extra digits without increasing the value. Examples (1 decimal place): 2.29 → 2.2, -2.29 → -2.2 • Ceiling: Rounds toward positive infinity. Examples (1 decimal place): 2.21 → 2.3, -2.21 → -2.2 • Floor: Round towards negative infinity. Examples (1 decimal place): 2.29 → 2.2, -2.21 → -2.3 • Half Even (Banker’s): Rounds to the nearest value. If exactly halfway, rounds to the nearest even digit. Examples (1 decimal place): 2.25 → 2.2 (since 2.2 is even in the last retained digit), 2.35 → 2.4, -2.25 → -2.2. • Unnecessary: No rounding is applied if the value conforms to the specified decimal precision. An error occurs if the value has more decimal places than the specified precision. Examples (1 decimal place): 2.20 → 2.2 (valid), 2.25 → error |
Mode | Behavior for Positive Input (e.g., 1.5) | Behavior for Negative Input (e.g., -1.5) | Key Principle |
Up | Rounds away from zero (2) | Rounds away from zero (-2) | Towards Infinity (Magnitude) |
Down | Rounds towards zero (1) | Rounds towards zero (-1) | Towards Zero |
Ceiling | Rounds towards positive infinity (2) | Rounds towards positive infinity (-1) | Towards Positive |
Floor | Rounds towards negative infinity (1) | Rounds towards negative infinity (-2) | Towards Negative Infinity |
Half Up | Rounds to nearest neighbor (2) | Rounds to nearest neighbor (-2) | Nearest Neighbor (Symmetrical) |
Half Down | Rounds to nearest neighbor (2) | Rounds to nearest neighbor (-1) | Nearest Neighbor (Symmetrical) |
Rule Name | A default rule name (<FieldName>StandardizeCurrency) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, Balance (String). |
Rule Type | The type of rule that is applied to the field. That is StandardizeCurrency (Function rule). Tip... A Red Cross in a rule icon (for example |
Convert all currency formats | Enable this option to automatically detect and attempt parsing of source values using all supported currency formats. Recognized values are output with the format specified in To Format. Unrecognized values are output unchanged. Selecting this option hides the Currency Identifiers option—allowing you to either auto-detect formats or specify the Currency Identifiers, but not both. |
Currency Identifiers | Select the currency formats to detect in the source from a region-based list of formats. Recognized values are output with the format specified in To Format. Unrecognized values are output unchanged. This option is hidden when Convert all currency formats is selected. A default value may be displayed. This default value is coming from the Data Profile preference settings and can be customized (see Setting Data Profile Preferences). If you do not wish to use the default, type a custom pattern or select one from the available list, and then press Enter or click Add to apply it. Multiple identifiers can be added. To remove an identifier click the Note: Currency format allows '$', '€', '£', '¥', '₹', '₽', '₺', '₴', '₦', '₩', '₪', '₡', '₵', '₸', '₾', '₿', '¢', '₫', '₭', '₮', '₱', '₲', '₶', '₷', '₻', '₼' and [A-Z]. Note: A currency identifier is a standardized code, symbol used to recognize, classify, and verify money, commonly known as ISO 4217 codes (e.g., USD, EUR, INR) for international exchange. |
To Format | Specify a format to apply to the currency in the output. Default is Symbol Prefix ($1,234.56). If you do not wish to use the default, type a custom pattern or select one from the available list, and then press Enter or click Add to apply it. A single format can be added. To remove an identifier click the Options are: • ISO Code Prefix (USD 1,234.56) • Numeric Only (1234.56) • Symbol Prefix ($1,234.56) • Symbol Suffix (1,234.56$) Note: Currency format allows '$', '€', '£', '¥', '₹', '₽', '₺', '₴', '₦', '₩', '₪', '₡', '₵', '₸', '₾', '₿', '¢', '₫', '₭', '₮', '₱', '₲', '₶', '₷', '₻', '₼' and [A-Z]. |
Decimal Places (0-4) | Specifies the number of decimal places for the output value. Enter a value from 0 to 4. The default is none (which preserves the original precision). |
Rule Name | The default rule name (<FieldName>_StandardizeDateFormat) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, PurchaseDate (Date). |
Rule Type | The type of rule that is applied to the field. That is StandardizeDateFormat (Function rule). |
Convert all date formats | When enabled, automatically attempts to parse source values using all supported date formats. Any unrecognized values are returned unchanged. Selecting this option hides the From Date Format option—allowing you to either auto-detect formats or specify the From Date Formats, but not both. |
From Date Formats | Allows to specify one or more date formats that your source data uses. Values matching any selected format will be converted. This option is hidden when Convert all date formats is selected. A default value may be displayed. This default value is coming from the Data Profile preference settings and can be customized (see Setting Data Profile Preferences). If you do not wish to use the default, type a From Date Format or select one from the available list, and then press Enter or click Add to apply it. Only one format can be added. To remove an added format click the Select the format to apply: • MM/dd/yyyy | strict (e.g., 01/15/2025) • M/d/yyyy | lenient (e.g., 1/5/2025 or 01/15/2025) • dd/MM/yyyy | strict (e.g., 15/01/2025) • d/M/yyyy | lenient (e.g., 5/1/2025 or 15/01/2025) • yyyy-MM-dd | ISO strict (e.g., 2025-01-15) • dd-MMM-yyyy | strict day (e.g., 15-Jan-2025) • d-MMM-yyyy | lenient day (e.g., 5-Jan-2025) • MMM dd, yyyy | strict day (e.g., Jan 15, 2025) • MMM d, yyyy | lenient day (e.g., Jan 5, 2025) • MMMM dd, yyyy | strict day (e.g., January 15, 2025) • MMMM d, yyyy | lenient day (e.g., January 5, 2025) • dd.MM.yyyy | strict (e.g., 15.01.2025) • yyyy/MM/dd | strict (e.g., 2025/01/15) • dd-MM-yyyy | strict (e.g., 15-01-2025) • d-M-yyyy | lenient (e.g., 5-1-2025) Note: Date format allows 'D', 'd', 'w', 'W', 'u', 'E', 'F','M', 'y', '-', '/', ' ', '.' |
To Date Format | Specifies the date format for converted date values, determining how parsed source data should be displayed or stored after transformation. If you do not wish to use the default, type a To Date Format or select one from the available list, and then press Enter or click Add to apply it. Multiple formats can not be added. To remove an added format click the Select the format to apply: • MM/dd/yyyy | (e.g., 01/15/2025) • dd/MM/yyyy | (e.g., 15/01/2025) • yyyy-MM-dd | ISO (e.g., 2025-01-15) • dd-MMM-yyyy | (e.g., 15-Jan-2025) • MMM dd, yyyy | (e.g., Jan 15, 2025) • MMMM dd, yyyy | (e.g., January 15, 2025) • dd.MM.yyyy | (e.g., 15.01.2025) • yyyy/MM/dd | (e.g., 2025/01/15) • M/d/yyyy | (e.g., 1/15/2025) • dd-MM-yyyy | (e.g., 15-01-2025) Note: Date format allows 'D', 'd', 'w', 'W', 'u', 'E', 'F','M', 'y', '-', '/', ' ', '.' |
Rule Name | The default rule name (<FieldName>_StandardizeTimeFormat) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, PurchaseTime (Time). |
Rule Type | The type of rule that is applied to the field. That is StandardizeTimeFormat (Function rule). |
Convert all time formats | When enabled, automatically attempts to parse source values using all supported time formats. Any unrecognized values are returned unchanged. Selecting this option hides the From Time Format option—allowing you to either auto-detect formats or specify the From Time Formats, but not both. |
From Time Formats | Allows to specify one or more time formats that your source data uses. Values matching any selected format will be converted. This option is hidden when Convert all time formats is selected. A default value may be displayed. This default value is coming from the Data Profile preference settings and can be customized (see Setting Data Profile Preferences). If you do not wish to use the default, type a From Time Format or select one from the available list, and then press Enter or click Add to apply it. Multiple formats can be added. To remove an added format click the Select the format to apply: • HH:mm:ss | 24hr strict (e.g., 14:30:45) • H:mm:ss | 24hr lenient (e.g., 9:30:45 or 14:30:45) • hh:mm:ssa | 12hr strict (e.g., 02:30:45 PM) • h:mm:ss a | 12hr lenient (e.g., 2:30:45 PM) • HH:mm | 24hr short strict (e.g., 14:30) • H:mm | 24hr short lenient (e.g., 9:30) • hh:mm a | 12hr short strict (e.g., 02:30 PM) • h:mm a | 12hr short lenient (e.g., 2:30 PM) • HH:mm:ss.SSS | millis strict (e.g., 14:30:45.123) • hh:mm:ss.SSS a | 12hr millis strict (e.g., 02:30:45.123 PM) • h:mm:ss.SSS a | 12hr millis lenient (e.g., 9:30:45.123 AM) • HH.mm.ss | Nordic strict (e.g., 14.30.45) Note: Time format allows 'H', 'h', 'm', 's', 'S', 'a', 'k', 'K', ':', '.', ' ' |
To Time Format | Specifies the time format for converted time values, determining how parsed source data should be displayed or stored after transformation. If you do not wish to use the default, type a To Time Format or select one from the available list, and then press Enter or click Add to apply it. Only one format can be added. To remove an added format click the Select the format to apply: • HH:mm:ss | 24hr (eg. 14:30:45) • hh:mm:ss a | 12hr (eg. 02:30:45 PM) • HH:mm | 24hr (eg. 14:30) • hh:mm a | 12hr (eg. 02:30 PM) • HH:mm:ss.SSS | millis (eg. 14:30:45.123) • HH.mm.ss | Nordic (eg. 14.30.45) Note: Time format allows 'H', 'h', 'm', 's', 'S', 'a', 'k', 'K', ':', '.', ' ' |
Rule Name | A default rule name (<FieldName>StandardizePhoneNumberFormat) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, Phone (String). |
Rule Type | The type of rule that is applied to the field. That is StandardizePhoneNumberFormat (Function rule). Tip... A Red Cross in a rule icon (for example |
Convert all phone formats | Enable this option to automatically detect and attempt parsing of source values using all supported phone number formats. Recognized values are output with the format specified in To Format. Unrecognized values are output unchanged. Selecting this option hides the From Phone Format option—allowing you to either auto-detect formats or specify the From Phone Formats, but not both. |
From Phone Formats | Select the phone number formats to detect in the source from a region-based list of formats. Recognized values are output with the format specified in To Format. Unrecognized values are output unchanged. This option is hidden when the Convert all phone formats option is selected. Default is United States - Fixed Line/ Mobile - (NNN) NNN-NNNN. If you do not wish to use the default, type a custom pattern or select one from the available list, and then press Enter or click Add to apply it. Multiple formats can be added. To remove an added format click the Note: Phone number format allows ‘N’, ‘+’, space, ‘-’, comma, ‘(’, and ‘)’. |
To Format | Specify a format to apply to the output. Default is National (e.g., (202) 555-1234). If you do not wish to use the default, type a custom pattern or select one from the available list, and then press Enter or click Add to apply it. Only one format can be added. To remove an added format click the Note: Phone number format allows ‘N’, ‘+’, space, ‘-’, comma, ‘(’, and ‘)’. Options are: • E.164 (e.g., +12025551234) - Applies the E164 International format often required by international communication platforms and APIs. • RFC 3966 (e.g., tel:+1-202-555-1234) - Applies the RFC3966 format. • National (e.g., (202) 555-1234) - Applies the national formatting standard, which does not include a country code in the results unless the country code is explicitly included in the custom format. • International (e.g., +1 202-555-1234) - Includes country code (if present in the input). • Plain National (e.g., 2025551234) - Excludes country code, used within a specific country. • Plain E.164 (Country code prepended e.g., 12025551234) - A globally unique phone number including country code, written as digits only |
Handle Extensions | Select what to do with phone extensions: • Preserve: Retains extension with the phone number (for example, 234-567-8900 x123 remains the same). • Remove: Removes extension from the phone number (for example, 234-567-8900 x123 is changed to 234-567-8900). • Separate: Encloses the extension with brackets (for example, 234-567-8900 x123 is changed to 234-567-8900 [ext:123]). |
Rule Name | A default rule name (<FieldName>StandardizePostalCodeFormat) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, Zip (String). |
Rule Type | The type of rule that is applied to the field. That is StandardizePostalCodeFormat (Function rule). Tip... A Red Cross in a rule icon (for example |
Convert all postal formats | Enable this option to automatically detect and attempt parsing of source values using all supported postal code formats. Recognized values are output with the format specified in To Postal Format. Unrecognized values are output unchanged. Selecting this option hides the From Postal Format option—allowing you to either auto-detect formats or specify the From Postal Formats, but not both. |
From Postal Formats | Select the postal code formats to detect in the source from a region-based list of formats. Detected values are output with the format specified in To Postal Format. Unrecognized values are output unchanged. This option is hidden when the Convert all postal formats option is selected. Default is NNNNN. If you do not wish to use the default, type or select one from the available list, and then press Enter or click Add to apply it. Multiple formats can be added. To remove an added format click the Note: Postal code format allows 'N', 'A', 'C', space, and '-'. |
To Postal Format | Specify a format to apply to the output. Select the postal code format to convert to from a region-based list of formats. Default is (AS) NNNNN. If you do not wish to use the default, type a postal code format or select one from the available list, and then press Enter or click Add to apply it. A single format can be added. To remove an added identifier click the Note: Postal code format allows 'N', 'A', 'C', space, and '-'. |
Add Leading Zeros | Select to insert leading zeros (0). The default is selected. This option is useful if an external system removed leading zeros from the source data, allowing replacement to conform to the standard 5-digit format. For example, if the input is "1234", the output will be changed to "01234". |
Rule Name | A default rule name (<FieldName>StandardizeState) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, State (String). |
Rule Type | The type of rule that is applied to the field. That is StandardizeState (Function rule). Tip... A Red Cross in a rule icon (for example |
To Format | Default is Abbreviation (e.g., CA) If you do not wish to use the default, select from the available list, and then press Enter or click Add to apply it. Only one format can be added. To remove an added format click the Select the format to apply: • Full Name (e.g., California) • Abbreviation (e.g., CA) • ISO Code (e.g., US-CA) |
Rule Name | A default rule name (<FieldName>_StringTrim) is provided and displayed here. However, you can edit or overwrite it. Click Reset to restore the default rule name. Note: The underscore (_) character is the only special character allowed in the name. Rule names cannot begin with a digit. If a field or column in the source data starts with a digit, 'r_' will be prepended to any rules created based on that field. |
Field Name | The field name to which the rule applies is displayed here, along with the data type in parentheses. For example, Address (String). |
Rule Type | The type of rule that is applied to the field. That is StringTrim (Function rule). |
Trim Side | The side from which to remove white spaces: • Both (default) • Left • Right |