Setting Data Profile Preferences
Data Profile preferences let you specify whether to retain source data order in targets, where to store the AutoGen rule files, and how Field Data Discovery runs.
The AutoGen Rules File Location preference specifies the location for using or saving your manually created profiling rules. This option:
Enables you to reuse your profiling rules within other profiles.
Sets the location that browse dialogs start in when loading or saving AutoGen profiling rules.
Allows you to use your manually created profiling rules in the AutoGen Wizard dialog.
The AutoGen Rules File Location preference removes the need to recreate profiling rules and keeps your profiling rules in one place.
Note: DataConnect provides one AutoGen rules file, DefaultRules.rules.
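To check which rules files are available for reuse outside the Preferences dialog, the folder can be listed with a short script. This is a minimal sketch, not part of DataConnect: the helper name list_rules_files is hypothetical, and it assumes that <userdirectory> in the default location given in the procedure below resolves to the current user's home directory.

```python
from pathlib import Path


def list_rules_files(rules_dir: Path) -> list[str]:
    """Return the names of the .rules files in rules_dir, sorted by name."""
    if not rules_dir.is_dir():
        # The folder does not exist yet, so there is nothing to list.
        return []
    return sorted(p.name for p in rules_dir.glob("*.rules") if p.is_file())


# Assumption: <userdirectory> is the current user's home directory.
default_dir = Path.home() / "Actian" / "DataConnect" / "AutoGenRules"
print(list_rules_files(default_dir))
```

If a different folder is set in the AutoGen Rule Files Location preference, pass that path instead of default_dir.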
To set the Data Profile preferences:
1. Go to Options > Preferences.
The Preferences dialog is displayed.
2. In the left pane, expand DataConnect and click Data Profiler.
3. Select or clear the Retain Source Data Order in Targets checkbox.
When selected, this option retains the source data order in the four preconfigured targets, or output files: PASS_TARGET, FAIL_TARGET, DRILLDOWN_TARGET, and STATS_TARGET. When it is cleared, data is written to the targets in random order. The option is selected by default.
4. Specify the required AutoGen Rule file location and file:
AutoGen Rule Files Location - Click Browse and specify the location in which to store the AutoGen Rule files. The default location is <userdirectory>/Actian/DataConnect/AutoGenRules.
For information about general Browse button behavior and rules, see Browsing Files and Directories.
AutoGen Rules File - Click Browse to view your AutoGen Rules file.
5. Specify the Data Discovery preferences:
Data Discovery On - Select this option to enable Field Data Discovery. When enabled, data discovery runs automatically when a user connects to the source file during profile creation or opens an existing profile. If the option is cleared, users can trigger data discovery manually in the profile editor. To open the Data Discovery pane, go to the Rules tab, Fields View, and select a source Field Name.
Discovery Sample Size - Use this drop-down list to set how many source records are used for Field Data Discovery. This option is useful when the source data is very large: working with a sample improves performance while you design a profile and configure profile rules. The default is 10,000 records; the other choices are 1,000, 5,000, 25,000, and All Records.
6. Click Apply or Apply and Close to save the changes.
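The effect of Discovery Sample Size can be illustrated outside DataConnect. The sketch below is illustrative only and does not show how DataConnect itself selects records: it assumes the sample is simply the first N records of the source, which this guide does not state. The function name take_sample and the inline CSV data are hypothetical.

```python
import csv
import io
from itertools import islice
from typing import Optional


def take_sample(lines, sample_size: Optional[int]):
    """Return up to sample_size records; None stands in for "All Records"."""
    reader = csv.DictReader(lines)
    if sample_size is None:
        return list(reader)
    # Assumption: the sample is the first sample_size records of the source.
    return list(islice(reader, sample_size))


source = io.StringIO("id,name\n1,Ann\n2,Bob\n3,Cy\n")
sample = take_sample(source, 2)
print(len(sample))
```

Because only the sampled records are read, the work done at design time stays bounded no matter how large the source is. That is the trade-off this setting controls.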
Last modified date: 08/04/2024