Data Extraction Process
When you use Extract Editor to extract data from a text or report file, the goal is to define line styles and fields so that Extract Editor can collect the fields and assemble them into data records based on the contents of the report file.
The steps involved in the data extraction process are:
1. Create an extract file using a report file: Create a new extract file and specify the report file from which you want to extract the data.
2. View the report file contents in Extract Editor: Observe the overall pattern and format of the information and locate the data that you want to extract.
3. Set the configuration options (if required): Configure the display options and the extract design options.
4. Mark up the data to be extracted:
• Define line styles: Mark the lines from which you want to extract the text. When defining line styles, specify the recognition rules and the action for the line style.
• Define fields: Mark the data that must be collected from the file.
5. Assign data fields to Accept Records: To output the extracted fields, you must define at least one Accept Record. A line style with the Accept Record action is treated as an Accept Record. Define one or more line styles with the Accept Record action and assign fields to them as required. These fields are included in the output in a standard format (schema) that is recognized by the Actian Extract connector in the Map Editor.
6. Browse the data to verify the extract: After you have assigned the data fields to Accept Records, use the Browse Data feature to verify the extract. This feature uses the Actian Extract connector internally to show the data records.
7. Modify the Accept Record's field layout or reorder line styles (if required): If the fields or records shown in the Data Browser are not displayed as required, adjust the data field or line style definitions.
8. Generate a CXL script: After verifying the extract, save the .extract file as an Extract Editor Language file (CXL).
9. Use the CXL script in Map Editor: Use the generated CXL script with the Actian Extract connector in the Map Editor for transformations.
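The way line styles, fields, and Accept Records work together in steps 4 and 5 can be illustrated with a small conceptual sketch. The Python below is not CXL and does not reflect how Extract Editor is implemented. The report layout, the line style names, the use of regular expressions as recognition rules, and the "first matching line style wins" behavior are all assumptions made for illustration only.

```python
import re

# Hypothetical line styles: (name, recognition rule, action).
# Named groups in each rule stand in for the fields marked on that line.
# "collect" only stores field values; "accept" also emits a record.
LINE_STYLES = [
    ("customer_header", re.compile(r"^Customer:\s+(?P<customer>.+)$"), "collect"),
    ("order_line", re.compile(r"^\s+(?P<item>\w+)\s+(?P<qty>\d+)$"), "accept"),
]


def extract_records(lines):
    """Assemble records from report lines using the line styles above."""
    collected = {}  # most recent value seen for each field
    records = []
    for line in lines:
        for _name, rule, action in LINE_STYLES:
            match = rule.match(line)
            if match is None:
                continue
            collected.update(match.groupdict())
            if action == "accept":
                # An accept line outputs a snapshot of the collected fields.
                records.append(dict(collected))
            break  # assumption: the first matching line style wins
    return records


# Made-up sample report used only for this sketch.
REPORT = """\
Customer: Acme Corp
    widget   3
    gadget   12
Customer: Globex
    sprocket 7
"""

records = extract_records(REPORT.splitlines())
for record in records:
    print(record)
```

In this sketch the header line only collects the customer field, while each order line acts as the Accept Record and emits one output record that carries the most recently collected customer value along with its own fields.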
Last modified date: 10/22/2024