Concept | Description |
---|---|
DataFlow technology | The framework on which workflows are built is called DataFlow. For more information about this technology and its associated terminology, see the DataFlow documentation. |
DataFlow workflow | You use the DataFlow Invoker to run a DataFlow workflow, also called a graph. |
Building a process | A DataFlow workflow is run as a step in a process. For more information, see Building Processes. |
Adding a license | To create workflows using the DataFlow Invoker, the integration platform license must include the Engine DataFlow feature. See Uploading License File. |
Using attachments | You can use a RushScript, Java class, or JSON file project attachment as the file to run. |
Using the DataFlow command line | The DataFlow Invoker works by calling the DataFlow command line tool. See the Running from Command Line topic in the DataFlow documentation. |
Property Name | Description |
---|---|
DataFlow Install Directory | Indicates the DataFlow installation location. By default, $(DR_LOCATION) is displayed. If this macro does not exist, the following error message is displayed: $(DR_LOCATION) macro is not defined. $(DR_LOCATION) macro should be defined with location of DataFlow install. Note: It is possible to have multiple DataFlow installations on the same system. In this case, you may need to use macro sets or overrides in the run-time configuration to set $(DR_LOCATION) to the appropriate DataFlow installation for your process. |
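
The $(DR_LOCATION) behavior described above can be illustrated with a small sketch. The integration platform performs its own macro substitution; the `expand_macros` function, the regular expression, and the `/opt/dataflow` path below are hypothetical and only mimic the documented outcome: a defined macro is substituted, and an undefined one produces the error message quoted in the table.

```python
import re

# Matches macro references of the form $(NAME). The allowed character set
# is an assumption made for this illustration.
MACRO_PATTERN = re.compile(r"\$\(([A-Za-z0-9_]+)\)")


def expand_macros(text, macros):
    """Replace each $(NAME) in text with macros[NAME].

    Raises ValueError with the documented message when a macro is missing.
    """
    def substitute(match):
        name = match.group(1)
        if name not in macros:
            raise ValueError(
                f"$({name}) macro is not defined. $({name}) macro should be "
                "defined with location of DataFlow install."
            )
        return macros[name]

    return MACRO_PATTERN.sub(substitute, text)


# Hypothetical macro set pointing at one of several DataFlow installations.
install_dir = expand_macros("$(DR_LOCATION)", {"DR_LOCATION": "/opt/dataflow"})
```

With several DataFlow installations on one system, the same idea applies: the macro set or override chosen at run time decides which value $(DR_LOCATION) takes.
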
Action | Description |
---|---|
Execute | Runs the specified DataFlow job using the supported properties. |
Property Name | Description |
---|---|
DataFlow Job File | DataFlow script or graph to execute. To specify multiple scripts or graphs, separate them with commas (,). To execute a Java class, specify the fully qualified name of the class, for example, com.example.MyCustomClass. You can also specify the class as an attachment. For details, see the Class Path property description. |
Run a JSON Graph | Enables or disables the execution of a JSON graph. The default value is Disable. Note: This property is not displayed if Execute a Java Class is enabled. |
Execute a Java Class | Enables or disables the execution of a Java DataRush class. The default value is Disable. Note: This property is not displayed if Run a JSON Graph is enabled. |
Working Directory | Specifies the base working directory. Job files are relative to this directory. Note: UNC paths are not supported. |
Class Path | Path to the project jar or directory. |
Java Arguments | Java Virtual Machine arguments. |
Character Set | Character set used when reading the script files to execute, for example, ASCII or UTF-8. The default value is UTF-8. Note: This property is not displayed if Execute a Java Class is enabled. |
Cluster Execution | DataFlow can run either locally or in a cluster. To run in a cluster, specify the location of the master host in one of two ways: 1. Specify the cluster using the URL format dr://host:port, where: • host is the host name or IP address of the server running the cluster manager. • port is the port number of the cluster manager. 2. For a cluster integrated with Hadoop, use the format yarn://host:port. |
Engine Configuration | Sets the engine configuration properties. You can provide a comma-separated list of values. Any property defined at execution overrides the embedded property. Example: parallelism=1. For more information, see Engine Configuration Settings in the DataFlow 8.0 documentation. Note: This property is not displayed if Execute a Java Class is enabled. |
Include Directories | Comma-separated list of folders containing scripts to include before job execution. When set, this property ensures that the JavaScript file can find all of its dependent scripts. Note: This property is not displayed if Run a JSON Graph or Execute a Java Class is enabled. |
JavaScript Environment Variables | Set a variable in the JavaScript environment. Specify the variables in the following format: variable1=value1[, variable2=value2] Note: This property is not displayed if Run a JSON Graph or Execute a Java Class is enabled. |
Import Macros | When enabled, automatically substitutes the macros embedded in your JavaScript file. The default value is Disable. Note: This property is not displayed if Run a JSON Graph or Execute a Java Class is enabled. |
Strict Mode | Sets the JavaScript strict checking mode to one of the following: • Disabled • Warning (default) • Error Note: This property is not displayed if Run a JSON Graph or Execute a Java Class is enabled. |
Properties Override File | Specifies the properties file containing the operator overrides. Note: This property is displayed only when Run a JSON Graph is enabled. |
Override Operator Properties | Comma-separated string (no spaces) of source and target properties to override in the JSON graph. Node names must not contain spaces. Note: This property is displayed only when Run a JSON Graph is enabled. Consider using this feature if the paths on the RushAnalytics development machine do not match the paths for the job files. When the paths do not match, you can either use Override Operator Properties to adjust the locations or manually edit the exported .dr file. Supported syntax: SourceNodeName.source=path/file and TargetNodeName.target=path/file. Example: SourceNodeName.source=C:\DR_SHARED_STORAGE\Input.txt, TargetNode1Name.target=C:\DR_SHARED_STORAGE\Output1.txt, TargetNode2Name.target=C:\DR_SHARED_STORAGE\Output2.txt Note: For formatting purposes, spaces are included after the commas in this example, but in actual use, no spaces are allowed. Tips: • Use the naming conventions for the sources and targets. • Add the source and target entries to the string in the order in which they appear in the JSON (.dr) file. • Do not use spaces in source and target names or paths. |
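
Two of the properties above, Cluster Execution and Override Operator Properties, take strings with a strict format. The following is a minimal sketch, not part of the product, of how such values could be checked or assembled before they are pasted into the step configuration. The function names, the host `manager01`, and the port `1099` are hypothetical, and restricting keys to the `.source` and `.target` suffixes is an assumption based on the syntax examples in the table.

```python
from urllib.parse import urlsplit


def check_cluster_url(url):
    """Return (scheme, host, port) for dr://host:port or yarn://host:port."""
    parts = urlsplit(url)
    if parts.scheme not in ("dr", "yarn"):
        raise ValueError(f"unsupported scheme in {url!r}; expected dr:// or yarn://")
    # parts.port itself raises ValueError when the port is not numeric.
    if not parts.hostname or parts.port is None:
        raise ValueError(f"{url!r} must include both a host and a port")
    return parts.scheme, parts.hostname, parts.port


def build_overrides(pairs):
    """Join (key, path) pairs into one comma-separated string with no spaces.

    Pairs should be passed in the order in which the nodes appear in the
    JSON (.dr) file, as the tips in the table recommend.
    """
    entries = []
    for key, path in pairs:
        node, dot, prop = key.rpartition(".")
        # Assumption: only NodeName.source and NodeName.target are accepted.
        if not dot or not node or prop not in ("source", "target"):
            raise ValueError(f"expected NodeName.source or NodeName.target, got {key!r}")
        if " " in key or " " in path:
            raise ValueError(f"spaces are not allowed in {key!r} or {path!r}")
        entries.append(f"{key}={path}")
    return ",".join(entries)


# Hypothetical cluster manager address.
cluster = check_cluster_url("dr://manager01:1099")

# The node names and paths are the ones used in the table's example.
overrides = build_overrides([
    ("SourceNodeName.source", r"C:\DR_SHARED_STORAGE\Input.txt"),
    ("TargetNode1Name.target", r"C:\DR_SHARED_STORAGE\Output1.txt"),
])
```

Building the override string from a list keeps the entry order explicit and makes it harder to introduce the stray space after a comma that the note in the table warns about.
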
Error Code | Name | Description | Possible Reason |
---|---|---|---|
33 | BADOPTIONVALUE | An invalid option value was used. | An invalid value is used for an option in the session properties. |
46 | LICENSING | A valid product license was not found. | A valid license is not available for this component and DataFlow is not available. |
50 | UNSPECIFIED | An unknown error occurred while loading or executing the component. | For details, see the process log file. |
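
When triaging failed runs, it can help to keep the codes from the table above in a small lookup. This is only a sketch: the codes, names, and descriptions come from the table, while the `describe_error` helper and the way a code reaches your own tooling (for example, from the process log) are assumptions.

```python
# Error codes, names, and descriptions as listed in the table above.
ERROR_CODES = {
    33: ("BADOPTIONVALUE", "An invalid option value was used."),
    46: ("LICENSING", "A valid product license was not found."),
    50: ("UNSPECIFIED",
         "An unknown error occurred while loading or executing the component."),
}


def describe_error(code):
    """Return a one-line summary for a code, or a fallback if it is not listed."""
    name, description = ERROR_CODES.get(
        code, ("UNLISTED", "Code not listed in the error table.")
    )
    return f"{code} {name}: {description}"


summary = describe_error(46)
```

For code 50 the table points to the process log file, so a lookup like this is a starting point rather than a diagnosis.
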