Getting Started > Getting Started > Installing VectorH
Was this helpful?
Installing VectorH
Installation Overview
You launch the VectorH installer on one DataNode of the Hadoop cluster. This node becomes the VectorH master node. The master node drives the setup of the VectorH slave nodes. The installation process installs VectorH, the Intel MPI Runtime, a Java Runtime Environment (JRE), and sets up the HDFS root and data locations.
Best Practice ā€” VectorH can be installed on a minimum of three DataNodes. We recommend, however, using at least five DataNodes (one of which is the VectorH master node).
By default, VectorH is installed on all DataNodes of the cluster. If you want to install it on a subset of DataNodes, you must use a two-step installation process:
1. Perform installation on the master node only.
2. Set up the slave nodes manually (see Set Up Slave Nodes Manually).
Step 2 can be repeated at any time to reconfigure which nodes are used as slaves.
Which DataNodes to Install VectorH On?
By default, VectorH is installed on all the active DataNodes in a Hadoop cluster, and their Fully Qualified Domain Names (FQDN) are automatically recorded in the slaves file in $II_SYSTEM/ingres/files/hdfs.
If you do not want to install VectorH on all the active DataNodes, follow this procedure:
To control the number of DataNodes used by VectorH
1. When you are prompted during the installation whether you want to set up the DataNodes, enter ā€œnā€ for no.
The install process stops.
2. Edit the slaves file in $II_SYSTEM/ingres/files/hdfs to list the DataNodes you want to use.
3. Source the environment, as described in How to Access the Instance on Linux.
4. Continue the installation by using the following command:
iisuhdfs datanodes
The slave nodes are set up and the installation is complete.
Last modified date: 01/26/2023