Recommended Hadoop Settings
We recommend the following Hadoop settings:
• dfs.datanode.max.transfer.threads: 4096 or higher. Follow the Hadoop vendor recommendations, if higher.
• dfs.replication: Less than the number of VectorH nodes. As of 4.2.2, the [cbm] hdfs_replication configuration setting can be used instead.
If you want VectorH to integrate with YARN:
• ipc.client.connect.max.retries: 3
• ipc.client.max.retires.on.timeouts: 3
• yarn.nm.liveness-monitor.expiry-interval-ms: 10000
• yarn.client.nodemanager-connect-max-wait-ms: 50000
• yarn.client.nodemanager-connect-retry-interval-ms: 10000
• yarn.resourcemanager.system-metrics-publisher.enabled: false
• yarn.am.liveness-monitor.expiry-interval-ms: 10000
• yarn.scheduler.capacity.resource-calculator: org.apache.hadoop.yarn.util.resource.DominantResourceCalculator