1. Vector in Hadoop Concepts : Apache Hadoop Overview
Share this page                  
Apache Hadoop Overview
Apache Hadoop is a framework for distributed storage and processing of very large data sets on a cluster of commodity machines. Hadoop splits files into large blocks and distributes them across nodes in a cluster. Various application libraries exist on Hadoop that allow users to write distributed applications that are able to take advantage of this data distribution to perform large scale data processing for different kinds of use cases.