BigData / Hadoop MapReduce
What is Hadoop MapReduce?
MapReduce is a parallel processing framework that processes big amounts of data in-parallel on large clusters of commodity hardware in a reliable, fault-tolerant manner.
MapReduce works on master-slave architecture and can process a large amount of data by dividing the task into subtask and running parallel on commodity hardware. MapReduce works on key-value pair as shown below and uses the Java serialization package.
<key, value="some-value">
More Related questions...