MapReduce
mapreduce
is a programming technique which is suitable
for analyzing large data sets that otherwise cannot fit in your computer’s
memory. Using a datastore
to process the data in small
chunks, the technique is composed of a Map phase, which formats the data or
performs a precursory calculation, and a Reduce phase, which aggregates all
of the results from the Map phase. For more information, see Getting Started with MapReduce.
For information about using other products with
mapreduce
, see Speed Up and Deploy MapReduce Using Other Products.
Functions
Objects
KeyValueStore | Store key-value pairs for use with mapreduce |
ValueIterator | An iterator over intermediate values for use with mapreduce |
Topics
- Getting Started with MapReduce
Learn about the MapReduce programming technique and run an example calculation.
- Write a Map Function
Create a map function for use in a
mapreduce
algorithm. - Write a Reduce Function
Create a reduce function for use in a
mapreduce
algorithm. - Speed Up and Deploy MapReduce Using Other Products
Capabilities of other products to speed up and share
mapreduce
algorithms.
Troubleshooting
This example shows how to debug mapreduce
algorithms in
MATLAB®. Debugging enables you to follow the movement of data between the different
phases of mapreduce
execution and inspect the state of all
intermediate variables.