# MapReduce

![MapReduce](https://3266175528-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-LTtjxZz0mQ2dLXJi93S%2F-LV9GWZVZQoLngv2rZFm%2F-LV9Gc0QWLQQZtvMnKXS%2FScreen%20Shot%202019-01-01%20at%209.32.43%20AM.png?alt=media\&token=866f32e8-a046-4727-b562-e762a2d42337)

## Introduction

> "MapReduce is a software framework for processing (large) data sets in a distributed fashion over several machines. The core idea behind MapReduce is mapping your data set into a collection of `<key, value>` pairs and then reducing over all pairs with the same key."

[Diana MacLean for CS448G, 2011](https://hci.stanford.edu/courses/cs448g/a2/files/map_reduce_tutorial.pdf)
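
The map-then-reduce idea in the quote can be illustrated with a word count. The sketch below is a minimal, single-process illustration and not a distributed implementation: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen for this example, and the `shuffle` step stands in for the grouping by key that a real framework such as Hadoop would perform across machines.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group values by key (done by the framework in a real cluster)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values that share a key into one result."""
    return key, sum(values)


def word_count(documents):
    # Each document could be mapped on a different machine; here we
    # simply chain the per-document outputs together in one process.
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    docs = ["the quick brown fox", "the lazy dog", "The fox"]
    print(word_count(docs))
    # {'the': 3, 'quick': 1, 'brown': 1, 'fox': 2, 'lazy': 1, 'dog': 1}
```

Because each `map_phase` call depends only on its own document and each `reduce_phase` call depends only on one key, both phases could in principle run in parallel on separate machines, which is what makes the model suitable for distribution.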

### Use Cases

* Distributed sort
* Distributed search
* Web-link graph traversal
* Machine learning

### Slides

[slides - pdf](https://github.com/marilynwaldman/course/blob/master/Hadoop/05-MapReduce.pdf)

[slides - pptx](https://github.com/marilynwaldman/course/blob/master/Hadoop/05-MapReduce.pptx)
