How is intermediate data organized in MapReduce?

Posted by Pedro Cattori on Programmers See other posts from Programmers or by Pedro Cattori
Published on 2014-02-06T04:12:16Z Indexed on 2014/06/07 3:45 UTC
Read the original article Hit count: 339

Filed under:

design

|

functional-programming

From what I understand, each mapper outputs an intermediate file. The intermediate data (data contained in each intermediate file) is then sorted by key.

Then, a reducer is assigned a key by the master. The reducer reads from the intermediate file containing the key and then calls reduce using the data it has read.

But in detail, how is the intermediate data organized? Can a data corresponding to a key be held in multiple intermediate files? What happens when there is too much data corresponding to one key to be held by a single file?

In short, how do intermediate partitions differ from intermediate files and how are these differences dealt with in the implementation?

© Programmers or respective owner

Related posts about design

What is difference between Interaction design, Visual Design, Web design, UX design, UI design, UI d

as seen on Stack Overflow - Search for 'Stack Overflow'
What is difference between Interaction design, Visual Design, Web design, UX design, UI design, UI development? BTB, link found below answered for UI Vs UX. http://stackoverflow.com/questions/1334496/difference-between-ui-and-ux >>> More
The Incremental Architect’s Napkin - #5 - Design functions for extensibility and readability

as seen on Geeks with Blogs - Search for 'Geeks with Blogs'
Originally posted on: http://geekswithblogs.net/theArchitectsNapkin/archive/2014/08/24/the-incremental-architectrsquos-napkin---5---design-functions-for.aspx The functionality of programs is entered via Entry Points. So what we´re talking about when designing software is a bunch of functions handling… >>> More
Logo Design Online: Process Of Hiring And Working With An Online Logo Design Company

as seen on Article City - Search for 'Article City'
Hiring an online logo design company is not easy these days; you will find hundreds of logo design companies on just click of a mouse. But it';s up to your decision making skills how you select a best... [Author: Gisselle Gloria - Web Design and Development - October 05, 2009] >>> More
OO Design - polymorphism - how to design for handing streams of different file types

as seen on Stack Overflow - Search for 'Stack Overflow'
I've little experience with advanced OO practices, and I want to design this properly as an exercise. I'm thinking of implementing the following, and I'm asking if I'm going about this the right way. I have a class PImage that holds the raw data and some information I need for an image file. Its… >>> More
Is there any guidelines to convert Table design to Div design keeping same cross browser compatible

as seen on Stack Overflow - Search for 'Stack Overflow'
Is there any guidelines to convert Table design to Div design keeping same cross browser compatible layout? >>> More

Related posts about functional-programming

Introducing functional programming constructs in non-functional programming languages

as seen on Programmers - Search for 'Programmers'
This question has been going through my mind quite a lot lately and since I haven't found a convincing answer to it I would like to know if other users of this site have thought about it as well. In the recent years, even though OOP is still the most popular programming paradigm, functional programming… >>> More
Functional programming constructs in non-functional programming languages

as seen on Programmers - Search for 'Programmers'
This question has been going through my mind quite a lot lately and since I haven't found a convincing answer to it I would like to know if other users of this site have thought about it as well. In the recent years, even though OOP is still the most popular programming paradigm, functional programming… >>> More
Does functional programming mandate new naming conventions?

as seen on Stack Overflow - Search for 'Stack Overflow'
I recently started studying functional programming using Haskell and came upon this article on the official Haskell wiki: How to read Haskell. The article claims that short variable names such as x, xs, and f are fitting for Haskell code, because of conciseness and abstraction. In essence, it claims… >>> More
pitfalls/disadvantages of functional programming

as seen on Stack Overflow - Search for 'Stack Overflow'
When would you NOT want to use functional programming? What is it not so good at? I am more looking for disadvantages of the paradigm as a whole, not things like "not widely used", or "no good debugger available". Those answers may be correct as of now, but they deal with FP being a new concept (an… >>> More
Should functional programming be taught before imperative programming?

as seen on Stack Overflow - Search for 'Stack Overflow'
It seems to me that functional programming is a great thing. It eliminates state and makes it much easier to automatically make code run in parallel. Many programmers who were first taught imperative programming styles find it very difficult to learn functional programming, because it is so different… >>> More